Spaces: Running on Zero
Commit History
Added topic validation and LLM deduplication functionality. Now checks the input token count before LLM calls.
8af499b
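The token-count check described in the commit above might look something like the following sketch; it is a hypothetical illustration, not the repository's actual code, and the tiktoken encoding, limit, and function name are all assumptions.

```python
# Hypothetical sketch of checking the input token count before an LLM call.
# The tokenizer choice, limit, and names are assumptions, not the app's real code.
import tiktoken

MAX_INPUT_TOKENS = 8000  # assumed limit

def check_token_count(prompt: str, model: str = "gpt-4o") -> int:
    """Count prompt tokens and raise if the request would exceed the limit."""
    encoding = tiktoken.encoding_for_model(model)
    n_tokens = len(encoding.encode(prompt))
    if n_tokens > MAX_INPUT_TOKENS:
        raise ValueError(
            f"Prompt has {n_tokens} tokens, above the {MAX_INPUT_TOKENS}-token limit"
        )
    return n_tokens
```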
Added model compatibility for OpenAI and Azure endpoints. Added some Bedrock models, now compatible with thinking models
3085585
Changed config defaults to enable successful app run without local model inference on first run
5fed34d
Added deduplication with LLM functionality. Minor package updates. Updated installation documentation.
6f3d42c
Added examples for structured summaries and groups. Adapted functions for structured summaries. Simplified front tab GUI
5ed844b
Optimised prompts. Updated Gradio. Added example for zero-shot topics. Added support for Granite 4 local model
9e8c029
No longer apply str.capitalize to summaries
3ee11fd
Corrected blank KV_QUANT_LEVEL values. Removed erroneous extract topics output
7ae3b47
Corrected KV quant value when not specified (set to None)
4cd2443
Corrected KV quantisation definition in config. Moved examples to top of screen under intro.
73188ff
Generally improved inference for low-VRAM systems, improved Unsloth usage, updated packages, switched default local model to Qwen 3 4B
bd1a015
Updated output logging files. Updated examples and readme. Minor prompt adjustments and package updates
fff212b
Corrected reasoning setting
4998b3c
Revised intro and readme. Reasoning suffix setting simplified. All in one xlsx returns correct tables.
8161b79
Tracking model state through all_in_one_function
1a1a845
Corrected usage of stop strings, streaming
6154c1e
Added stop strings, optimised llama-cpp-python inference for streaming
6eaced0
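A minimal sketch of how streaming with stop strings typically looks in llama-cpp-python is shown below; the model path, context size, and stop tokens are placeholders, not the settings used in this Space.

```python
# Minimal sketch of streamed generation with stop strings in llama-cpp-python.
# The model path and stop strings are placeholders.
from llama_cpp import Llama

llm = Llama(model_path="model.gguf", n_ctx=4096)

def stream_completion(prompt: str):
    """Yield the partial output as tokens arrive; generation halts at a stop string."""
    output = ""
    for chunk in llm.create_completion(
        prompt,
        max_tokens=512,
        stop=["<|end|>", "###"],  # assumed stop strings
        stream=True,
    ):
        output += chunk["choices"][0]["text"]
        yield output  # incremental yield, e.g. for a Gradio textbox
```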
Trying out inference with unsloth vs transformers
4d01a46
Added 'all-in-one' function. Corrected local model load when not loaded initially. Environment variables for max data rows and topics.
d6ff533
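Reading the row and topic limits from environment variables could be as simple as the sketch below; the variable names and defaults are assumptions, not the repository's actual configuration.

```python
# Hypothetical environment-variable limits; names and defaults are assumptions.
import os

MAX_DATA_ROWS = int(os.environ.get("MAX_DATA_ROWS", "5000"))
MAX_TOPICS = int(os.environ.get("MAX_TOPICS", "120"))
```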
Added speculative decoding to transformers calls for gemma 3
7cadb40
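In the transformers library, speculative decoding is exposed as assisted generation via the assistant_model argument to generate(); the sketch below shows the general pattern with placeholder model ids, and the specific main/draft pairing for Gemma 3 is an assumption.

```python
# Sketch of assisted (speculative) decoding with transformers' generate().
# Model ids are placeholders; the main/draft pairing is an assumption.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

main_id = "main-model-id"    # placeholder for the Gemma 3 checkpoint
draft_id = "draft-model-id"  # placeholder for a smaller draft model sharing the tokenizer

tokenizer = AutoTokenizer.from_pretrained(main_id)
model = AutoModelForCausalLM.from_pretrained(main_id, torch_dtype=torch.bfloat16, device_map="auto")
assistant = AutoModelForCausalLM.from_pretrained(draft_id, torch_dtype=torch.bfloat16, device_map="auto")

inputs = tokenizer("Summarise the following topics:", return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, assistant_model=assistant, max_new_tokens=256)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```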
It should now be possible to load the local model globally at the start to avoid repeated loading throughout the stages of topic extraction
fd02514
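A load-once, module-level pattern like the sketch below is one common way to achieve this; the function and variable names are illustrative, not the repository's identifiers.

```python
# Illustrative load-once pattern for the local model; names are not the repo's own.
from llama_cpp import Llama

_MODEL = None
MODEL_PATH = "model.gguf"  # placeholder

def get_local_model() -> Llama:
    """Load the local model on first use and reuse it across topic-extraction stages."""
    global _MODEL
    if _MODEL is None:
        _MODEL = Llama(model_path=MODEL_PATH, n_ctx=4096)
    return _MODEL
```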
Added framework of support for Azure models (although untested)
138286c
Corrected misplaced example file reference. Added AWS Nova models to model list
5fa40a6
Minor corrections to transformers inference
6797022
Further optimisations to transformers inference
0895a36
Enabled GPU-based local model inference with the transformers package
72d517c
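GPU-backed inference with the transformers package generally follows the pattern below; the model id is a placeholder and the dtype/device settings are assumptions rather than this Space's exact configuration.

```python
# Generic GPU inference pattern with transformers; model id and settings are placeholders.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "local-model-id"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # half precision to reduce GPU memory use
    device_map="auto",          # place weights on the available GPU(s)
)

inputs = tokenizer("Extract topics from the following text:", return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```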
Added possibility of adding examples quickly to the input files
8c54223
Enhanced app functionality by adding new UI elements for summary file management, put in Bedrock model toggle, and refined logging messages. Updated Dockerfile and requirements for better compatibility and added install guide to readme. Removed deprecated code and unnecessary comments.
ba1a951
Removed unnecessary print statements
6da6ac6
Enhanced app functionality by adding new logging variables, refining file input options, and updating prompts for better user experience. Updated Dockerfile for improved environment setup and adjusted requirements for compatibility. Removed unnecessary print statements and added error handling in data loading functions.
714810a
Corrected full_prompt save
d25e491
Minor fixes for Gemini, model calls. Updated Dockerfile for non-GPU systems
8ec0f3d
Merge branch 'dev' into gpt-oss-compat
7497349