Commit History

Minor prompt improvements. Bedrock client and keys should now be correctly passed to validation function
70cb346

seanpedrickcase commited on

Added topic validation and llm deduplication functionality. Now checks for input token count before LLM calls.
8af499b

seanpedrickcase commited on

Added model compatibility for OpenAI and Azure endpoints. Added some Bedrock models, now compatible with thinking models
3085585

seanpedrickcase commited on

Changed config defaults to enable successful app run without local model inference on first run
5fed34d

seanpedrickcase commited on

Added deduplication with LLM functionality. Minor package updates. Updated installation documentation.
6f3d42c

seanpedrickcase commited on

Added examples for structured summaries and groups. Adapted functions for structured summaries. Simplified front tab GUI
5ed844b

seanpedrickcase commited on

Optimised prompts. Updated Gradio. Added example for zero shot topics. Added support for Granite 4 local model
9e8c029

seanpedrickcase commited on

Corrected blank KV_QUANT_LEVEL values. Removed erroneous extract topics output
7ae3b47

seanpedrickcase commited on

Corrected KV quant value when not specified (set to None)
4cd2443

seanpedrickcase commited on

Corrected KV quantisation definition in config. Moved examples to top of screen under intro.
73188ff

seanpedrickcase commited on

Generally improved inference for low vram systems, unsloth usage improvements, updated packages, switched default local model to Qwen 3 4b
bd1a015

seanpedrickcase commited on

Updated output logging files. Updated examples and readme. Minor prompt adjustments and package updates
fff212b

seanpedrickcase commited on

Revised intro and readme. Reasoning suffix setting simplified. All in one xlsx returns correct tables.
8161b79

seanpedrickcase commited on

tracking model state through all_in_one_function
1a1a845

seanpedrickcase commited on

Corrected usage of stop strings, streaming
6154c1e

seanpedrickcase commited on

Added stop strings, optimised llama-cpp-python inference for streaming
6eaced0

seanpedrickcase commited on

Trying out inference with unsloth vs transformers
4d01a46

seanpedrickcase commited on

Added 'all-in-one' function. Corrected local model load when not loaded initially. Environment variables for max data rows and topics.
d6ff533

seanpedrickcase commited on

Added speculative decoding to transformers calls for gemma 3
7cadb40

seanpedrickcase commited on

It should now be possible to load the local model globally at the start to avoid repeated loading throughout the stages of topic extraction
fd02514

seanpedrickcase commited on

Added framework of support for Azure models (although untested)
138286c

seanpedrickcase commited on

Corrected misplaced example file reference. Added AWS Nova models to model list
5fa40a6

seanpedrickcase commited on

Minor corrections to transformers inference
6797022

seanpedrickcase commited on

Further optimisations to transformers inference
0895a36

seanpedrickcase commited on

Enabled GPU-based local model inference with the transformers package
72d517c

seanpedrickcase commited on

Added possibility of adding examples quickly to the input files
8c54223

seanpedrickcase commited on

Enhanced app functionality by adding new UI elements for summary file management, put in Bedrock model toggle, and refined logging messages. Updated Dockerfile and requirements for better compatibility and added install guide to readme. Removed deprecated code and unnecessary comments.
ba1a951

seanpedrickcase commited on

Enhanced app functionality by adding new logging variables, refining file input options, and updating prompts for better user experience. Updated Dockerfile for improved environment setup and adjusted requirements for compatibility. Removed unnecessary print statements and added error handling in data loading functions.
714810a

seanpedrickcase commited on

Minor fixes for Gemini, model calls. Updated Dockerfile for non-GPU systems
8ec0f3d

seanpedrickcase commited on

Merge branch 'dev' into gpt-oss-compat
7497349
unverified

Sean Pedrick-Case commited on

GPT-OSS 20b should now work correctly
12c4a40

Sonnyjim commited on

Added GPT-OSS 20b support. Moved to Llama cpp python chat_completion function
c61bb70

seanpedrickcase commited on

Added DynamoDB logging. Optimised topics lists with descriptions
ef5252d

seanpedrickcase commited on

Added cost code logging functionality. More customised logging output, can now save logs to DynamoDB.
9bd035b

seanpedrickcase commited on

Updated app to work correctly in API mode. Updated xlsx outputs. Some package updates
1803cc3

seanpedrickcase commited on

Allowed for save to xlsx file. Some package updates
11004c5

seanpedrickcase commited on

You can now create structured summaries by group by changing a setting under 'I have my own list of topics'
4753a16

seanpedrickcase commited on

Corrected system prompt usage. Upgraded Bedrock to converse API. Replaced Sonnet 3 with Sonnet 3.7. Updated prompts with response prefill and more specific system prompt for tables.
e66ea5f

seanpedrickcase commited on

Further improved grouping implementation, improved summarisation prompts
92003de

seanpedrickcase commited on

Corrected LLM_MAX_GPU_LAYERS variable assignment
b897713

seanpedrickcase commited on

Updated requirements and config to renable Gemma 2 2B as the default for zero GPU space compatibility
3a1c74c

seanpedrickcase commited on

Fixed some issues with output folder references. Minor changes to column names, summary output
ad90f21

seanpedrickcase commited on

Changed default local model to Gemma 1B
0ce2848

seanpedrickcase commited on

Code reorganisation to better use config files. Adapted code to use Gemma 3 as local model. Minor package updates
0c0a08a

seanpedrickcase commited on