AI & ML interests

The next generation of large language models, optimized for strong reasoning, multi-task knowledge, and multilingual support.

lamhieu 
posted an update 23 days ago
🔍 AI is redefining the search experience.
From the limits of keyword-based search to the power of NLP, conversational AI, and personalization: search is no longer just typing and filtering, but understanding intent.
This slide deck shares key insights, real-world use cases, and recommendations for e-commerce and retail businesses, especially in food & FMCG:
- 📈 Higher conversions
- 🛒 Better shopping experience
- 🚀 Stronger, future-ready brand positioning

Check it out here:
👉 https://docs.google.com/presentation/d/1_Ix_jmK-kbOV_Ayi5SNbzhfwpmpenDmtBub9ucR2xGI/edit?usp=sharing

💬 Happy to discuss if you're exploring AI-powered search or applying AI in your business!
lamhieu 
posted an update 5 months ago
Power Up RAG, Virtual Assistants, and Perplexity Alternatives! 🚀

🔗 Docsifer + Lightweight Embeddings API = The perfect duo for next-gen solutions!

- 📄 Docsifer: Seamlessly convert PDFs, Word, JSON, and URLs to Markdown—ideal for building clean, structured knowledge bases.
- ✨ Lightweight Embeddings API: Create multilingual and multimodal embeddings for fast, accurate search, reranking, and understanding.

🤖 Build smarter RAG pipelines, enhance virtual assistants, or craft powerful Perplexity-like applications with this free, production-ready combo.
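As a rough illustration of the retrieval step such a pipeline relies on, here is a minimal sketch in Python. It assumes the documents have already been converted to Markdown, and it does not call Docsifer or the Lightweight Embeddings API: `toy_embed` is a hypothetical bag-of-words stand-in for a real embedding call, and every function name and the vocabulary are illustrative assumptions.

```python
import math
import re


def split_markdown_sections(markdown: str) -> list[str]:
    """Split a Markdown document into chunks, starting a new chunk at each heading."""
    chunks, current = [], []
    for line in markdown.splitlines():
        if re.match(r"#{1,6} ", line) and current:
            chunks.append("\n".join(current).strip())
            current = []
        current.append(line)
    if current:
        chunks.append("\n".join(current).strip())
    return [chunk for chunk in chunks if chunk]


def toy_embed(text: str, vocabulary: list[str]) -> list[float]:
    """Hypothetical stand-in for an embedding call: counts of vocabulary terms."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    return [float(words.count(term)) for term in vocabulary]


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_chunks(query: str, chunks: list[str], vocabulary: list[str]) -> list[tuple[float, str]]:
    """Return (score, chunk) pairs sorted from most to least similar to the query."""
    query_vector = toy_embed(query, vocabulary)
    scored = [(cosine_similarity(query_vector, toy_embed(c, vocabulary)), c) for c in chunks]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)
```

In a real setup, `toy_embed` would be replaced by a call to an embedding service, and the top-ranked chunks would be passed to the language model as context.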

👉 Start optimizing today:
- Docsifer: lamhieu/docsifer
- Lightweight Embeddings API: lamhieu/lightweight-embeddings

💡 Faster insights. Better recommendations. Global reach. 🚀
lamhieu 
posted an update 5 months ago
🚀 Docsifer: Convert Anything to Markdown! 📝

Transform your files into Markdown with Docsifer—your all-in-one tool for diverse formats like PDF, Word, Excel, JSON, HTML, CSV, ZIP, and even audio/images. Supports URL-to-Markdown too! 🔗✨

🌟 Why Docsifer?
- Multi-Format: Convert virtually any document type.
- Flexible & Accurate: Powered by MarkItDown and optional LLMs for advanced text extraction.
- Privacy-First: No data storage—only minimal anonymous stats.
- Open Source: Transparent and community-driven.
- Production-Ready: Docker, API, and interactive playground on Hugging Face Spaces.

👉 Try it out or contribute:
🌐 Hugging Face: lamhieu/docsifer
💻 GitHub: https://github.com/lh0x00/docsifer

Convert smarter. Collaborate better. Start now! 🚀
lamhieu 
posted an update 5 months ago
Unlock seamless document conversion with Docsifer, powered by MarkItDown at its core! 🚀 Effortlessly transform PDFs, Word, Excel, images, audio, HTML, and more into clean, structured Markdown—perfect for developers, writers, and content creators. With optional LLM-enhanced extraction and robust format support, Docsifer ensures accuracy, speed, and privacy.
🌟 Try it now and experience professional-grade Markdown conversion: lamhieu/docsifer
lamhieu 
posted an update 6 months ago
🚀 Unlock the power of a completely free, unlimited multilingual API!
🌐 The Lightweight Embeddings API offers state-of-the-art text and image embeddings, advanced reranking, and seamless support for over 100 languages — no limits, no restrictions.
🌟 Try it now: lamhieu/lightweight-embeddings
lamhieu 
posted an update 10 months ago
🎯 Ghost 8B Beta 1608: Empowering Your AI Assistant
📦 Unlock the Power of Ghost 8B Beta 1608: Build Your Personal AI Companion
Ghost 8B Beta 1608 lets you build a safe, multilingual AI assistant tailored to your needs, running directly on your personal computer. 🧑‍💻 Use AI's capabilities within your own space! 🚀 Ghost 8B Beta 1608 is ready to become your AI companion.
lamhieu/ghost-8b-beta-8k
ghost-x/ghost-8b-beta-668ead6179f93be717db4542
lamhieu 
posted an update 10 months ago
🚀 We’re excited to launch Ghost 8B Beta (1608), a top-performing language model with unmatched multilingual support and cost efficiency.

Key Highlights:
- Superior Performance: Outperforms Llama 3.1 8B Instruct, GPT-3.5 Turbo, Claude 3 Opus, GPT-4, and more in winrate scores.
- Expanded Language Support: Now supports 16 languages, including English, Vietnamese, Spanish, Chinese, and more.
- Enhanced Capabilities: Improved math, reasoning, and instruction-following for better task handling.

With two context options (8k and 128k), Ghost 8B Beta is perfect for complex, multilingual applications, balancing power and cost-effectiveness.

🔗 Learn More: https://ghost-x.org/docs/models/ghost-8b-beta
ghost-x/ghost-8b-beta-668ead6179f93be717db4542
lamhieu 
posted an update 11 months ago
🎉 Ghost 8B Beta Released: Game-Changing Language Model
--
Ghost 8B Beta is a language model developed with a clear vision: to deliver exceptional multilingual support and superior knowledge capabilities while remaining cost-effective. The model comes in two context-length variants, 8k and 128k, giving flexibility across tasks. Multilingual functionality is built in, making it a useful tool for global communication and understanding.
--
* See detailed article: https://huggingface.co/blog/lamhieu/ghost-8b-beta-released-game-changing-language-mode
* Model card: ghost-x/ghost-8b-beta
* Official website: https://ghost-x.org/docs/models/ghost-8b-beta
lamhieu 
posted an update 11 months ago
🤯 Ghost 8B Beta emerges as a clear leader, surpassing well-known models such as xAI Grok 1, OpenAI GPT 3.5, and Mistral Mixtral 8x7B. It also reaches parity with Mistral Medium, which places it among top-tier language models. In addition, Ghost 8B Beta is one of only three models evaluated with the zero-shot method, alongside Claude 2 and Claude 3.
---
💬 Chat with the model here:
- Playground with Ghost 8B Beta (β, 8k): lamhieu/ghost-8b-beta-8k
- Playground with Ghost 8B Beta (β, 128k): lamhieu/ghost-8b-beta-128k
- Official website: https://ghost-x.org/docs/models/ghost-8b-beta/
lamhieu 
posted an update 12 months ago
🎉 The Ghost 8B Beta model outperforms prominent models such as Llama 3 8B Instruct and GPT 3.5 Turbo on the lc_winrate score. It also outperforms Claude 3 Opus, Claude 3 Sonnet, GPT-4, and Mistral Large on the AlpacaEval 2.0 winrate score.
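
For readers unfamiliar with the metric, a plain win rate is the share of head-to-head comparisons in which a judge prefers the model's answer over a baseline's. The sketch below is only an illustration under stated assumptions: the `win_rate` helper, the outcome labels, and the choice to count a tie as half a win are all hypothetical, and it does not implement the length-controlled adjustment behind lc_winrate.

```python
def win_rate(outcomes: list[str]) -> float:
    """Return the win rate in percent from per-prompt outcomes.

    Each outcome is "win", "tie", or "loss" from the evaluated model's
    point of view. Counting a tie as half a win is an assumption of this sketch.
    """
    if not outcomes:
        raise ValueError("at least one comparison is required")
    points = {"win": 1.0, "tie": 0.5, "loss": 0.0}
    return 100.0 * sum(points[outcome] for outcome in outcomes) / len(outcomes)
```

For example, two wins, one loss, and one tie over four prompts give `win_rate(["win", "win", "loss", "tie"])`, which is 62.5.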

Ghost 8B Beta is a large language model developed with three goals: excellent multilingual support, superior knowledge capabilities, and cost-effectiveness. The model comes in two context-length versions, 8k and 128k, and supports multilingual function tools by default.
The languages supported are 🇺🇸 English, 🇫🇷 French, 🇮🇹 Italian, 🇪🇸 Spanish, 🇵🇹 Portuguese, 🇩🇪 German, 🇻🇳 Vietnamese, 🇰🇷 Korean and 🇨🇳 Chinese.

Explore the Potential:
To learn more about this groundbreaking language model, visit the official website or explore the online demo platforms:
- Ghost 8B Beta (β, 8k) on Spaces: lamhieu/ghost-8b-beta-8k
- Ghost 8B Beta (β, 128k) on Spaces: lamhieu/ghost-8b-beta-128k
- Official website: https://ghost-x.org/docs/models/ghost-8b-beta
lamhieu 
posted an update 12 months ago
Ghost 8B Beta is a large language model developed with three goals: excellent multilingual support, superior knowledge capabilities, and cost-effectiveness. The model comes in two context-length versions, 8k and 128k, and supports multilingual function tools by default.
* The languages supported are 🇺🇸 English, 🇫🇷 French, 🇮🇹 Italian, 🇪🇸 Spanish, 🇵🇹 Portuguese, 🇩🇪 German, 🇻🇳 Vietnamese, 🇰🇷 Korean and 🇨🇳 Chinese.
* 👨‍💻 Try on Spaces: lamhieu/ghost-8b-beta-8k
* 📋 Official website: https://ghost-x.org/docs/models/ghost-8b-beta
lamhieu 
posted an update about 1 year ago
Wow, this is amazing! 🤯
Samba is a powerful hybrid model with an unlimited context length, built by stacking Mamba, MLP, Sliding Window Attention, and MLP layers. Samba's largest version, Samba-3.8B, trained on 3.2 trillion tokens, excels in benchmarks like MMLU, GSM8K, and HumanEval, and shines in long-context tasks with minimal tuning.
---
Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"
Github: https://github.com/microsoft/Samba
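
To make the Sliding Window Attention part concrete, here is a minimal sketch of the attention mask that idea implies: each position may attend only to itself and a fixed number of preceding positions. This is a generic illustration, not Samba's implementation; the function name and the `window` parameter are assumptions.

```python
def sliding_window_causal_mask(seq_len: int, window: int) -> list[list[bool]]:
    """Build a causal sliding-window attention mask.

    mask[i][j] is True when query position i may attend to key position j,
    that is, when j is one of the `window` most recent positions up to and
    including i.
    """
    if window < 1:
        raise ValueError("window must be at least 1")
    return [[(i - window < j <= i) for j in range(seq_len)] for i in range(seq_len)]
```

Because each row has at most `window` allowed positions, the attention cost per token stays constant as the sequence grows, which is one reason such layers pair well with a recurrent component for long contexts.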
lamhieu 
posted an update about 1 year ago
Hello! I'm continuing to experiment with a checkpoint of Ghost Beta (small version) taken during stage 1 of training (training progress: 41%).

Supported languages: 🇺🇸 English, 🇪🇸 Spanish, 🇵🇹 Portuguese, 🇫🇷 French, 🇮🇹 Italian, 🇩🇪 German, 🇻🇳 Vietnamese, 🇰🇷 Korean, 🇨🇳 Chinese, and !?

Note that these are not final results; this is only a snapshot of the model's current state. If you find it interesting, please follow the project at:
* https://x.com/ghostx_ai
* https://ghost-x.org/
* ghost-x

Ghost X currently welcomes invitations for collaboration, knowledge sharing, and support.
🤯👇
lamhieu 
posted an update about 1 year ago
Following the previous survey, Ghost Beta (small version) will support 9+ languages fluently. The model is designed to be trained in 3 stages; here is a checkpoint to try from stage 1 (training progress: 29%).

Supported languages: 🇺🇸 English, 🇪🇸 Spanish, 🇵🇹 Portuguese, 🇫🇷 French, 🇮🇹 Italian, 🇩🇪 German, 🇻🇳 Vietnamese, 🇰🇷 Korean, 🇨🇳 Chinese, and !?

Note that these are not final results; this is only a snapshot of the model's current state. If you find it interesting, please follow the project at:
* https://x.com/ghostx_ai
* https://ghost-x.org/
* ghost-x

🤯👇