QuentinFuxa / WhisperLiveKit
Real-time & local speech-to-text, translation, and speaker diarization. With server & web UI.
{{ message }}
See what the GitHub community is most excited about this week.
Real-time & local speech-to-text, translation, and speaker diarization. With server & web UI.
E-mails, subdomains and names Harvester - OSINT
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
Open Source Alternative to NotebookLM / Perplexity, connected to external sources such as Search Engines, Slack, Linear, Jira, ClickUp, Confluence, Notion, YouTube, GitHub, Discord and more. Join our discord: https://discord.gg/ejRNvftDp9
Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate R&D process.
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
A Model Context Protocol (MCP) Gateway & Registry. Serves as a central management point for tools, resources, and prompts that can be accessed by MCP-compatible LLM applications. Converts REST API endpoints to MCP, composes virtual MCP servers with added security and observability, and converts between protocols (stdio, SSE, Streamable HTTP).
ð Freely available programming books
Mobile-Agent: The Powerful GUI Agent Family
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. æ¥è¿GPT-4o表ç°ç弿ºå¤æ¨¡æå¯¹è¯æ¨¡å
All Algorithms implemented in Python
Generate audiobooks from e-books
SoTA open-source TTS
ð¯ åå«ä¿¡æ¯è¿è½½ï¼åªççæ£å ³å¿çæ°é» - å¤å¹³å°çç¹èåå·¥å ·ï¼ä¸é®çæ§ä»æ¥å¤´æ¡ãç¾åº¦çæãå¾®åãæé³ãç¥ä¹ãBç«ç35个平å°ï¼æºè½å ³é®è¯çéï¼èªå¨çæçç¹åææ¥åãæ¯æä¼ä¸å¾®ä¿¡ãé£ä¹¦ãééãTelegramæ¨éï¼30ç§ç½é¡µé¨ç½²ï¼1åéææºéç¥ï¼æ?éç¼ç¨åºç¡ã乿¯ædockerç§äººé¨ç½²â è®©ç®æ³ä¸ºä½?æå¡ï¼èéè¢«ç®æ³ç»æ¶
A list of useful payloads and bypass for Web Application Security and Pentest/CTF
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
the only cheat sheet you need
A collection of projects showcasing RAG, agents, workflows, and other AI use cases