zhang
- Running • 7
Browser only - Screen Capture & OCR
One-minute creation by AI Coding Autonomous Agent MOUSE-I
- Running • 665
First Agent Template
Generate images, get time, and more with an AI assistant
- Runtime error • Featured • 128
OctoTools
An Agentic Framework with Tools for Complex Reasoning
- Running • Featured • 141
smolagents LLM leaderboard
A leaderboard for LLMs powering smolagents
- Running on Zero • Featured • 1.71k
Joy Caption Alpha Two
Generate customized captions for any image
- Runtime error • 40
Florence Llama
Generate text responses from images and text input
-
trollek/ImagePromptHelper-danube3-500M
Text Generation • 0.5B • Updated • 34 • 4
-
trollek/ImagePromptHelper-danube3-500M-GGUF
0.5B • Updated • 84 • 3
-
laion/laion-audio-preview
Viewer • Updated • 4.15M • 424 • 11
- Running on Zero • Featured • 2.84k
F5-TTS
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
- Paused • Featured • 2.21k
FacePoke
Import a portrait, click to move the head!
- Runtime error • Featured • 696
Fish Audio S1
Convert text to natural-sounding speech audio
-
allenai/olmOCR-7B-0225-preview
Image-Text-to-Text • 8B • Updated • 51.7k • 700
- Running on Zero • Featured • 81
Nanonets OCR
Demo for Nanonets-OCR
- Running on Zero • MCP • 405
Multimodal OCR
Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR
- Running on Zero • MCP • Featured • 142
Multimodal OCR2
FireRed / Nanonets / Monkey / Thyme / Typhoon / SmolDocling
-
chflame163/ComfyUI_LayerStyle
Updated • 348 • 111
-
allenai/Molmo-7B-D-0924
Image-Text-to-Text • 8B • Updated • 24.1k • 565
- Running on Zero • 249
Chroma
Generate detailed fantasy and realistic images from text descriptions
- Running • MCP • 46
Doc Mcp
RAG on documentations for your agent
- Running on Zero • 1.68k
Flux.1-dev Upscaler
Upscale low-resolution images to higher resolution
- Running on Zero • 459
InvSR
Image Super-resolution via Diffusion Inversion
- Paused • 241
FLUX Upsacle Image
Upscale images with control and customization
- Running on L4 • Featured • 283
Thera Arbitrary-Scale Super-Resolution
Upscale photos to any size with neural super-resolution
-
Djrango/Qwen2vl-Flux
Text-to-Image • Updated • 510
- Running on Zero • Featured • 939
OminiControl
Generate custom images from a reference photo and text
- Build error • 394
FLUXllama gpt-oss
mcp_server & FLUX 4-bit Quantization + Enhanced
- Running on L4 • Featured • 2.24k
MagicQuill
Edit photos with scribbles and AI-driven color changes