Skip to content
View zhangyan612's full-sized avatar

Block or report zhangyan612

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

NVIDIA Isaac GR00T N1 is the world's first open foundation model for generalized humanoid robot reasoning and skills.

Jupyter Notebook 2,912 345 Updated Mar 28, 2025

[CVPR 2025] EgoLife: Towards Egocentric Life Assistant

Python 244 16 Updated Mar 19, 2025

This is ROSKA repository.

Python 5 1 Updated Aug 30, 2024

The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 1,852 116 Updated Mar 28, 2025

Official implementation of AppAgentX: Evolving GUI Agents as Proficient Smartphone Users

Python 268 29 Updated Mar 6, 2025

A Conversational Speech Generation Model

Python 11,907 1,000 Updated Mar 27, 2025

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 14,614 1,689 Updated Mar 28, 2025

"Your Fully-Automated Personal AI Assistant, and Open-Source & Cost-Efficient Alternative to OpenAI's Deep Research"

Python 837 109 Updated Feb 23, 2025

Make websites accessible for AI agents

Python 49,939 5,229 Updated Mar 29, 2025

Automate browser-based workflows with LLMs and Computer Vision

Python 12,776 975 Updated Mar 28, 2025

A small robot especially for rl training locomotion

C 32 2 Updated Feb 19, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 9,305 643 Updated Mar 27, 2025

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,932 426 Updated Mar 5, 2025

https://hf.co/hexgrad/Kokoro-82M

JavaScript 1,954 211 Updated Mar 25, 2025

Solve Visual Understanding with Reinforced VLMs

Python 4,395 271 Updated Mar 24, 2025
Python 7 Updated Nov 14, 2024

TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation

Python 223 38 Updated Mar 6, 2025
Python 17 1 Updated Feb 19, 2025

Two conversational AI agents switching from English to sound-level protocol after confirming they are both AI agents

TypeScript 4,065 328 Updated Mar 12, 2025

[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personali…

Python 2,100 171 Updated Mar 28, 2025

✨ 易上手的多平台 LLM 聊天机器人及开发框架 ✨ 平台支持 QQ、QQ频道、Telegram、微信、企微、飞书 | MCP 服务器、OpenAI、DeepSeek、Gemini、硅基流动、月之暗面、Ollama、OneAPI、Dify 等。附带 WebUI。

Python 6,788 406 Updated Mar 28, 2025

Open-sourced code for "HOMIE: Humanoid Loco-Manipulation with Isomorphic Exoskeleton Cockpit".

C++ 188 14 Updated Mar 28, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 9,287 1,004 Updated Mar 28, 2025

一个全开源低成本的双足机器人(2万元($3000))A Fully Opensourced Humanoid Robot with only $3000

C 140 26 Updated Mar 18, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 21,133 1,730 Updated Mar 26, 2025
Python 47 7 Updated Feb 18, 2025

Neural feels with neural fields: Visuo-tactile perception for in-hand manipulation

Python 105 8 Updated Nov 13, 2024

MLGym A New Framework and Benchmark for Advancing AI Research Agents

Python 458 43 Updated Mar 28, 2025

Action Chunking Transformers with In-the-Wild Learning Framework

Python 16 1 Updated Sep 28, 2023
Python 4,086 328 Updated Mar 12, 2025
Next
Showing results