DX

最近更新: 18小时前

ROCK

A construction kit for reinforcement learning environment management.

最近更新: 18小时前

unified-audio

最近更新: 18小时前

mteb

MTEB: Massive Text Embedding Benchmark (Fork for contributing to mteb. All changes intended for upstream PRs.)

最近更新: 18小时前

identity-grpo

Identity-GRPO: Optimizing Multi-Human Identity-preserving Video Generation via Reinforcement Learning

最近更新: 18小时前

SmartResume

最近更新: 18小时前

InferSim

A Lightweight LLM Inference Performance Simulator

最近更新: 18小时前

SKYLENAGE-GameCodeGym

最近更新: 18小时前

Taobao3D

最近更新: 18小时前

Helios

最近更新: 18小时前

vstyle

最近更新: 18小时前

seckit

最近更新: 18小时前

page-agent

最近更新: 18小时前

TPO

[EMNLP 2025] The official implementation of the paper "Token Preference Optimization with Self-Calibrated Visual-Anchored Rewards for Hallucination...

最近更新: 18小时前

sec-code-bench

SecCodeBench is a benchmark suite focusing on evaluating the security of code generated by large language models (LLMs).

最近更新: 18小时前

alibabacloud-nas-utils

最近更新: 18小时前

Logics-Parsing

最近更新: 18小时前

loongsuite-java-agent

Based on OpenTelemetry Java Instrumentation, open source parts of Alibaba Instrumentations and Extensions

最近更新: 18小时前

agui-chain

最近更新: 18小时前

ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

最近更新: 18小时前
成就
445
Star
239
Fork
成员(1)
镜像

搜索帮助