Skip to content

Popular repositories Loading

  1. lna-es lna-es Public

    あらゆるジャンルのテキストをLLMを使いNeo4Jグラフ化して、グラフのみのデータから意味的復元をするシステムのスターター(MCP対応予定)

    16 1

  2. blackwell-geforce-nvfp4-gemm blackwell-geforce-nvfp4-gemm Public

    NVFP4 inference on Blackwell GeForce (RTX 5090/5080/5070 Ti/RTX PRO 6000) — SM120 patches for vLLM + FlashInfer + CUTLASS. 175 tok/s on Qwen3.6-35B MoE.

    Python 11

  3. NVFP4studio NVFP4studio Public

    An open-source cross-platform studio for running NVFP4 models locally, featuring a chat interface, OpenAI-compatible API, performance benchmarking, and multilingual support for English, Chinese, an…

    Python 3

  4. GGUF-to-NVFP4-SM120 GGUF-to-NVFP4-SM120 Public

    Lna-Lab production pipeline: GGUF -> modelopt-format NVFP4 + working MTP head for vLLM on RTX PRO 6000 Blackwell (SM120). Stages 2 (NVFP4) and 3 (MTP graft) are Lna-Lab originals; stage 1 (GGUF->bf…

    Python 2 1

  5. VLLM-TurboQuant-SM120 VLLM-TurboQuant-SM120 Public

    Blackwell-ready TurboQuant KV cache compression for Trinity-Large-Thinking on vLLM.

    Python 1

  6. DeepGEMM-for-SM120e DeepGEMM-for-SM120e Public

    Cuda 1

Repositories

Showing 10 of 11 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…