-
Tsinghua University & PJLab
- http://www.jifengdai.org/
Stars
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory
[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
Speech Recognition using DeepSpeech2.
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
这是 某宝 卖大几千的压枪源码,不做任何数据读取以及侵入,这里采用外数据采集(IMG), 至今可以使用,无视任何更新(新武器,以及新武器的压枪规则,需要自己调试,在data_config下);
PUBG - 罗技鼠标宏 | 兴趣使然的项目,完虐收费宏!点个Star支持一下作者![PUBG - Logitech mouse macro | Support 12 kinds of guns without recoil!]
Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".
OpenMMLab Detection Toolbox and Benchmark
Bottom-up Object Detection by Grouping Extreme and Center Points
SNIPER / AutoFocus is an efficient multi-scale object detection training / inference algorithm
Relation Networks for Object Detection
Flow-Guided Feature Aggregation for Video Object Detection
Deformable Convolutional Networks + MST + Soft-NMS
Deep Feature Flow for Video Recognition
Deformable Convolutional Networks
Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!
microsoft / caffe
Forked from BVLC/caffeCaffe on both Linux and Windows
Fully Convolutional Instance-aware Semantic Segmentation
R-FCN with joint training and python support
Instance-aware Semantic Segmentation via Multi-task Network Cascades