Tencent’s tech team has optimized DeepSeek’s open-source DeepEP communication framework,mother and son having sex x-videos boosting its performance across different network environments, according to the Chinese AI startup. Testing showed a 100% improvement on RoCE networks and a 30% gain on InfiniBand (IB), offering more efficient solutions for AI model training. On GitHub, DeepSeek acknowledged the Chinese tech giant’s contribution had led to a “huge speedup.” DeepEP is a communication library tailored for a mixture of experts (MoE) and expert parallelism (EP), supporting high-throughput, low-latency GPU kernels and low-precision computing, including FP8. Tencent’s Starlink Networking team identified two main bottlenecks: underutilized dual-port NIC bandwidth and CPU control latency. After targeted optimizations, performance doubled on RoCE and improved by 30% on IB. The enhanced framework is now fully open-source and has been successfully deployed in training Tencent’s Hunyuan large model, demonstrating strong versatility within environments built on Tencent’s Starlink and H20 servers, Chinese tech media outlet iThome reported. [iThome, in Chinese]
Related Articles
2025-06-27 06:45
2749 views
Best Soundcore by Anker Space A40 earbuds deal: Save $35 at Amazon
SAVE $35:As of Jan. 14, the Soundcore by Anker Space A40 earbuds are on sale for $44.99. This is 44%
Read More
2025-06-27 05:35
2725 views
Tinder tests height as a paid preference
Tinder's incoming CEO wants to rid the app of its hookup app reputation, but the app is testing a pr
Read More
2025-06-27 05:00
2209 views
Meghan O'Rourke on 'The Long Goodbye' by Thessaly La Force
Meghan O’Rourke on ‘The Long Goodbye’By Thessaly La ForceApril 25, 2011At WorkPhot
Read More