Alibaba Cloud claims its new Aegaeon pooling system reduced the number of Nvidia GPUs required to serve large language models ...
Better scheduling and resource-sharing for inferencing workloads using multiple models, not a training breakthrough Chinese ...
This innovation was tested in Alibaba Cloud's model marketplace for over three months, according to a research paper ...
The new Aegaeon system can serve dozens of large language models using a fraction of the GPUs previously required, ...
Scheduler squeezes more work from fewer H20 accelerators Alibaba Cloud boffins have emerged from their smoke filled labs having found a way to make Nvidia’s costly GPUs actually earn their keep. The ...
Alibaba Cloud has beta-deployed the solution in its model marketplace, where it is serving tens of models. The company claims ...
Alibaba Group has launched a computing pooling system, Aegaeon, which reportedly reduces the use of Nvidia GPUs in its AI ...
Investing.com -- Alibaba Cloud has published a paper detailing its Aegaeon GPU resource optimization solution for large language model (LLM) concurrent inferencing, the company announced Monday. The ...
Alibaba Group Holding (NYSE:BABA) is making headlines after announcing a major collaboration with Nvidia in artificial intelligence and humanoid robotics. The company’s shares jumped over 9% in Hong ...
Nvidia is on a dealmaking spree: Days after committing to taking a $5 billion stake in Intel and a whopping $100 billion investment in OpenAI, the GPU maker has now struck a partnership with China’s ...