Alibaba Cloud claims its new Aegaeon pooling system reduced the number of Nvidia GPUs required to serve large language models ...
Scheduler squeezes more work from fewer H20 accelerators Alibaba Cloud boffins have emerged from their smoke filled labs ...
This innovation was tested in Alibaba Cloud's model marketplace for over three months, according to a research paper ...
Better scheduling and resource-sharing for inferencing workloads using multiple models, not a training breakthrough ...
Alibaba Group Holding has introduced a computing pooling solution that it said led to an 82 per cent cut in the number of Nvidia graphics processing units (GPUs) needed to serve its artificial ...
Alibaba Cloud has beta-deployed the solution in its model marketplace, where it is serving tens of models. The company claims ...
Alibaba Group has launched a computing pooling system, Aegaeon, which reportedly reduces the use of Nvidia GPUs in its AI ...
Nvidia Corp. (NVDA) stock rose moderately in Monday’s early premarket session as traders digested mixed catalysts concerning ...
Investing.com -- Alibaba Cloud has published a paper detailing its Aegaeon GPU resource optimization solution for large language model (LLM) concurrent inferencing, the company announced Monday. The ...