Filter
Exclude
Time range
-
Near
Congrats to my co-authors @RachaelSim2 @YvonneFan12 @snoidetx @michael_xinyi @pjaillet!!! @icmlconf #ICML2026 #CollaborativeLearning involves training high-quality models using datasets from a number of sources. To incentivize sources to share data, existing #DataValuation methods fairly reward each source based on its data submitted as is. However, as these methods do not verify nor incentivize data truthfulness, the sources can manipulate their data (e.g., by submitting duplicated or noisy data) to artificially increase their valuations and rewards or prevent others from benefiting. This paper presents the first mechanism that provably ensures (F) collaborative #fairness and incentivizes (T) #truthfulness at equilibrium for Bayesian models. Our mechanism combines semivalues (e.g., #ShapleyValue), which ensure fairness, and a truthful data valuation function (DVF) based on a validation set that is unknown to the sources. As semivalues are influenced by others' data, we introduce an additional condition to prove that a source can maximize its expected data values in coalitions and semivalues by submitting a dataset that captures its true knowledge. Additionally, we discuss the implications and suitable relaxations of (F) and (T) when the mediator has a limited budget for rewards or lacks a validation set. Our theoretical findings are validated on synthetic and real-world datasets.
#ICML2026 @icmlconf has spoken! Generated by @GeminiApp. Stay tuned for more ๐ŸŽผ ๐ŸŽต ๐ŸŽถ...
2
4
24
4,522
17 Jun 2025
Not all data is equal. xKnownโ€™s AI Agent listens, analyzes, and assigns a real-time value to every voice snippet you upload โ€” based on content, uniqueness, and usefulness. You speak. It evaluates. You earn. #xKnown #DataValuation
2
1
3
171
2๏ธโƒฃ 94.7% completion rate; a testament to the dedication and smooth experience. Massive thanks to everyone who participated; this is what decentralized intelligence looks like in action. #Codatta #AIResearch #CMU #DataValuation
5
4
267
๐Ÿšจ๐Ÿค” How can we reduce the cost of cooperative game-based #DataValuation without retraining a model for every coalition? ๐Ÿ’ก๐Ÿ” DUPRE โ€” Data Utility PREdiction for efficient data valuation โ€” fits a #GaussianProcess predictor with a sliced Wasserstein kernel to estimate each coalitionโ€™s utility from just a handful of evaluated subsets. ๐Ÿ‘ฉโ€๐Ÿ’ป๐Ÿ‘จโ€๐Ÿ’ป nguyen pham @RachaelSim2 @qphong see kiong ng @bryanklow ๐Ÿš€ Paper: arxiv.org/abs/2502.16152 ๐Ÿ“Œ Catch DUPRE @AAMASconf #AAMAS2025 โ€” 22 May (afternoon) ๐Ÿ—ฃOral in Ambassador Ballroom โ€ข Salon 3 (3 pm) ๐Ÿ–ผ Poster #1303 in Ontario Exhibit Hall (3rd floor) ( p.m.) Drop by to see how DUPRE: โœ… exploits ownersโ€™ data similarity to predict utilities โœ… plugs into any cooperative game theory techniques โœ… delivers uncertainty-aware data valuations
10
539
Distributionally robust #DataValuation @icmlconf #ICML2024!
1
4
346
The @icmlconf #ICML2024 work of @xiaoqiang_98 @michael_xinyi @WuZhaoxuan et Al. presents distributionally robust #DataValuation without a known validation distribution. #DataCentricAI Paper: openreview.net/forum?id=mbBeโ€ฆ Visit us at Poster Session 3 Wed 24 Jul 11:30AM Hall C 4-9 #2402
11
2
12
754
The @icmlconf #ICML2024 work of @RachaelSim2 @YvonneFan12 @snoidetx et al. presents DADS to select data for model training while anticipating #DataDeletions. #DataSelection #ActiveLearning #DataValuation openreview.net/forum?id=ecvuโ€ฆ Poster Session 4 Wed 24 Jul 1:30PM Hall C 4-9 #2306
11
16
405
Our research group & collaborators have put together 4 chapters in the #FederatedLearning: Theory and Practice book: fairness (ch.8), #DataValuation (ch.15) & incentives (ch.16) in #FederatedLearning, and federated sequential decision making (ch.14). sciencedirect.com/book/97804โ€ฆ (1/n)
4
6
30
2,122
#DataDeletion challenges fairness & interpretability of #DataValuation when they co-exist. The #AAAI2024 @RealAAAI work of @snoidetx fan jue @RachaelSim2 introduces DeRDaVa to solve this problem... #ShapleyValue (1/n)
2
2
9
723
Congrats to @ZhuanghuaL luo luo @snoidetx fan jue @RachaelSim2 for their accepted papers to #AAAI2024 @RealAAAI #Optimization #ShapleyValue #DataValuation
1
14
1,179
The #NeurIPS2023 @NeurIPSConf work of @RachaelSim2 yehong @nghiaht87 @michael_xinyi @pjaillet introduces #DifferentialPrivacy as an incentive for collaborative ML, besides fairness, individual rationality... #ShapleyValue #DataValuation #FederatedLearning neurips.cc/virtual/2023/postโ€ฆ
1
2
20
759
The #NeurIPS2023 @NeurIPSConf work of @michael_xinyi @chi_thanh_lam chuan-sheng proposed the model #ShapleyValue for equitable model valuation (in contrast to #DataValuation). #FederatedLearning
1
2
16
786
5 Sep 2023
Value of Saudi data as a national treasure SAR 467 billion. Learn More โžก๏ธyourdataconnect.com/product/ #DataValue #DataValuation #DataQuality #GDP #DataManagement #Data #Analytics #CDO #CheifDataOfficer #YDC
1
1
38