This paper presents the first framework to deliberately train neural networks for accuracy and agreement between feature attribution techniques: PEAR (Post hoc Explainer Agreement Regularizer). In addition to the conventional task loss, PEAR incorporates a correlation-based consensus loss that combines Pearson and Spearman correlation measures, promoting alignment across explainers like Grad and Integrated Gradients. By using a soft ranking approximation to address differentiability issues, the loss function is completely trainable by backpropagation. Tested on three OpenML tabular datasets, multilayer perceptrons trained using PEAR surpass linear baselines in accuracy and explanation consensus, and in certain instances, even compete with XGBoost. The findings advance reliable and interpretable AI by showing that consensus-aware training successfully reduces explanation disagreement while maintaining prediction performance.

Notes on Training Neural Networks for Consensus


Abstract and 1. Introduction

1.1 Post Hoc Explanation

1.2 The Disagreement Problem

1.3 Encouraging Explanation Consensus

  2. Related Work

  3. PEAR: Post Hoc Explainer Agreement Regularizer

  4. The Efficacy of Consensus Training

    4.1 Agreement Metrics

    4.2 Improving Consensus Metrics

    4.3 Consistency At What Cost?

    4.4 Are the Explanations Still Valuable?

    4.5 Consensus and Linearity

    4.6 Two Loss Terms

  5. Discussion

    5.1 Future Work

    5.2 Conclusion, Acknowledgements, and References

Appendix

3 PEAR: POST HOC EXPLAINER AGREEMENT REGULARIZER

Our contribution is the first effort to train models that are both accurate and explicitly regularized via consensus between local explainers. When neural networks are trained naturally (i.e., with a single task-specific loss term like cross-entropy), disagreement between post hoc explainers often arises. Therefore, we include an additional loss term that measures the amount of explainer disagreement during training, encouraging consensus between explanations. Since human-aligned notions of explanation consensus can be captured by more than one agreement metric (listed in A.3), we aim to improve several agreement metrics with one loss function.[2]

Our consensus loss term is a convex combination of the Pearson and Spearman correlation measurements between the vectors of attribution scores (Spearman correlation is just the Pearson correlation on the ranks of a vector).

To paint a clearer picture of the need for two terms in the loss, consider the examples shown in Figure 3. In the upper example, the raw feature scores are very similar and the Pearson correlation coefficient is in fact 1 (to machine precision). However, when we rank these scores by magnitude, there is a big difference in their ranks, as indicated by the Spearman value. Likewise, in the lower portion of Figure 3 we show that two explanations with identical magnitudes will show a low Pearson correlation coefficient. Since some of the metrics we use to measure disagreement involve ranking and others do not, we conclude that a mixture of these two terms in the loss is appropriate.
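To make the tension concrete, here is a small numeric sketch in the spirit of the upper example (the values are hypothetical, not those from Figure 3): a shared large score makes the raw vectors nearly collinear, so Pearson is close to 1, while the small scores are reshuffled, so the ranks disagree and Spearman is low.

```python
# Hypothetical attribution vectors (not the values from Figure 3) on which the two
# correlation measures tell very different stories.
import numpy as np
from scipy.stats import pearsonr, spearmanr

e1 = np.array([0.001, 0.002, 0.003, 0.004, 100.0])
e2 = np.array([0.004, 0.001, 0.003, 0.002, 100.0])

r_pearson, _ = pearsonr(e1, e2)    # ~1.0: the shared large score dominates
r_spearman, _ = spearmanr(e1, e2)  # 0.3: the small scores are ranked very differently

print(f"Pearson:  {r_pearson:.6f}")
print(f"Spearman: {r_spearman:.2f}")
```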

While the example in Figure 3 shows two explanation vectors with similar scale, different explanation methods do not always align. Some explainers have the sums of their attribution scores constrained by various rules, whereas others have no such constraints. The correlation measurements we use in our loss therefore provide more latitude when comparing explainers than a direct difference measurement such as mean absolute error or mean squared error, since correlation is insensitive to differences in overall scale between the two attribution vectors.

Figure 2: Our loss function measures the error between the model outputs and the ground truth (task loss), as well as the disagreement between explainers (consensus loss). The weight given to the consensus loss term is controlled by a hyperparameter 𝜆. The consensus loss term is a convex combination of the Spearman and Pearson correlation measurements between feature importance scores, since increasing both rank correlation (Spearman) and raw-score correlation (Pearson) is useful for improving explainer consensus on our many agreement metrics.

Figure 3: Example feature attribution vectors where Pearson and Spearman show starkly different scores. Recall that both Pearson and Spearman correlation range from −1 to +1. Both of these pairs of vectors satisfy some human-aligned notion of consensus, but in each case one of the correlation metrics gives a low similarity score. Thus, in order to successfully encourage explainer consensus (by all of our metrics), we use both types of correlation in our consensus loss term.


We refer to the first term in the loss function as the task loss, or ℓtask, and for our classification tasks we use cross-entropy loss. A graphical depiction of the flow from data to loss value is shown in Figure 2. Formally, our complete loss function can be expressed as follows with two hyperparameters 𝜆, 𝜇 ∈ [0, 1]. We weight the influence of our consensus term with 𝜆, so lower values give more priority to the task loss. We weight the influence between the two explanation correlation terms with 𝜇, so lower values give more weight to Pearson correlation and higher values give more weight to Spearman correlation.
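A form consistent with this description, in which the consensus penalty shrinks as the correlations between the two attribution vectors $e_1$ and $e_2$ grow, is the following (the paper's exact formulation may differ, for example in how the correlations are converted into a penalty):

```latex
\mathcal{L}(x, y) \;=\; (1 - \lambda)\,\ell_{\mathrm{task}}\big(f(x),\, y\big)
\;+\; \lambda \Big[\, 1 - \big( (1 - \mu)\,\rho_{P}(e_1, e_2) + \mu\,\rho_{S}(e_1, e_2) \big) \Big]
```

Here $\rho_P$ denotes Pearson correlation and $\rho_S$ denotes Spearman correlation computed with the soft ranking described in Section 3.2.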


3.1 Choosing a Pair of Explainers

The consensus loss term is defined for any two explainers in general, but since we train with standard backpropagation we need these explainers to be differentiable. With this constraint in mind, and with some intuition about the objective of improving agreement metrics, we choose to train for consensus between Grad and IntGrad. If Grad and IntGrad align, then the function should become more locally linear in logit space. IntGrad computes the average gradient along a path in input space toward each point being explained. So, if we train the model to have a local gradient at each point (Grad) closer to the average gradient along a path to the point (IntGrad), then perhaps an easy way for the model to accomplish that training objective would be for the gradient along the whole path to equal the local gradient from Grad. This may push the model to be more similar to a linear model. This is something we investigate with qualitative and quantitative analysis in Section 4.
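As a concrete illustration, the two explainers can be written so that their attributions stay differentiable with respect to the model weights. The sketch below is ours, not the released PEAR code; the function names, the zero baseline, and the 16-step Riemann approximation of the path integral are illustrative choices.

```python
# A minimal sketch (assumed names, not the released PEAR implementation) of the two
# differentiable explainers used for consensus training.
import torch

def grad_attributions(model, x, target_class=1):
    """Grad: gradient of the target-class logit with respect to the input features."""
    x = x.clone().requires_grad_(True)
    logit = model(x)[:, target_class].sum()
    # create_graph=True keeps the attributions differentiable w.r.t. the model
    # weights, so a consensus term built from them can be trained by backpropagation.
    return torch.autograd.grad(logit, x, create_graph=True)[0]

def intgrad_attributions(model, x, target_class=1, baseline=None, steps=16):
    """IntGrad: average gradient along a straight path from a baseline to x,
    scaled elementwise by (x - baseline). A coarse Riemann sum is used here."""
    if baseline is None:
        baseline = torch.zeros_like(x)  # zero baseline is an illustrative choice
    total = torch.zeros_like(x)
    for alpha in torch.linspace(0.0, 1.0, steps):
        point = baseline + alpha * (x - baseline)
        total = total + grad_attributions(model, point, target_class)
    return (x - baseline) * (total / steps)
```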

3.2 Differentiability

On the note of differentiability, the ranking function 𝑅 is not differentiable. We substitute a soft ranking function from the torchsort package [3]. This provides a floating-point approximation of a vector's ordering rather than an exact integer computation, which allows for differentiation.
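The consensus term itself can then be assembled from Pearson correlation and a soft-rank Spearman correlation, as in the sketch below; the helper names and the regularization strength are our illustrative choices, while `torchsort.soft_rank` is the package's differentiable ranking function.

```python
# A hedged sketch of a differentiable consensus loss. Inputs e1 and e2 are
# attribution matrices of shape (batch, n_features); torchsort.soft_rank
# expects 2-D input.
import torchsort

def pearson(a, b, eps=1e-8):
    a = a - a.mean(dim=-1, keepdim=True)
    b = b - b.mean(dim=-1, keepdim=True)
    return (a * b).sum(dim=-1) / (a.norm(dim=-1) * b.norm(dim=-1) + eps)

def soft_spearman(a, b, regularization_strength=0.1):
    # Spearman correlation is Pearson correlation on ranks; soft_rank returns a
    # floating-point, differentiable approximation of those ranks.
    ra = torchsort.soft_rank(a, regularization_strength=regularization_strength)
    rb = torchsort.soft_rank(b, regularization_strength=regularization_strength)
    return pearson(ra, rb)

def consensus_loss(e1, e2, mu=0.5):
    # Convex combination of the two correlations, turned into a loss so that
    # higher agreement yields a lower value.
    corr = (1 - mu) * pearson(e1, e2) + mu * soft_spearman(e1, e2)
    return (1 - corr).mean()
```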

4 THE EFFICACY OF CONSENSUS TRAINING

In this section we present each experiment alongside the hypothesis it is designed to test. The datasets we use for our experiments are Bank Marketing, California Housing, and Electricity, three binary classification datasets available on the OpenML database [39]. For each dataset, we use the performance of a linear model (logistic regression) as a lower bound of realistic performance, because linear models are considered inherently explainable.

The models we train to study the impact of our consensus loss term are multilayer perceptrons (MLPs). While the field of tabular deep learning is still growing, and MLPs may be an unlikely choice for most data scientists on tabular data, deep networks provide the flexibility to adapt training loops for multiple objectives [1, 10, 17, 28, 31, 35]. We also verify that our MLPs outperform linear models on each dataset, because if deep models trained to reach consensus are less accurate than a linear model, we would be better off using the linear model.
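To make the multi-objective training loop concrete, a single PEAR-style training step could look like the sketch below, which reuses the illustrative `grad_attributions`, `intgrad_attributions`, and `consensus_loss` helpers defined above; this is a sketch of the idea under those assumptions, not the authors' training code.

```python
# A hedged sketch of one training step for a binary classifier with the combined
# objective. lam and mu correspond to the hyperparameters λ and μ from Section 3;
# explaining the positive-class logit (target_class=1) is an illustrative choice.
import torch.nn.functional as F

def train_step(model, optimizer, x, y, lam=0.5, mu=0.5):
    optimizer.zero_grad()
    logits = model(x)
    task = F.cross_entropy(logits, y)

    e1 = grad_attributions(model, x, target_class=1)     # Grad
    e2 = intgrad_attributions(model, x, target_class=1)  # IntGrad
    agree = consensus_loss(e1, e2, mu=mu)

    loss = (1 - lam) * task + lam * agree
    loss.backward()
    optimizer.step()
    return loss.item()
```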

We include XGBoost [6] as a point of comparison for our approach, as it has become a widely popular method with high performance and strong consensus metrics on many tabular datasets (figures in Appendix A.7). There are cases where we achieve more explainer consensus than XGBoost, but this point is tangential, as we are invested in exploring a loss for training neural networks.

For further details on our datasets and model training hyperparameters, see Appendices A.1 and A.2.


:::info Authors:

(1) Avi Schwarzschild, University of Maryland, College Park, Maryland, USA and Work completed while working at Arthur (avi1umd.edu);

(2) Max Cembalest, Arthur, New York City, New York, USA;

(3) Karthik Rao, Arthur, New York City, New York, USA;

(4) Keegan Hines, Arthur, New York City, New York, USA;

(5) John Dickerson†, Arthur, New York City, New York, USA (john@arthur.ai).

:::


:::info This paper is available on arxiv under CC BY 4.0 DEED license.

:::

[2] The PEAR package will be publicly available for download via the Package Installer for Python (pip), and it is also available upon request from the authors.

[3] When more than one entry has the same magnitude, they receive a common ranking value equal to the average of the ranks they would have if ordered arbitrarily.
