|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
The graphs summarize 258 occurrences of 169 keywords
|
|
|
Results
Found 2090 publication records. Showing 2090 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
13 | Aimee Kendall Roundtree |
AI Explainability, Interpretability, Fairness, and Privacy: An Integrative Review of Reviews. |
HCI (40) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Jonathan Ugalde, Eduardo Godoy, Diego Mellado, Eduardo Cavieres, Bastian Carvajal, Carlos Fernández, Pamela Illescas, Rodrigo H. Avaria, Claudia Díaz, Rodrigo Ferreira, Marvin Querales, Scarlett Lever, Julio Sotelo, Stéren Chabert, Rodrigo Salas 0001 |
Torwards Trustworthy Machine Learning based systems: Evaluating breast cancer predictions interpretability using Human Centered Machine Learning and UX Techniques. |
HCI (47) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Ehsan Bonabi Mobaraki, Arijit Khan 0001 |
A Demonstration of Interpretability Methods for Graph Neural Networks. |
GRADES-NDA@SIGMOD |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Pedro Serrano e Silva, Ricardo P. M. Cruz, A. S. M. Shihavuddin, Tiago Gonçalves |
Interpretability-Guided Human Feedback During Neural Network Training. |
IbPRIA |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Prateek Goel |
Investigating the duality of CBR: Performance and Interpretability. |
ICCBR Workshops |
2023 |
DBLP BibTeX RDF |
|
13 | Corentin Ambroise, Antoine Grigis, Edouard Duchesnay, Vincent Frouin |
Multi-View Variational Autoencoders Allow for Interpretability Leveraging Digital Avatars: Application to the HBN Cohort. |
ISBI |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Xiaochuan Gou, Lijie Hu, Di Wang 0015, Xiangliang Zhang 0001 |
A Fundamental Model with Stable Interpretability for Traffic Forecasting. |
GeoPrivacy@SIGSPATIAL |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Genicleito Carvalho Beltrão Gonçalves, Tatiane Nogueira Rios |
GEnI-FR: Granularity to Ensure Interpretability of the Fuzzy Rules. |
FUZZ |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Joe Germino, Nuno Moniz, Nitesh V. Chawla |
Fairness-Aware Mixture of Experts with Interpretability Budgets. |
DS |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Md Abdul Kadir, Gowtham Krishna Addluri, Daniel Sonntag |
Harmonizing Feature Attributions Across Deep Learning Architectures: Enhancing Interpretability and Consistency. |
KI |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Gaofeng Shi, Fangfang Zhang 0003, Yi Mei 0001 |
Interpretability-Aware Multi-Objective Genetic Programming for Scheduling Heuristics Learning in Dynamic Flexible Job Shop Scheduling. |
CEC |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Rui Branco, Carlos Agostinho, Sergio Gusmeroli, Eleni Lavasa, Zoumpolia Dikopoulou, David Monzo, Fenareti Lampathaki |
Explainable AI in Manufacturing: an Analysis of Transparency and Interpretability Methods for the XMANAI Platform. |
ICE/ITMC |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Vitor A. C. Horta, Robin Sobczyk, Maarten C. Stol, Alessandra Mileo |
Semantic Interpretability of Convolutional Neural Networks by Taxonomy Extraction. |
NeSy |
2023 |
DBLP BibTeX RDF |
|
13 | Qifan He, Ruilin Xie, Li Li, Zhanqi Cui |
DeepIA: An Interpretability Analysis based Test Data Generation Method for DNN. |
QRS |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Abdelbaki Souid, Soufiene Ben Othman, Mohamed Hamroun 0001, Hedi Sakli |
Full Interpretability CBMIR to Help Minimize Radiologist Analysis Search Time. |
IWCMC |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Dipkamal Bhusal, Rosalyn Shin, Ajay Ashok Shewale, Monish Kumar Manikya Veerabhadran, Michael Clifford, Sara Rampazzi, Nidhi Rastogi |
SoK: Modeling Explainability in Security Analytics for Interpretability, Trustworthiness, and Usability. |
ARES |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Zhenxiao Cheng, Jie Zhou 0015, Wen Wu 0006, Qin Chen, Liang He 0001 |
Tell Model Where to Attend: Improving Interpretability of Aspect-Based Sentiment Classification via Small Explanation Annotations. |
ICASSP |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Yasitha Warahena Liyanage, Daphney-Stavroula Zois |
Interpretability in the Context of Sequential Cost-Sensitive Feature Acquisition. |
ICASSP |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Rajan Prasad Tripathi, Sunil Kumar Khatri, Darelle van Greunen, Danish Ather |
Unleashing the Power of Machine Learning: A Precision Paradigm for Breast Cancer Subtype Classification Using Open-Source Data, with Caution on Dataset Size and Interpretability. |
IC3I |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Huan Li, Yue Wang |
An Interpretability Case Study of Unknown Unknowns Taking Clothes Image Classification CNNs as an Example. |
CGI (3) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Xinliang Zhou, Dan Lin, Ziyu Jia, Jiaping Xiao, Chenyu Liu, Liming Zhai, Yang Liu 0003 |
An EEG Channel Selection Framework for Driver Drowsiness Detection via Interpretability Guidance. |
EMBC |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Jindong Wang 0001, Haoliang Li, Haohan Wang, Sinno Jialin Pan, Xing Xie 0001 |
Trustworthy Machine Learning: Robustness, Generalization, and Interpretability. |
KDD |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Zhiqiang Zhong, Davide Mottin |
Knowledge-augmented Graph Machine Learning for Drug Discovery: From Precision to Interpretability. |
KDD |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Baiyin Huang, Boheng Tan, Xiaoqin Tang, Guoqiang Xiao 0001 |
Enhancing Interpretability in CT Reconstruction Using Tomographic Domain Transform with Self-supervision. |
PRICAI (3) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Masaki Higashi, Minje Sung, Daiki Yamane, Kenta Inamuro, Shota Nagai, Ken Kobayashi, Kazuhide Nakata |
Decision Tree Clustering for Time Series Data: An Approach for Enhanced Interpretability and Efficiency. |
PRICAI (2) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Hajar Hakkoum, Ali Idri, Ibtissam Abnane, José Luis Fernades-Aleman |
Does Categorical Encoding Affect the Interpretability of a Multilayer Perceptron for Breast Cancer Classification? |
DATA |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Ahana Roy Choudhury, Radu Paul Mihail, Sorin Dan Chiriac |
Prediction of Thyroid Malignancy Using Contextual Semantic Interpretability from Sonograms. |
BIOIMAGING |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Alfredo Cuzzocrea, Qudrat E. Alahy Ratul, Islam Belmerabet, Edoardo Serra |
An AI Framework for Modelling and Evaluating Attribution Methods in Enhanced Machine Learning Interpretability. |
COMPSAC |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Shide Du, Zihan Fang, Shiyang Lan, Yanchao Tan, Manuel Günther, Shiping Wang, Wenzhong Guo |
Bridging Trustworthiness and Open-World Learning: An Exploratory Neural Approach for Enhancing Interpretability, Generalization, and Robustness. |
ACM Multimedia |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Tsun-Hsuan Wang, Wei Xiao 0003, Tim Seyde, Ramin M. Hasani, Daniela Rus |
Measuring Interpretability of Neural Policies of Robots with Disentangled Representation. |
CoRL |
2023 |
DBLP BibTeX RDF |
|
13 | Vinícus Oliveira Barros, Celso A. M. Lopes Junior, Byron Leite Dantas Bezerra |
Interpretability of an Automatic Handwritten Signature Verification Model. |
LA-CCI |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Oksana Tkachman, Emily Sadlier-Brown, Roger Yu-Hsiang Lo, Carla L. Hudson Kam |
Interpretability in Sign Language Animal Signs. |
CogSci |
2023 |
DBLP BibTeX RDF |
|
13 | Mark Fedyk, Monika Ray |
How to Leverage Machine Learning Interpretability and Explainability to Generate Hypotheses in Cognitive Psychology. |
CogSci |
2023 |
DBLP BibTeX RDF |
|
13 | Filippo Neri |
Explainability and Interpretability in Agent based Modelling to Approximate Market Indexes. |
ICMLT |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Han Xuanyuan, Pietro Barbiero, Dobrik Georgiev, Lucie Charlotte Magister, Pietro Liò |
Global Concept-Based Interpretability for Graph Neural Networks via Neuron Analysis. |
AAAI |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Arshdeep Sekhon, Hanjie Chen, Aman Shrivastava, Zhe Wang 0025, Yangfeng Ji, Yanjun Qi |
Improving Interpretability via Explicit Word Interaction Graph Layer. |
AAAI |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Jiahao Chen 0006, Zitao Liu 0001, Shuyan Huang, Qiongqiong Liu, Weiqi Luo 0002 |
Improving Interpretability of Deep Sequential Knowledge Tracing Models with Question-centric Cognitive Representations. |
AAAI |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Minkyung Jung, Hakseung Kim, Seho Lee, Jung-Bin Kim, Dong-Joo Kim |
Interpretability of Hybrid Feature Using Graph Neural Networks from Mental Arithmetic Based EEG. |
BCI |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Sargam Yadav, Abhishek Kaushik 0002, Kevin McDaid |
Understanding Interpretability: Explainable AI Approaches for Hate Speech Classifiers. |
xAI (3) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Berkant Turan |
Extending Merlin-Arthur Classifiers for Improved Interpretability. |
xAI (Late-breaking Work, Demos, Doctoral Consortium) |
2023 |
DBLP BibTeX RDF |
|
13 | Milan Bhan, Nina Achache, Victor Legrand, Annabelle Blangero, Nicolas Chesneau |
Evaluating Self-attention Interpretability Through Human-Grounded Experimental Protocol. |
xAI (3) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Yilun Zhou, Julie Shah |
The Solvability of Interpretability Evaluation Metrics. |
EACL (Findings) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Assia Najm, Abdelali Zakrani |
On the Interpretability of a Tree-based Ensemble for Predicting Software Effort. |
CiSt |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Yashrajsinh Chudasama, Disha Purohit, Philipp D. Rohde, Maria-Esther Vidal |
Enhancing Interpretability of Machine Learning Models over Knowledge Graphs. |
SEMANTiCS (Posters & Demos) |
2023 |
DBLP BibTeX RDF |
|
13 | Lennart Schneider, Bernd Bischl, Janek Thomas |
Multi-Objective Optimization of Performance and Interpretability of Tabular Supervised Machine Learning Models. |
GECCO |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Narges Ashtari, Ryan Mullins, Crystal Qian, James Wexler, Ian Tenney, Mahima Pushkarna |
From Discovery to Adoption: Understanding the ML Practitioners' Interpretability Journey. |
Conference on Designing Interactive Systems |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Natalia Flechas Manrique, Wanqian Bao, Aurélie Herbelot, Uri Hasson |
Enhancing Interpretability Using Human Similarity Judgements to Prune Word Embeddings. |
BlackboxNLP@EMNLP |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Egon W. Stemle, Martina Tebaldini, Francesca Bonanni, Filippo Pellegrino, Paolo Brasolin, Greta H. Franzini, Jennifer-Carmen Frey, Olga Lopopolo, Stefania Spina |
bot.zen at LangLearn: regressing towards interpretability (short paper). |
EVALITA |
2023 |
DBLP BibTeX RDF |
|
13 | Emina Tahirovic, Senka Krivic |
Interpretability and Explainability of Logistic Regression Model for Breast Cancer Detection. |
ICAART (3) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Rajhans Singh, Ankita Shukla, Pavan K. Turaga |
Improving Shape Awareness and Interpretability in Deep Networks Using Geometric Moments. |
CVPR Workshops |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Xiongren Chen, Jiuyong Li, Jixue Liu, Stefan Peters, Lin Liu 0003, Thuc Duy Le, Anthony Walsh |
Improve interpretability of Information Bottlenecks for Attribution with Layer-wise Relevance Propagation. |
IEEE Big Data |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Sarah A. Reynolds, Abigail Butka, Brian Butka |
Semantic Validation and Interpretability of Object Detection. |
ICSC |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Amir L. Rifi, Febe Geirnaert, Camille Raets, Chaïmae El Aisati, Inès Dufait, Mark De Ridder, Kurt Barbé |
Murine in vivo tumor model to explain the interpretability of radiomic features. |
MeMeA |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Flávio Arthur Oliveira Santos, Cleber Zanchettin |
Exploring Image Classification Robustness and Interpretability with Right for the Right Reasons Data Augmentation. |
ICCV (Workshops) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Vedant Palit, Rohan Pandey, Aryaman Arora, Paul Pu Liang |
Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP. |
ICCV (Workshops) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Raissa Barcellos, Flávia Bernardini, José Viterbo, Anneke Zuiderwijk |
Hippolyta: a framework to enhance open data interpretability and empower citizens. |
DG.O |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Xiaoqin Tang, Baiyin Huang, Guoqiang Xiao 0001 |
Enhancing Precision and Interpretability of CT Image Reconstruction via Self-supervised Adaptive Domain Transformation. |
BIBM |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Qingfeng Wu, Shan Cao, Lu Cao, Qunsheng Ruan |
Visualization Analysis of Tongue Syndrome Diagnosis by Interpretability Neural Networks. |
BIBM |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Vincenzo Norman Vitale, Loredana Schettino, Francesco Cutugno |
On Incrementing Interpretability of Machine Learning Models from the Foundations: A Study on Syllabic Speech Units. |
CLiC-it |
2023 |
DBLP BibTeX RDF |
|
13 | Minyan Zeng, Yutong Xie, Minh-Son To, Lauren Oakden-Rayner, Luke Whitbread, Stephen Bacchi, Alix Bird, Luke Smith, Rebecca Scroop, Timothy Kleinig, Jim Jannes, Lyle J. Palmer, Mark Jenkinson |
Improved Flexibility and Interpretability of Large Vessel Stroke Prognostication Using Image Synthesis and Multi-task Learning. |
MICCAI (5) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Giuseppe Attanasio, Flor Miriam Plaza del Arco, Debora Nozza, Anne Lauscher |
A Tale of Pronouns: Interpretability Informs Gender Bias Mitigation for Fairer Instruction-Tuned Machine Translation. |
EMNLP |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Sean Xie, Soroush Vosoughi, Saeed Hassanpour |
Proto-lm: A Prototypical Network-Based Framework for Built-in Interpretability in Large Language Models. |
EMNLP (Findings) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Andrea W. Wen-Yi, David Mimno |
Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings. |
EMNLP |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Jonathan Crabbé, Mihaela van der Schaar |
Evaluating the Robustness of Interpretability Methods through Explanation Invariance and Equivariance. |
NeurIPS |
2023 |
DBLP BibTeX RDF |
|
13 | Roland S. Zimmermann, Thomas Klein, Wieland Brendel |
Scale Alone Does not Improve Mechanistic Interpretability in Vision Models. |
NeurIPS |
2023 |
DBLP BibTeX RDF |
|
13 | David Lindner, János Kramár, Sebastian Farquhar, Matthew Rahtz, Tom McGrath, Vladimir Mikulik |
Tracr: Compiled Transformers as a Laboratory for Interpretability. |
NeurIPS |
2023 |
DBLP BibTeX RDF |
|
13 | Usha Bhalla, Suraj Srinivas, Himabindu Lakkaraju |
Discriminative Feature Attributions: Bridging Post Hoc Explainability and Inherent Interpretability. |
NeurIPS |
2023 |
DBLP BibTeX RDF |
|
13 | Sarah Schwettmann, Tamar Rott Shaham, Joanna Materzynska, Neil Chowdhury, Shuang Li, Jacob Andreas, David Bau, Antonio Torralba 0001 |
FIND: A Function Description Benchmark for Evaluating Interpretability Methods. |
NeurIPS |
2023 |
DBLP BibTeX RDF |
|
13 | Lu Yan, Zhuo Zhang 0002, Guanhong Tao 0001, Kaiyuan Zhang 0002, Xuan Chen, Guangyu Shen, Xiangyu Zhang |
ParaFuzz: An Interpretability-Driven Technique for Detecting Poisoned Samples in NLP. |
NeurIPS |
2023 |
DBLP BibTeX RDF |
|
13 | Zhengxuan Wu, Atticus Geiger, Thomas Icard, Christopher Potts, Noah D. Goodman |
Interpretability at Scale: Identifying Causal Mechanisms in Alpaca. |
NeurIPS |
2023 |
DBLP BibTeX RDF |
|
13 | Arthur Conmy, Augustine N. Mavor-Parker, Aengus Lynch, Stefan Heimersheim, Adrià Garriga-Alonso |
Towards Automated Circuit Discovery for Mechanistic Interpretability. |
NeurIPS |
2023 |
DBLP BibTeX RDF |
|
13 | Jiawen Chen, Wancen Mu, Yun Li, Didong Li |
On the Identifiability and Interpretability of Gaussian Process Models. |
NeurIPS |
2023 |
DBLP BibTeX RDF |
|
13 | Ilia Smirnov, Scott Sanner, Baher Abdulhai |
A Case for Monte Carlo Tree Search in Adaptive Traffic Signal Control: Modifiability, Interpretability and Generalization. |
ITSC |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Abhigya Verma, Pooja Gera, Shweta Singhal, A. K. Mohapatra |
Advanced Regression Models for Accurate House Price Prediction: An Analysis of Performance and Interpretability. |
ICCCNT |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Zhao Ren, Thanh Tam Nguyen, Mohammad Mehdi Zahedi, Wolfgang Nejdl |
Self-Explaining Neural Networks for Respiratory Sound Classification with Scale-free Interpretability. |
IJCNN |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Yoshinari Motokawa, Toshiharu Sugawara |
Interpretability for Conditional Coordinated Behavior in Multi-Agent Reinforcement Learning. |
IJCNN |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Svetlana Pavlitska, Christian Hubschneider, Lukas Struppek, J. Marius Zöllner |
Sparsely-gated Mixture-of-Expert Layers for CNN Interpretability. |
IJCNN |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Catarina Silva 0001, Tiago Faria, Bernardete Ribeiro |
Towards Interpretability in Fintech Applications via Knowledge Augmentation. |
EPIA (1) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Jiaming Qu, Jaime Arguello, Yue Wang 0035 |
Understanding the Cognitive Influences of Interpretability Features on How Users Scrutinize Machine-Predicted Categories. |
CHIIR |
2023 |
DBLP DOI BibTeX RDF |
|
13 | |
IEEE/ACM International Workshop on Interpretability and Robustness in Neural Software Engineering, InteNSE@ICSE 2023, Melbourne, Australia, May 14, 2023 |
InteNSE@ICSE |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Ting-Wei Wu, Jia-Hong Huang, Joseph Lin, Marcel Worring |
Expert-defined Keywords Improve Interpretability of Retinal Image Captioning. |
WACV |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Yixiao Lu, Yokiu Lee, Haoran Feng, Johnathan Leung, Alvin Cheung, Katharina Dost, Katerina Taskova, Thomas Lacombe |
Interpretability Meets Generalizability: A Hybrid Machine Learning System to Identify Nonlinear Granger Causality in Global Stock Indices. |
PAKDD (2) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Yuanjie Duan, Xingquan Zuo, Hai Huang, Binglin Wu, Xinchao Zhao |
A Local Interpretability Model-Based Approach for Black-Box Adversarial Attack. |
DMBD (2) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Finn Schofield, Luis Slyfield, Andrew Lensen |
A Genetic Programming Encoder for Increasing Autoencoder Interpretability. |
EuroGP |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Luca Patanè, Francesca Sapuppo, Giuseppa Scipilliti, Maria Gabriella Xibilia |
Model-Agnostic Soft Sensor Interpretability. |
MetroXRAINE |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Amir Hosein Oveis, Elisa Giusti, Selenia Ghio, Giulio Meucci, Marco Martorella |
Credible Recognition of Radar Images: Interpretability Metric and Classification Score. |
IGARSS |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Andrea Colombo, Laura Fiorenza, Sofia Mongardi |
A Flexible Metric-Based Approach to Assess Neural Network Interpretability in Image Classification. |
XAI.it@AI*IA |
2023 |
DBLP BibTeX RDF |
|
13 | Kevin Ro Wang, Alexandre Variengien, Arthur Conmy, Buck Shlegeris, Jacob Steinhardt |
Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 Small. |
ICLR |
2023 |
DBLP BibTeX RDF |
|
13 | Kieran A. Murphy, Danielle S. Bassett |
Interpretability with full complexity by constraining feature information. |
ICLR |
2023 |
DBLP BibTeX RDF |
|
13 | Neel Nanda, Lawrence Chan, Tom Lieberum, Jess Smith, Jacob Steinhardt |
Progress measures for grokking via mechanistic interpretability. |
ICLR |
2023 |
DBLP BibTeX RDF |
|
13 | Lennart Brocki, Neo Christopher Chung |
Fidelity of Interpretability Methods and Perturbation Artifacts in Neural Networks. |
Tiny Papers @ ICLR |
2023 |
DBLP BibTeX RDF |
|
13 | Nathanael Jo, Sina Aghaei, Jack Benson, Andrés Gómez 0001, Phebe Vayanos |
Learning Optimal Fair Decision Trees: Trade-offs Between Interpretability, Fairness, and Accuracy. |
AIES |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Hyeongrok Han, Siwon Kim, Hyun-Soo Choi, Sungroh Yoon |
On the Impact of Knowledge Distillation for Model Interpretability. |
ICML |
2023 |
DBLP BibTeX RDF |
|
13 | Timothy Doyeon Kim, Tankut Can, Kamesh Krishnamurthy |
Trainability, Expressivity and Interpretability in Gated Neural ODEs. |
ICML |
2023 |
DBLP BibTeX RDF |
|
13 | Bishwamittra Ghosh |
Interpretability and Fairness in Machine Learning: A Formal Methods Approach. |
IJCAI |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Wenjie Zhou, Zhaoyang Han, Chuan Ma, Zhe Liu 0001, Piji Li |
Interpretability-Based Cross-Silo Federated Learning. |
CICAI (2) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Mingrong Xiang, Wei Luo 0001, Jingyu Hou 0001, Wenjing Tao |
Bridging the Interpretability Gap in Coupled Neural Dynamical Models. |
ADMA (1) |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Takaya Okamoto, Chunzhi Gu, Jun Yu, Chao Zhang 0030 |
Generating Smooth Interpretability Map for Explainable Image Segmentation. |
GCCE |
2023 |
DBLP DOI BibTeX RDF |
|
13 | Rui Xu, Wenkang Qin, Peixiang Huang, Hao Wang, Lin Luo 0006 |
SCAAT: Improving Neural Network Interpretability via Saliency Constrained Adaptive Adversarial Training. |
BMVC |
2023 |
DBLP BibTeX RDF |
|
13 | Dong Wang, Qifei Li, Yingming Gao, Yong Liu 0002, Ya Li |
Exploring the interpretability in speech-based adolescent depression detection by SHAP. |
ICCIP |
2023 |
DBLP DOI BibTeX RDF |
|
Displaying result #401 - #500 of 2090 (100 per page; Change: ) Pages: [ <<][ 1][ 2][ 3][ 4][ 5][ 6][ 7][ 8][ 9][ 10][ 11][ 12][ 13][ 14][ >>] |
|