
Usama Fayyad
· Professor of the Practice, Executive Director Institute for EAIVerifiedNortheastern University · Artificial Intelligence and Data Science
Active 1988–2024
About
Usama Fayyad is a professor of the practice in the Khoury College of Computer Sciences and the executive director of the Institute for Experiential AI at Northeastern University, based at the Roux Institute in Portland, Maine. His research interests include data mining, image processing, machine intelligence, machine learning, and inductive learning. Fayyad has received multiple awards and honors for his research, such as the ACM SIGKDD Innovation Award, an ACM fellowship, and the US Government Medal from NASA. He has been published in prominent outlets including the Harvard Data Science Review and Communications of the ACM. His educational background includes a PhD in Computer Science and Engineering, an MS in Mathematics and Computer Science and Engineering, and a BS in Electrical Engineering and Computer Engineering, all from the University of Michigan.
Research topics
- Computer Science
- Data Mining
- Sociology
- Psychology
- Economics
- Data science
- Business
- Economy
- World Wide Web
- Internet privacy
Selected publications
Responsible AI: An Urgent Mandate
IEEE Intelligent Systems · 2024-01-01 · 26 citations
articleSenior authorAI is rapidly becoming essential in various industries, raising societal expectations. AI’s societal consequences include impacts on mental health; misinformation; workforce displacement; and economic, regulatory, and law enforcement challenges. Indeed, the regulation of AI usage is on the horizon, with the European Union and China already taking big steps, while the United States drafted its first AI-related bill of rights last year. Professional associations and other nonprofits are also contributing to AI ethics and regulations, increasing the urgency and criticality of this area. In this new context, public services and regulated institutions must ensure responsible AI to avoid biased or inaccurate decision-making. Similarly, companies using AI responsibly can stand out, increase efficiency, and avoid future legal problems. This article highlights the issues and problems that result in many organizations not knowing how to do responsible AI in practice, as they need to identify potential problems, set up safeguards, and conduct ethical impact assessments, among other actions. We present the issues to consider toward a comprehensive approach to responsible AI that should include defining a responsible AI strategy road map; assessing models, processes, and products; and training individuals at different levels. By covering the pressing issues related to the urgent need for adopting responsible AI, we hope to highlight the importance for corporations to seriously consider responsible AI as they rush to adopt this technology for competitive advantage.
Understanding and Fostering Regional Artificial Intelligence Ecosystems: A Case Study in Maine
Harvard Data Science Review · 2023-10-27 · 2 citations
articleOpen accessArtificial intelligence (AI) represents a novel force in both global and regional developments, transcending geographical, industrial, and academic borders.This article presents a case study in surveying AI challenges and opportunities in the state of Maine, in exploring ways to develop its AI ecosystem, and in fostering collaboration and development aligned with its strengths.It also showcases the potential when academics, government, and industry work together.
From Stochastic Parrots to Intelligent Assistants—The Secrets of Data and Human Interventions
IEEE Intelligent Systems · 2023-05-01 · 17 citations
article1st authorCorrespondingGenerative AI is all the rage nowadays—primarily driven by ChatGPT capturing the public imagination and attracting hundreds of millions of users in record time, reaching 100 million users in two months. However, there is much ambiguity from the providers about the technology, the methodology, and the way OpenAI makes it work. This compounds the mystique and speculation. I focus on what we know, with a particular emphasis on the aspects that the makers of ChatGPT avoid discussing with the public—namely, the underlying dependence on much manual intervention in training data curation, data labeling, operational interventions by humans, and reinforcement learning. Unfortunately, despite the criticality of these issues to the scientific community, they are hardly discussed. In this article, I attempt to address some of the issues in the hope of stimulating further studies of these less glorified but critical topics.
The Secrets of Data Science Deployments
IEEE Intelligent Systems · 2022-07-01 · 4 citations
article1st authorCorrespondingMuch attention is paid to data science and machine learning as an effective means for getting value out of data and as a means for dealing with the large amounts of data we are accumulating at companies and organizations. This has gained importance with the major waves of digitization we have seen, especially with the COVID-19 pandemic accelerating digital everything. However, in reality, most machine learning models, despite achieving good technical solutions to predictive problems wind up not being deployed. The reasons for this are many and have their origin in data scientists and machine learning practitioners not paying enough attention to issues of deployment in production. The issues range all the way from establishing trust by business stakeholders and users, to failure to explain why models work and when they do not, to failing to appreciate the importance of establishing a robust quality data pipeline, to ignoring many constraints that apply to deployed models, and finally to a lack of understanding the true cost of production deployment and the associated ROI. We discuss many of these problems and we provide what we believe is a pragmatic approach to getting data science models successfully deployed in working environments.
3rd IADSS Workshop on Data Science Standards - Hiring, Assessing and Upskilling Data Science Talent
Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining · 2022-08-12
article1st authorCorrespondingThe 3rd IADSS Workshop on Data Science Standards follows a tradition of two prior KDD workshops and the initial workshop at ICDM-2018. The theme of the 2022 workshop is: Hiring, Assessing and Upskilling Data Science Talent. Organized by the Initiative for Analytics and Data Science (IADSS.org) at KDD, the workshop provides a platform to discuss industry needs and practices around external and internal talent pipeline development in data science. We aim to provide an understanding of the data science job market, and the critical role of collaboration between academic institutions and industry to meet the increasing need for talent. IADSS conducts ongoing research in this domain. In this workshop, we share detailed findings and observations from this research. In addition to contribution from researchers and industry practitioners through an open call for papers, the workshop features several invited presentations and invited speakers. In order to achieve intended aim, interactive panels will discuss topics of interest and feedback from the workshop will be used to produce post-workshop learnings. This workshop is designed as a half-day working meeting with short talks, invited panels and discussion sessions to plan for future steps in the topic. Post conference, learnings from the workshop will be available at the workshop's home page: https://www.iadss.org/kdd2022
From Unicorn Data Scientist to Key Roles in Data Science: Standardizing Roles
Harvard Data Science Review · 2022-07-28 · 7 citations
articleOpen access1st authorCorrespondingThe lack of an agreed-upon classification of job roles related to data science is causing much confusion that is challenging to the industry, educational sector, and practitioners. Prior work in this area has considered different aspects from different fields or points of view and has shown that more detail is needed in subcategorizing data science professionals. However, other prior work has also shown that avoiding the detailed subcategorization leads to challenging problems, for example, the pursuit of the elusive âdata science unicorns.â In this article, we target a simplification of prior work and an anchor categorization of job roles with clear definitions and expectations from each. We achieve this through analysis of survey results, LinkedIn profiles and job descriptions, and in-depth interviews with managing and hiring executives in data science. We also use our judgment as long-term practitioners and employers of data scientists to provide a practice-guided view of the problem.Our analysis has led to a simplification into three key role families with complementary skills: data analyst, data scientist, and data engineer. We believe this anchor categorization helps resolve several problems, including recruiting, forming, training, managing, and retaining effective data science teams. Although we realize there are and will continue to be many variations of these proposed anchor roles, this simplification is an effective tool to bypass the data science unicorn issue, and it can be used as a basis to establish more specialized or domain-specific roles. The combined skills in these role categories converge on the body of knowledge specification from the Initiative for Analytics and Data Science Standards (IADSS) data science knowledge framework (Fayyad & Hamutcu, 2020). The concise and familiar role categories simplify the problem and decompose it into more solvable subchallenges. We describe the essential knowledge required for each role and how, when, and in what ways it can be varied and extended. This description helps align expectations and serves as a step to tackle the pressing issue of training, evaluating, and building effective data science teams.
The Attention Economy and the Impact of Artificial Intelligence
Springer eBooks · 2021 · 22 citations
Senior authorCorresponding- Computer Science
- Sociology
- Business
Abstract The growing ubiquity of the Internet and the information overload created a new economy at the end of the twentieth century: the economy of attention. While difficult to size, we know that it exceeds proxies such as the global online advertising market which is now over $300 billion with a reach of 60% of the world population. A discussion of the attention economy naturally leads to the data economy and collecting data from large-scale interactions with consumers. We discuss the impact of AI in this setting, particularly of biased data, unfair algorithms, and a user-machine feedback loop tainted by digital manipulation and the cognitive biases of users. The impact includes loss of privacy, unfair digital markets, and many ethical implications that affect society as a whole. The goal is to outline that a new science for understanding, valuing, and responsibly navigating and benefiting from attention and data is much needed.
How Can We Train Data Scientists When We Can’t Agree on Who They Are?
Harvard Data Science Review · 2021-02-25 · 4 citations
articleOpen access1st authorCorrespondingHow Can We Train Data Scientists When We Can't Agree on Who They Are? 2
Statistical Analysis and Data Mining The ASA Data Science Journal · 2020-11-02
paratextOpen accessStatistical Analysis and Data Mining The ASA Data Science Journal · 2020-03-10
paratextOpen access
Frequent coauthors
- 82 shared
Geoffrey I. Webb
- 81 shared
Ee‐Peng Lim
Singapore Management University
- 81 shared
Hervé Marti
National Institute of Informatics
- 81 shared
Gabriella Pasi
- 81 shared
Xin Yao
- 81 shared
Ravi Kumar
- 32 shared
S. G. Djorgovski
California Institute of Technology
- 28 shared
N. Weir
Argonne National Laboratory
Education
- 1994
Ph.D., Computer Science
University of Texas at Austin
- 1990
M.S., Computer Science
University of Texas at Austin
- 1987
B.S., Computer Science
American University of Beirut
Awards & honors
- ACM SIGKDD Innovation Award
- ACM fellowship
- US Government Medal from NASA
- Resume-aware match score
- Save to shortlist
- AI-drafted outreach
See your match with Usama Fayyad
PhdFit ranks faculty by your research interests, methods, and publications — grounded in their actual work, not templates.
- Free to start
- No credit card
- 30-second signup