11/10/2023

Using ChatGPT for Advanced Data Analysis - An example of the field of AM-Smart methods (approachable modelling and smart methods)

 

As some readers may recall, my colleague CoreySchimpf and I published an article in 2022 identifying a new area of methods development, called smart and approachable methods or AM-Smart for short. CLICK HERE for an open access copy of the article at IJSRM.

 

What are AM-Smart methods?

AM-Smart methods emerged over the last several years out of the intersection of smart technology, human-computer interface, educational design and interdisciplinary methods. AM-Smart platforms address a major challenge for applied and public sector analysts, educators and those trained in traditional methods: accessing the latest advances in interdisciplinary (particularly computational) methods.

 

AM-Smart platforms do so through nine design features. They are (1) bespoke tools that (2) involve a single or small network of interrelated (mostly computational) methods. They also (3) embed distributed expertise, (4) scaffold methods use, (5) provide rapid and formative feedback, (6) leverage visual reasoning, (7) enable productive failure, and (8) promote user-driven inquiry; all while (9) counting as rigorous and reliable tools.

 

Examples include R-shiny programs, computational modeling and statistical apps, public-sector data management platforms, data visualization tools, and smart phone apps. 

 

 

What about ChatGPT for advanced data analysis?

What we had not thought about was that advances such as ChatGPT also constituted a form of AM-Smart methods. That is until today, when I saw an advertisement for a 2-day instats workshop by Peter Gruber, titled, appropriately enough, UsingChatGPT for Advanced Data Analysis. Here is the brief description of the workshop:

 

This 2-day workshop teaches researchers, from PhD students to professors, how to use ChatGPT and its Advanced Data Analytics tool for statistical analysis without writing a line of code or even knowing how to use a statistics program. The seminar covers a range of topics from data preparation and descriptive statistics to regression analysis, advanced statistical tests and visualisation. Special emphasis is placed on understanding the workings and limits of AI models such as ChatGPT and reflecting on its implications for data analysis.

 

In addition to the description of the workshop, what caught my eye was the instats description of Gruber’s work around teaching methods.

 

Dr Gruber has been an early adopter of Large Language Models for teaching and research and leads the efforts of USI to integrate LLMs into the curriculum. He has been teaching statistics, financial econometrics and numerical methods with MATLAB, Python and R since 2005 at USI, University of St. Gallen, University of Geneva and Bocconi.


 This was insightful, because it suggested the approach that is very much driving the AM-Smart movement. Which is super cool.

 

This is not the only workshop!!!

To lean more, I started exploring on Google if other such workshops or courses were being offered. They are being offered! Lots of them. A whole new field of study and pedagogy seems to have emerged over night! Digital sociology and data science have a lot to think about!

 

Which led me to next ask, What is ChatGPT for advanced data analysis?

 

 

What is ChatGPT for advanced data analysis?

The best I can understand is that in August 2023, ChatGPT recently launched a new business version called ChatGPT Enterprise. Part of the package is the revamping of what used to be called CodeInterpreter – one of the quickly expanding list of Plugins being developed for ChatGPT – into its new version called Advanced Data Analysis – click here for a nice summary.

 

 

 

Let’s not get too excited!!!!

As Corey and I outlined in our articleon AM-Smart methods, while offering new possibilities for data analysis, critical reflection on AM-Smart platforms reveals considerable unevenness in these design features, which hamper their effectiveness.

 

A rigorous research agenda is vital.

 

A case in point is ChatGPT Advanced Data Analysis.

·       First, while currently free, it is not clear if it will not stay that way for long. This is potentially a major issue and goes against the open-access environment of current science and programming platforms such as R.

·       Second, it is not a bespoke suite of methods. It is unclear what sorts of algorithms are used in the background to do the work. What are the limits of these methods?

·       Third, it is not as user-friendly as it suggests. Part of the whole point of AM-Smart methods is ease of access. For example, there are limits to what data can be used and how it can be used. I am sure these issues will be addressed going forward, but presently the plug-in seems to be very much at the ground level.

·       There is still a significant learning curve to using it, including getting the plug-in to work.

·       Then there is the issue of data safety and access globally!

 

Then there is the question of pedagogy and value for teaching. It is key to have methods experts such as Peter Gruber involved in the development of AM-Smart methods, such as ChatGPT.  There are so many questions to explore. Is this a good way to learn statistics? Is fast statistics good statistics? What about gaming? Is that a good way to use methods? For example, I am increasingly worried about students not learning the math behind the methods they use, in the case of numerical analysis, and, in turn, the craft of qualitative inquiry, as in the case of ethnography.

 

Conclusion?

My short conclusion is that ChatGPT is a good example of AM-Smart methods but comes with lots of concerns that require the methods community to engage with it before endorsing it.

 

I do not make this point nonchalantly, as these sorts of smart tools are not the necessary future of methods. There is no inevitability here. Instead, there is lots of room for creative but critical interrogation and development, both for educational purposes and for the wider usage of methods in research, policy and practice.

 

 

 

 

 

04/10/2023

Actearly Methods Club: The COMPLEX-IT Platform and Smart Methods at University of York

Much thanks to Philip Garnett and Liina Mansukoski for the chance to present at the ActEarly Methods Club at University of York. (ActEarly focusses on early life changes to improve the health and opportunities for children living in areas with high levels of child poverty; Bradford, West Yorkshire and Tower Hamlets, London.)

 


 The COMPLEX-IT Platform and the New Field of Approachable Modelling and Smart Methods

 

 Abstract:

Smart platforms constitute an entirely new field of methods that are still in the emergent stage, requiring critical engagement and also the real potential to develop a new methodological toolkit for social and health inquiry. Smart platforms are comprised of bespoke tools that facilitate user-driven learning by building expertise into the platform to create an intuitive, supportive, and open-ended environment for complex social and health inquiry. Unlike statistical platforms, AM-Smart platforms focus on a single technique or small network of interrelated (mostly computational) methods, which help users engage new methods. Smart platforms provide method-specific operational scaffolding, rapid and formative feedback, and which requires modest technical skill while being rigorous and reliable. This session will introduce this newly emerging field by exploring the AM-Smart platform, COMPLEX-IT.

 

Bio:

Brian is Director of the Research Methods Centre and Director of the Wolfson Research Institute for Health and Wellbeing at Durham University, UK. He is also editor of the Routledge Complexity in Social Science series, CO-I for the Centre for the Evaluation of Complexity Across the Nexus, and a Fellow of the UK National Academy of Social Sciences. Brian also runs InSPIRE, a UK policy and research consortium for mitigating the impact places have on air quality, dementia and brain health across the life course. Brian and his colleagues have spent the past ten years developing a new case-based, data mining approach to modelling complex social systems and social complexity, called COMPLEX-IT, which they have used to help researchers, policy evaluators, and public sector organisations address a variety of complex public health issues. Staff Profile

 

CLICK HERE for a link to a PDF of my presentation

 

CLICK HERE for a link to our paper on AM-Smart Methods

 

CLICK HERE for a link to COMPLEX-IT