3. IntegratedML in InterSystems IRIS (Thomas Dyar)
In this episode, we chat with Thomas Dyar, product manager for machine learning and AI, about IntegratedML in InterSystems IRIS – the feature coming this spring that will enable SQL developers building applications to leverage machine learning directly within the SQL environment of InterSystems IRIS. In our discussion, Thomas tells us how he first got interested in machine learning, some of the most important uses of machine learning in the world today, and how InterSystems IRIS is taking the next step to unlock these capabilities for all developers.
To reach out to Thomas Dyar about IntegratedML, you can send him an email at email@example.com.
Derek Robinson 00:01 Welcome to Data Points, a podcast by InterSystems Learning Services. Make sure to subscribe to the podcast on your favorite podcast app such as Spotify, Apple Podcasts, Google Play, or Stitcher. You can do this by searching for Data Points and hitting that subscribe button. My name is Derek Robinson, and on today's episode, I'll chat with Thomas Dyar, one of the product specialists at InterSystems, about IntegratedML in InterSystems IRIS.
Derek Robinson 00:38 Welcome to Episode 3 of Data Points by InterSystems Learning Services. My name is Derek Robinson. As I've been mentioning in our first few episodes, we're excited about the launch of this podcast, and we've already released two other episodes along with this one for you to check out. In this episode, I'll be talking about machine learning and IntegratedML with Thomas Dyar. Thomas is a product specialist here at InterSystems, focused on the area of machine learning. In the interview, we're going to start with some interesting perspective on machine learning in general, and then segue into one of the exciting new features that's coming to InterSystems IRIS data platform, and that's IntegratedML. IntegratedML is really built for the SQL developer who wants to incorporate machine learning into their application, but they may not have the resources at their disposal to do it the traditional way. I'll leave the more thorough explanation to the expert. So here's my interview with Tom.
Derek Robinson 1:31 All right, and welcome to the podcast Thomas Dyar, one of our product specialists here at InterSystems. Thomas, how's it going?
Thomas Dyar 01:37 Very good. Thanks, Derek.
Derek Robinson 01:38 Yeah, so, today we're going to be talking about machine learning, which is one of your areas of expertise both in your career, I think, and here at InterSystems with the products that you oversee. So let's dive right in and get started. But before we get into the product, let's talk about machine learning in general. It's really a very popular new topic that's been emerging in the last few years as something that's really, really relevant. What is machine learning to you and how did you first get interested in it?
Thomas Dyar 2:05 So machine learning is essentially taking whatever data that you have and asking a computer to figure out what it is about that data that's interesting. And that also could be relevant to what you want to do with your application. So I got interested in machine learning in high school, and really what I was interested in at that time was how the brain worked. And being able to model the brain in a way that you could take the crazy complexity of all the neurons and distill that down into a mathematical representation of how those neurons would work, and make it possible for a human to think, was just extremely interesting. And as I got into actually trying to build those kinds of models, I realized that there were a lot of applications that you could actually build that would be useful to humans.
Derek Robinson 03:05 Yeah. Interesting. So, you know, to be honest, I hadn't actually taken that perspective on it, but it's kind of cool to frame it that way. So as you've gotten more into it, what are some of the most important and kind of significant, maybe life- changing, applications that you see in the world today that involve machine learning?
Thomas Dyar 03:21 So I'd say one of the most exciting areas is healthcare. Healthcare is a realm where we've spent a lot of time last 10 to 20 years taking all of the data from doctors and putting it into computers. Now you have a mountain of data and you have extremely life-altering problems to apply that to, whether you want to learn how to treat someone better or avoid some kind of a bad outcome, say a complication from surgery, you now have a huge amount of data that you could look at and decide whether or not…what you should do with that.
Derek Robinson 04:06 Right. So yeah, I think healthcare is really a well-known one…with machine learning. And I think what you brought up there is really important, not just from how well suited it can be for that amount of data, but also how significant it is, in that it has an impact on lives. Any non-healthcare examples come to mind that really jump out at you as good use cases? Because I think in some of our previous episodes, we've talked about how sometimes people see InterSystems and they associate healthcare IT because they see our logo, and they see how big of a player we are in healthcare. But for the people that aren't in healthcare, there's a lot that they can leverage on our technology stack as well. Any good ones come to mind as far as really cool machine learning applications that aren't in the healthcare space that you've seen?
Thomas Dyar 04:48 So definitely in the area of logistics is another application field where InterSystems technology is used. And there you have kind of optimization problems. Things like, well, you have a shipping company that needs to optimize how they move their ships and their infrastructure around to be able to meet the customer demands in real time, be able to lower the costs. There, machine learning is also very applicable. You can throw a bunch of data at it and determine the best place to put a ship or put a box, and be able to get it to the customer in the optimal way.
Derek Robinson 05:33 Right, interesting. Yeah. And so the more you think about it, the more it opens your mind to even more examples that can really take that data and use it in an effective way. So let's transition this into our InterSystems stack, right? One of the new features that's coming out for InterSystems IRIS is IntegratedML. That's one of the products we mentioned that you are managing and overseeing here at InterSystems. Tell us what IntegratedML is, who it's for, and what it can do for those developers.
Thomas Dyar 06:01 So IntegratedML is a new capability that we're placing within our SQL environment. It's an all-SQL feature that is turnkey machine learning. (You) don't need to install anything. It's going to become, out of the box, able to be applied to your problems. And then it's going to bring the best-of-breed machine learning frameworks right into your SQL environment. So an application developer that knows the data that knows SQL is going to be able to train models and then be able to use those models to make predictions and make their application smarter.
Derek Robinson 06:41 Interesting. Okay. So that's a cool kind of brief explanation of what it does. And I think in the little amount that I've kind of been looking at this stuff and kind of seeing it from the curious perspective, looking at it more closely, should these developers think that it's just instant magic, right? Like is it just kind of like flip a switch and all of a sudden you don't need your data scientists anymore? Or is it more of a thing to kind of open the door for them and kind of get them along the right path, let them taste it a little bit and kind of get you primed to be able to go even further later? Well, how would you frame it for those developers?
Thomas Dyar 07:14 Yeah, it's definitely not to replace data scientists. There's a lot of heavy-duty statistics and math, and also judgment that's required in determining where machine learning is useful and not. But as far as the ability to get a read on whether or not your data is good for machine learning, IntegratedML is a great place to start and will get you the basics of a machine learning model and also focus on being able to use the predictions from a machine learning model right in right in an application. So instead of spending all your time trying to install some framework, learn Python, figure out how to go between Python and SQL and all those things that are necessary to actually put a machine learning model into production, IntegratedML provides all the plumbing and makes that process a lot easier. So you can focus on your data, focus on the problem, and let the computer do what it's good at, which is putting it all together.
Derek Robinson 08:24 Right, right. Interesting. So it kind of sounds like for the developer, like you said, that knows their data, that knows how to use SQL, that knows how to…is really focused on those things in their application, this lets them take this buzzword that they've been hearing all over all over the industry, right—machine learning—and being able to apply some models and create some models and train those and really effectively use them on their data. So for some of the audience that might have a little bit more expertise in machine learning and maybe on InterSystems IRIS data platform as well as maybe legacy products, have used the previously existing architecture, which could be the Spark connector and loading PMML files for your machine learning models, how does that kind of play together with the new vision of IntegratedML and InterSystems IRIS? And what would be the message to the people that do have some experience with machine learning in that context and now seeing IntegratedML come into the picture?
Thomas Dyar 09:21 So the focus of IntegratedML is really on the SQL experience and the SQL developer, so it is really there to kind of streamline the process and also bring these frameworks into the development process and make that really easy, focus on deployment. What we've had before is more of a feature set for experts. So our Spark connector was good for users that already had either a Spark cluster or experience with Spark and wanted to then take the models that they would build in that environment and easily deploy those. Whereas IntegratedML, you don't need to start out with a Spark cluster. You don't have to have a huge infrastructure there. It's going to provide that on- ramp to machine learning much more quickly.
Derek Robinson 10:18 Right. Kind of an easier way for these developers to access this, and removing barriers that previously existed to really implement machine learning into their applications. That's really cool. So kind of wrapping up and summarizing a little bit, what do you see as the future of IntegratedML? Now I know you've kind of given us an example of it. When can developers maybe look to hopefully try to see more coming out about this, and like anything you kind of see as the vision, and what if someone right now listening to this is interested in what you're saying and kind of really, it's kind of intriguing them as an SQL developer, what's their next step, and what should they look for in the future?
Thomas Dyar 10:53 Well, it's coming out in the spring, and anybody that's interested can get on the Developer Community and look for content there, ask questions there. And if you don't find what you're looking for, feel free to email me. My email is going to be in the description of this podcast, and I'd love to hear from you and find out any feedback you have, and any questions I'll try to answer.
Derek Robinson 11:16 Absolutely. So, Tom Dyar, thank you so much for joining us, and we'll see you next time.
Thomas Dyar Thank you very much, Derek. This was fun.
Derek Robinson 11:25 Thanks again to Tom for sitting down with us. I really liked his perspective at the beginning on what got him into machine learning and kind of how powerful it can be. It brings into context just how amazing thinking and the human brain really is and how it all works, and really highlights how impactful some of the advances in technology can be…that can bring us closer to replicating that magic. On the IntegratedML front, there'll be much more content in the coming months for you to check out. As Tom mentioned this spring, it'll be released for general availability with InterSystems IRIS. In the interim, if you're super eager to learn more right away, shoot Tom an email. We put his email in the description of this podcast, but otherwise, definitely leverage the Developer Community, community.intersystems.com, and browse our learning content at learning.intersystems.com to learn more about machine learning features and InterSystems IRIS features and kind of how it all can fit together and kind of pique your imagination of what you might do with it down the road. I'm sure there are users on the Developer Community that would love to chime in with their thoughts as well. So hopefully you enjoyed Episode 3 and remember, make sure to find us on your favorite podcast app and hit that subscribe button. Thanks for listening, and we'll see you next time on Data Points.
Brought to you by InterSystems Learning Services of Data Points