Post by slh1234 on Feb 19, 2023 15:54:03 GMT
Not distracting from Ratty’s post nor trying to minimize anything, but if you have been reading my explanations carefully, you probably have several legitimate questions about the process I’ve begun to describe. This also answers whether a model is just the work of “some woke programmer.”
Let’s take the example of the malaria model/solution. There are actually several models in this total solution, and I didn’t develop all of them. Some are well known. I also am NOT a medical professional, nor am I, in any way, the subject matter expert on malaria. There may be professionals with skills that span both the medical research and data areas, but generally, it is considered more efficient to have separate individuals with separate areas of expertise approach complex problems like this.
First of all, medical researchers develop the questions and problem statements such as “detect malaria in images of red blood cells.” Medical professionals also define what is acceptable and where the greater risk lies. So, for example, the statement may read “detect malaria in red blood cells with greater than 95% accuracy overall, and greater than 99% accuracy in detecting positive cases.” In a case like this, it means we must have less than 1% false negatives, but can tolerate a much greater number of false positives, so long as we’re above a threshold that makes it worthwhile to not just treat everybody. The data scientist(s) do not participate in establishing the criteria other than encouraging these folks to express the question in a form that can be solved with ML/AI.
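To make those acceptance criteria concrete, here is a minimal sketch (with made-up numbers) of the two metrics involved: overall accuracy for the 95% requirement, and sensitivity (the fraction of true positives caught) for the 99% requirement:

```python
# Sketch of the acceptance criteria as metrics. Labels: 1 = parasitized,
# 0 = uninfected. Numbers below are illustrative, not from the project.

def evaluate(y_true, y_pred):
    """Return (overall accuracy, sensitivity) for binary labels."""
    assert len(y_true) == len(y_pred)
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    positives = [(t, p) for t, p in zip(y_true, y_pred) if t == 1]
    true_pos = sum(p == 1 for _, p in positives)
    return correct / len(y_true), true_pos / len(positives)

def meets_criteria(accuracy, sensitivity):
    # >95% overall accuracy AND >99% sensitivity (i.e. <1% false negatives)
    return accuracy > 0.95 and sensitivity > 0.99

# One false positive out of eight cases: sensitivity is perfect, but
# overall accuracy (0.875) fails the 95% bar.
acc, sens = evaluate([1, 1, 1, 1, 0, 0, 0, 0], [1, 1, 1, 1, 1, 0, 0, 0])
```

Note that the two thresholds are independent: a model can pass one and fail the other, which is exactly what happened later in the experimentation.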
When I started this project, I had no idea what malaria looked like in blood cell imaging. The medical professionals took me through images and this let me see that there are clearly defined edges and color contrasts. I was looking for this because there are specific numeric representations associated with edges and contrasts in digital images. I was also provided with a data set with over 6,000 examples of red blood cell images that contain malaria, and over 6,000 examples of red blood cells that were not parasitized. This was my labeled data set, so this becomes a supervised learning experiment. At this point, I give my opinion to the medical SMEs that I think the problem is solvable using AI. (Actually, there are many sets of data available online to do this, and I’m not the first person to take on such a problem, but we needed this for a specific application, so I needed to do this for that particular application). So now comes my part of the experimentation, and when I say “my part,” I don’t mean to imply that I was the only person involved in the experimentation.
So we express a null hypothesis: that we cannot predict with greater than 95% accuracy whether or not cells have the malaria parasite, or that we cannot find greater than 99% of all positive cases. The alternative hypothesis is that we can – the opposite of the null hypothesis. We never say we have “proven” anything; rather, we use probabilities to decide whether we have invalidated the null hypothesis or not, and based on this, decisions can be made on whether or not a model can be useful.
I don’t start by just feeding images into candidate models. Instead, I need to recognize that images may be .png or .jpg or other formats, they may be different sizes, and it’s possible an image contains several red blood cells, or maybe no red blood cell at all. I need to start by prepping the data. In this case, I need to choose one specific image type, and for this, I chose .png. So the first thing that needs to be done is to ensure that the file is in this format. OpenCV already has functions that will convert other image types into .png. I start off trying not to reinvent any wheels, and in experimentation we will learn whether this is okay or not, but the first step is converting to .png. Next, I need to ensure the image contains only the area of interest, and once again, OpenCV already had functions that allow me to center images on the area of interest. So there are two pre-built tools already used in processing.
Next, I need to ensure the images are all the same size and scale because I intend to use a convolutional neural network (CNN) to take on the process (I’ll explain that in a bit). I also use OpenCV to standardize the size. Now, I need to convert each .png image into a 3-dimensional array of integers representing the height, width, and color channels of each pixel in the image. This is the actual data that the CNN will see and operate on.
As I get ready to try to train models, I have to separate the data randomly into a training and test set. In my case, I used 80% of the data for training, ensuring I had about the same number of positive and negative cases in my data set, then saved 20% for a test data set – data that will never be seen by the algorithm/model during training.
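A minimal sketch of that stratified 80/20 split, shuffling each class separately so both splits keep roughly equal positive and negative counts (in practice a library helper such as scikit-learn's `train_test_split` with `stratify` does the same job):

```python
import random

def stratified_split(items, labels, train_frac=0.8, seed=42):
    """Split (item, label) pairs so each class keeps the same train/test ratio."""
    rng = random.Random(seed)
    train, test = [], []
    for cls in set(labels):
        cls_items = [x for x, y in zip(items, labels) if y == cls]
        rng.shuffle(cls_items)                 # randomize within the class
        cut = int(len(cls_items) * train_frac)
        train += [(x, cls) for x in cls_items[:cut]]
        test += [(x, cls) for x in cls_items[cut:]]
    rng.shuffle(train)  # mix the classes back together
    rng.shuffle(test)
    return train, test

# 50 negatives + 50 positives -> 80 training pairs, 20 held-out test pairs,
# each split half positive and half negative.
train, test = stratified_split(list(range(100)), [0] * 50 + [1] * 50)
```

The held-out 20% is never touched during training; it only comes back at the validation step.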
The nice thing about a neural network in supervised training is that it is able to evaluate the patterns without the data scientist actually doing that mundane work. It simply tries weightings and combinations, compares with similar patterns it has detected, then compares with the label of parasitized or uninfected (which we actually just express as 0 or 1 for such binary classification problems). After a trip through the entire training set, it will assess its accuracy, then make additional trips through the data (epochs), each time adjusting based on whether it is getting better or worse than the previous epoch. There are several hyperparameters that affect training, such as learning rate (the amount to change weightings in each epoch – a discussion in and of itself). There is also a concern that too many epochs can cause overfitting, so there are two metrics we watch and evaluate to try to prevent this. We typically set a high number of epochs, but set early-exit policies in the training process so that when it starts trending the wrong direction for too many epochs (a number we define), we terminate training and present the best model as a candidate to be tested.
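The early-exit policy described above can be sketched as a plain loop (frameworks offer this ready-made, e.g. Keras's `EarlyStopping` callback; here `run_epoch` is a stand-in for one pass over the training data plus a validation check):

```python
def train_with_early_stopping(run_epoch, max_epochs=100, patience=5):
    """Stop after `patience` epochs without improvement; keep the best epoch.

    run_epoch(epoch) -> validation accuracy for that epoch (hypothetical hook).
    """
    best_acc, best_epoch = -1.0, -1
    bad_epochs = 0
    for epoch in range(max_epochs):
        acc = run_epoch(epoch)
        if acc > best_acc:
            best_acc, best_epoch = acc, epoch   # new best candidate model
            bad_epochs = 0
        else:
            bad_epochs += 1
            if bad_epochs >= patience:
                break  # trending the wrong direction for too long
    return best_epoch, best_acc

# Simulated validation history: improves to 0.7, then degrades for 5 epochs,
# so training stops before the (unreachable) 0.9 and returns epoch 2.
history = [0.5, 0.6, 0.7, 0.65, 0.64, 0.63, 0.62, 0.61, 0.9]
best_epoch, best_acc = train_with_early_stopping(
    lambda e: history[e], max_epochs=len(history), patience=5)
```

The "patience" value is exactly the "too many epochs, which we define" knob mentioned above.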
A convolutional neural network is used for searching for patterns in arrays such as the 3-dimensional arrays that the .png images are converted into. A “convolution” means it takes a certain subset, such as a certain height and width, searches that, then moves a pre-set distance (the stride) to the side and searches the new area. The convolutional areas should overlap to ensure that no part of the image is left unexamined.
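Here is a bare-bones sketch of that sliding-window operation on a single 2-D channel (real CNN layers do this over all channels at once, with many learned kernels, and far more efficiently):

```python
import numpy as np

def conv2d(image: np.ndarray, kernel: np.ndarray, stride: int = 1) -> np.ndarray:
    """Slide `kernel` across `image` with the given stride.

    A stride smaller than the kernel size makes neighboring windows overlap,
    so no part of the image is left unexamined.
    """
    kh, kw = kernel.shape
    h, w = image.shape
    out_h = (h - kh) // stride + 1
    out_w = (w - kw) // stride + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            window = image[i * stride:i * stride + kh,
                           j * stride:j * stride + kw]
            out[i, j] = np.sum(window * kernel)  # one convolved value
    return out

# A 2x2 kernel of ones over a 4x4 image of ones: every window sums to 4.
overlapping = conv2d(np.ones((4, 4)), np.ones((2, 2)), stride=1)  # 3x3 output
non_overlap = conv2d(np.ones((4, 4)), np.ones((2, 2)), stride=2)  # 2x2 output
```

In a trained network the kernel values are learned weights, which is how the edge and contrast patterns the medical professionals pointed out become detectable.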
The process of training is also called “fitting,” and here, we take the candidate model that best fits the training data and move to validation or testing. In this, we call “predict” on our candidate model with the test data, and we examine how well the model performs with data it did not see during training. We gather the statistics on this and see the accuracy rates for positive and negative cases based on the outcomes we already know – this is supervised learning. If (and only if) this indicates we have met the acceptance criteria, we also need to figure the probability of this just being a statistical anomaly. When we are above the acceptance criteria, we can move on, but if not, more experimentation is needed.
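One common way to frame that "statistical anomaly" check (a simplification of what a full analysis would do) is a binomial tail probability: if the model's true accuracy were only at the null-hypothesis level, how likely is a test-set score at least this good by chance?

```python
from math import comb

def binom_tail(n: int, k: int, p: float) -> float:
    """P(X >= k) for X ~ Binomial(n, p): the chance of at least k successes
    out of n trials when each trial succeeds with probability p."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k, n + 1))

# Sanity checks: at least 0 successes is certain; exactly all 10 successes
# at p = 0.5 has probability 0.5**10.
p_all = binom_tail(10, 10, 0.5)
p_any = binom_tail(10, 0, 0.5)
```

With the test-set size `n`, the observed number of correct predictions `k`, and the null accuracy as `p`, a very small tail probability is what justifies rejecting the null hypothesis rather than calling the result a fluke.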
In this particular case, I started out with a CNN with 2 levels of evaluation. It gave about 80% accuracy, which was encouraging evidence that the problem is solvable, but falls far short of the acceptance criteria. From there, I changed the structure of the CNN to have 4 levels, and then I got nearly 95% accuracy. I still needed to see whether I could improve that, so I changed the design of the CNN to have 6 levels of evaluation, and then I got unmistakably above the acceptance criteria for overall accuracy, but the results showed I needed to improve on the number of false negatives. I first tried going to 8 levels of evaluation, but this didn’t significantly improve model performance, and since it is significantly more expensive in terms of compute, I decided to try another approach.
In the images, I could sometimes see background noise that seemed to me to be impacting overall accuracy, so I needed to try different approaches to minimize its effects. I tried using functions in OpenCV to enhance the images using HSV (Hue, Saturation, and Value), and I had to train a new model to interpret the HSV-enhanced images. I found it to perform about the same as the model trained on raw images. I also used OpenCV to perform Gaussian blurring on the images, which required another model to be trained; once again, it performed about the same as the model on raw images. Finally, I used OpenCV to convert the images to grayscale and tried to train a model on the grayscale images, but performance was very poor, so I determined that grayscale was not useful.
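For the curious, here is a hand-rolled sketch of what the Gaussian blur preprocessing computes (the project used OpenCV's `cv2.GaussianBlur`; this version shows the mechanics on one channel):

```python
import numpy as np

def gaussian_kernel(size: int = 5, sigma: float = 1.0) -> np.ndarray:
    """Build a normalized 2-D Gaussian weighting kernel."""
    ax = np.arange(size) - size // 2
    xx, yy = np.meshgrid(ax, ax)
    k = np.exp(-(xx**2 + yy**2) / (2 * sigma**2))
    return k / k.sum()  # weights sum to 1, so overall brightness is preserved

def blur_channel(channel: np.ndarray, kernel: np.ndarray) -> np.ndarray:
    """Same-size blur of one 2-D channel: each pixel becomes a Gaussian-weighted
    average of its neighborhood, smoothing out background noise."""
    pad = kernel.shape[0] // 2
    padded = np.pad(channel, pad, mode="edge")
    out = np.empty(channel.shape, dtype=float)
    h, w = channel.shape
    for i in range(h):
        for j in range(w):
            window = padded[i:i + kernel.shape[0], j:j + kernel.shape[1]]
            out[i, j] = np.sum(window * kernel)
    return out

kernel = gaussian_kernel()
flat = blur_channel(np.full((8, 8), 10.0), kernel)  # constant stays constant
```

Because the kernel is normalized, flat regions are untouched while isolated noisy pixels get averaged into their surroundings.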
Looking at the output of the 3 models that were useful, although they gave about the same accuracy, the set of misidentified images was not the same among the three. Seeing this, I tried two different approaches: run the image (converted where necessary) through all 3 models and take a vote, where 2 out of 3 makes the final determination of the prediction; and another approach where a prediction of “parasitized” from any of the 3 resulted in a final prediction of “parasitized,” and only images where all 3 models predicted “uninfected” would result in a final prediction of “uninfected.” This is the rules-based portion of the process that I say is sometimes involved in the final determination of AI processes.
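The two ensemble rules are simple enough to write down directly (1 = parasitized, 0 = uninfected):

```python
def majority_vote(preds):
    """2-out-of-3 vote over the three models' predictions."""
    return 1 if sum(preds) >= 2 else 0

def any_positive(preds):
    """'Parasitized' if ANY model says so; 'uninfected' only if all agree."""
    return 1 if any(preds) else 0

# The rules diverge exactly when the models disagree 1-vs-2:
split_case = [1, 0, 0]
```

The any-positive rule trades false positives for fewer false negatives, which is why it lines up better with the "less than 1% false negatives" criterion.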
The outcome of the experiment is that taking a vote resulted in the fewest errors overall, but it did so by minimizing the number of false positives, and didn’t decrease the number of false negatives that much. The approach of “any one of the models predicting ‘parasitized’ gives a final result of ‘parasitized’” brought the number of false negatives down to meet the acceptance criteria, so the decision was made to follow this approach.
Another step of validation was required, and for this, we took several steps. First, the data scientist’s step was to take the same structure of CNN, but divide the training and testing data differently and run through the training and testing steps again. This gave us increased confidence that our approach was not producing a statistical anomaly, and that we should see consistent results with more general data. Once this was determined, we used additional images provided to test the model/approach and ensure we stayed consistently within the acceptance criteria. Medical professionals (generalized to “Subject Matter Experts”) are involved again in this step to agree that we are, or are not, meeting the criteria.
Once the models are approved, we need to operationalize. For this, a front-end web service is built that can convert the images to the format needed, stringify each image, and submit it to a web endpoint containing the “scoring script,” which does the work of HSV enhancement, Gaussian blurring, calling the models, going through the rules-based step of testing whether any of the models gave a prediction of “parasitized,” and returning the prediction in a human-readable form. This means that medical professionals can concern themselves with medical tasks instead of needing to learn image manipulation, etc. They simply upload a set of images, and each image is returned with the prediction of “uninfected” or “parasitized,” and they can use the combination of image and label in their final diagnosis.
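The heart of that scoring script can be sketched as a single function. Everything here is hypothetical scaffolding (the model and transform arguments are stand-ins, not the project's actual code); it only illustrates how the preprocessing, the three models, and the any-positive rule chain together behind one endpoint:

```python
def score(image, raw_model, hsv_model, blur_model, to_hsv, blur):
    """Run one image through all three models and apply the any-positive rule.

    raw_model/hsv_model/blur_model: callables returning 1 (parasitized) or 0.
    to_hsv/blur: the preprocessing transforms each model was trained on.
    """
    preds = [
        raw_model(image),           # model trained on raw images
        hsv_model(to_hsv(image)),   # model trained on HSV-enhanced images
        blur_model(blur(image)),    # model trained on Gaussian-blurred images
    ]
    # Rules-based step: any single positive vote decides the final label.
    return "parasitized" if any(p == 1 for p in preds) else "uninfected"

# Toy stand-ins: identity transforms, fixed-output "models".
identity = lambda x: x
flagged = score("img", lambda x: 0, lambda x: 0, lambda x: 1, identity, identity)
clean = score("img", lambda x: 0, lambda x: 0, lambda x: 0, identity, identity)
```

Wrapping this behind a web endpoint is what lets the medical staff upload images and get back labels without touching any of the machinery.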
That is a condensed version of a single example of Machine Learning used in AI, and how the resulting model was put to use. I think from that illustration you can see that concerns like “some programmer” really show a lack of understanding of the model development and operationalization process. But also note that this only takes it through initial deployment, and doesn’t take into account the CI/CD going forward from that point. The process does produce models that are useful, and this is just one of many examples in the world around you. Translation models, natural language models, evaluations of diabetes risk, financial projections, market projections, etc. all have many AI models actually in use.