December 21st, 2017, 12:01 am
Thanks! €20k is a nice incentive he has organized.
Cool contest!
It looks like these may all be {given X(t), predict X(t+)} time series rather than {given [X, Y], predict Y[i+] from X[i+]} data sets?
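For what it's worth, here's a minimal numpy sketch of the difference between the two framings (the array values and the one-step horizon are made-up stand-ins, not anything from the actual competition data):

    # Univariate framing: predict a series' next value from its own past.
    import numpy as np

    x = np.array([1.0, 1.2, 1.5, 1.9, 2.4])   # stand-in for one series
    inputs, targets = x[:-1], x[1:]           # {given X(t), predict X(t+)}

    # Exogenous framing: predict a separate target series Y from X.
    y = np.array([0.3, 0.5, 0.4, 0.8, 0.9])   # stand-in target series
    features, labels = x[:-1], y[1:]          # {given [X, Y], predict Y[i+] from X[i+]}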
For M3 they have data on various timescales (annual, quarterly, monthly, ...) and sometimes as little as a dozen data points. I'm not sure if you're allowed to correlate one point in time between series (or additional series you can add yourself)?
Why not? Wouldn't most ML approaches automagically notice correlation patterns among the data series and exploit them? If the ML system shares state between its analyses of multiple series, then it is implicitly using more than each data series in isolation.
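As a rough illustration of that "shared state" point, here's a sketch where a single model is fit on lag windows pooled from many series, so whatever it learns from one series carries over to predictions on the others. The random-walk series, the window length, and the Ridge regressor are all just assumptions for illustration:

    import numpy as np
    from sklearn.linear_model import Ridge

    # Stand-in collection of series (random walks in place of real data).
    series = [np.cumsum(np.random.randn(60)) for _ in range(100)]
    window = 12

    # Pool lag windows across ALL series into one training set.
    X, y = [], []
    for s in series:
        for i in range(len(s) - window):
            X.append(s[i:i + window])
            y.append(s[i + window])

    # One shared set of weights; patterns from any series inform every forecast.
    model = Ridge().fit(np.array(X), np.array(y))
    next_val = model.predict(series[0][-window:].reshape(1, -1))

A per-series model trained in isolation couldn't do this; pooling is the simplest way a learner ends up "implicitly using more than each data series in isolation".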
The use of additional data series seems a bit trickier. It might be considered "cheating" or it might be entirely encouraged. If the goal is to create a learning system capable of ingesting a novel corpus of data and making predictions, then attempting to augment the 100,000 provided series with additional series would be bad. But if the goal is to generate a general predictor that processes all of humanity's data and provides predictions, then augmenting the 100,000 data series is clearly a welcome solution.
Are the 100,000 data series meant to be the only training data or are they just the test data series?
And what about a hybrid system that generates "new" data series through functions of combinations of the original data?
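Something like this, say (the two stand-in series and the pairwise functions are arbitrary examples, not a proposal for which combinations would actually help):

    import numpy as np

    a = np.cumsum(np.random.randn(50))    # two stand-in original series
    b = np.cumsum(np.random.randn(50))

    # "New" series synthesized as functions of combinations of the originals.
    derived = {
        "sum":   a + b,
        "diff":  a - b,
        "ratio": a / (np.abs(b) + 1e-9),  # guard against division by zero
    }
    # Each derived series could then be fed to the forecaster alongside a and b.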