# NYC TAXI

### [**NYC taxi pickup problem** ](http://www.vivekchoksi.com/papers/taxi_pickups.pdf)

[**The taxi problem**](http://www.vivekchoksi.com/papers/taxi_pickups.pdf) **is an intro to a well known machine learning problem, the paper will explain about feature engineering, analysis and using various regression algorithms for the purpose of solving the problem, you can use this as a base for many regression and classification problems.**<br>

**A** [**Second study**](http://blog.nycdatascience.com/student-works/predict-new-york-city-taxi-demand/) **(regression, random forest,** [**xgboost**](http://xgboost.readthedocs.io/en/latest/model.html) **(extreme gradient boosting tree)).**

[**Standard error estimate**](https://www.youtube.com/watch?v=r-txC-dpI-E\&index=4\&list=PLF596A4043DBEAE9C) **-- measures the distance from the estimated value to the real value**

**R^2 error estimate- measures the distance of the estimated to the mean against the real to the mean, 1 no error, 0 lots.**<br>

**\*\*\*\* with regression prediction it's best to create dummy variables (i.e., binary variables - exist or doesn't exist) from numeric variables, such as grid\_number to grid\_1, grid\_2 etc..**<br>


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://www.mlcompendium.com/business-domains/examples.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
