Ml-bucket

Ever gone through a situation where you are implementing a research paper and wish for some petty scripts which could have made your life easier? Well, the aim of the repository is to bring all the appurtenances of ML (NLP/CV etc.) into one place and use them whenever you need them with a little tweak. I have added some basic scripts and will add more in due time.

tf-idf.py implements the standard tf-idf (term frequence - inverse document frequency) algorithm using sklearn (TfidfVectorizer), although you can use HashVectorizer for better speedup and scalability.
SVM.py implements Support Vector Machine algorithm on the data train.csv. The code first removes all the un-necessary features, converts the categorical/nominal features to numberical using one-hot encoding method and final training is done using LibSVM .

Everyone is encouraged to contribute to this repository.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
Python		Python
Scala		Scala
README.md		README.md
SVM.py		SVM.py
app.py		app.py
codecoax.xml		codecoax.xml
dash.ipynb		dash.ipynb
doc2vec.py		doc2vec.py
example-kube.xml		example-kube.xml
pyspark_kmeans.py		pyspark_kmeans.py
report.ipynb		report.ipynb
resume_sahil_2.pdf		resume_sahil_2.pdf
small_text.txt		small_text.txt
tf-idf.py		tf-idf.py
train.csv		train.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Ml-bucket

Everyone is encouraged to contribute to this repository.

About

Releases

Packages

Languages

wadhwasahil/ML-bucket

Folders and files

Latest commit

History

Repository files navigation

Ml-bucket

Everyone is encouraged to contribute to this repository.

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages