The Wayback Machine - http://web.archive.org/web/20220418213112/https://github.com/topics/data-mining?l=python

data-mining

Here are 1,166 public repositories matching this topic...

gensim
mpenkov commented Jun 22, 2021

In gensim/models/fasttext.py:

    model = FastText(
        vector_size=m.dim,
        window=m.ws,
        epochs=m.epoch,
        negative=m.neg,
        # FIXME: these next 2 lines read in unsupported FB FT modes (loss=3 softmax or loss=4 onevsall,
        # or model=3 supervi
bug, difficulty easy, good first issue, fasttext
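
The FIXME above flags loading unsupported Facebook fastText modes as a silent hazard. A minimal sketch of an explicit check follows; it is not gensim's code. The mode codes (loss=3 softmax, loss=4 onevsall, model=3) come from the excerpt, while the helper names `find_unsupported_modes` and `check_modes` and the `strict` flag are hypothetical.

```python
import warnings

# Mode codes as named in the FIXME comment; the structure around them is an assumption.
UNSUPPORTED_LOSSES = {3: "softmax", 4: "onevsall"}
UNSUPPORTED_MODELS = {3}  # the FIXME text for model=3 is truncated in the excerpt


def find_unsupported_modes(loss, model):
    """Return a description of every unsupported mode found (hypothetical helper)."""
    problems = []
    if loss in UNSUPPORTED_LOSSES:
        problems.append(f"loss={loss} ({UNSUPPORTED_LOSSES[loss]})")
    if model in UNSUPPORTED_MODELS:
        problems.append(f"model={model}")
    return problems


def check_modes(loss, model, strict=True):
    """Raise (strict) or warn (non-strict) when an unsupported mode is present."""
    problems = find_unsupported_modes(loss, model)
    if not problems:
        return
    message = "unsupported Facebook fastText mode(s): " + ", ".join(problems)
    if strict:
        raise ValueError(message)
    warnings.warn(message)
```

Passing `strict=False` would keep the load going with a warning, which might suit read-only access to the trained vectors.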
sktime
fkiraly commented Apr 14, 2022

In its current default setting, the Differencer drops the first entry, which may frustrate users, especially inside a pipeline (its "typical use"); see, e.g., alan-turing-institute/sktime#2452

We should add an NA handling parameter and make the default fill in something for the first value, e.g., a difference from an

feature request, good first issue, module:forecasting, enhancement
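
The effect described in this issue can be shown with a plain-Python sketch of first differencing. It does not use sktime, and the function name `first_difference` and the option names `"drop"`, `"keep_nan"`, and `"fill_zero"` are hypothetical; the issue excerpt does not say which fill value or parameter name is proposed.

```python
import math


def first_difference(values, na_handling="fill_zero"):
    """First-order differencing with a choice of how to treat the first entry.

    Hypothetical options, not sktime's API:
      "drop"      -> output is one element shorter (the behaviour the issue describes)
      "keep_nan"  -> first entry is NaN, length is preserved
      "fill_zero" -> first entry is 0.0, length is preserved
    """
    if not values:
        return []
    diffs = [b - a for a, b in zip(values, values[1:])]
    if na_handling == "drop":
        return diffs
    if na_handling == "keep_nan":
        return [math.nan] + diffs
    if na_handling == "fill_zero":
        return [0.0] + diffs
    raise ValueError(f"unknown na_handling: {na_handling!r}")
```

With a length-preserving option, the differenced series stays aligned with the original index, which is what downstream pipeline steps usually expect.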
awesome-fraud-detection-papers

CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.

  • Updated Apr 11, 2022
  • Python
annoviko commented Oct 23, 2020

Introduction
As a tester or developer, I want to have the correct answer for the Hepta FCPS sample so that I can use it in test scenarios to improve the quality of the library.

Acceptance Criteria

  • Introduce Hepta.answer that will contain cluster labels for points from Hepta.data.
  • Add Hepta.answer to definitions.
Testing, Good First Issue
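
An answer file of this kind could be consumed in a test roughly as sketched below. The file layout (one integer label per line) and the helper names `parse_labels` and `check_answer` are assumptions for illustration; the issue does not specify the format.

```python
def parse_labels(text):
    """Parse one integer cluster label per non-empty line (assumed file layout)."""
    return [int(line) for line in text.splitlines() if line.strip()]


def check_answer(points, labels):
    """Verify there is exactly one label per point; return the number of clusters."""
    if len(points) != len(labels):
        raise ValueError(
            f"{len(points)} points but {len(labels)} labels"
        )
    return len(set(labels))
```

A test scenario could then compare a clustering result against these labels once the counts are known to match.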
