[B! 近似] sh19910711のブックマーク

sh19910711 id:sh19910711

近似に関するsh19910711のブックマーク (5)

Philippe Flajolet’s contribution to streaming algorithms
Jérémie Lumbroso's talk at the AK Data Science Summit on Streaming and Sketching in Big Data and Analytics on 06/20/2013 at 111 Minna. For more information: http://blog.aggregateknowledge.com/ak-data-science-summit-june-20-2013
sh19910711 2017/05/27
*history

*algorithm

近似
リンク
Rustで作ってみよう -- HyperLogLogと並列処理で、ウィキペディア全記事のユニーク単語数を見積もる（その１） - Qiita
Rustで作ってみよう -- HyperLogLogと並列処理で、ウィキペディア全記事のユニーク単語数を見積もる︵その1︶アルゴリズムRust 今年の始め、私が Rust を習いはじめのころ、手本となるプログラムがあまり見つからないことが不満でした。GitHub で探せば、Rust で書かれた実用的なライブラリーが数多く見つかりますが、それらを読むのは入門者にとっては敷居が高過ぎます。私が欲しかったのは、学習用に書かれたプログラムで、入門者が手軽に試せて、いろいろといじれるプログラム例でした。そんなわけで、そういうプログラム例を書いてみようと思います。2回に分けて、Rust で簡単なツールを作ります。今回は乱択アルゴリズムの一種である、probability cardinarity estimatior︵確率的カーディナリティ推定機︶を実装します。HyperLogLog という名前のデ
sh19910711 2017/05/27
*program

rust

*algorithm

近似

*data

Wikipedia
リンク
HyperLogLogで遊ぶ - Negative/Positive Thinking
はじめに「さぁ、お前の罪の異なり数を数えろ！」と言われたときに使えそうな「HyperLogLog」という異なり数をカウントする方法を教えてもらったので、遊んでみた。いつもながら論文ちゃんと読んでないので、条件やコード間違ってるかも。。。 HyperLogLogとは cardinalityと呼ばれる、要素の異なり数を決定する問題かなり省メモリで精度のよい異なり数を推定できる方法要素をそのまま保存せず、ハッシュ値に変換したものをうまくレジスタに保存しておくので、レジスタサイズ程度しかメモリを使わない並列化もできて、最近のbigdataとかで注目されているまた、googleが並列計算用に改善したHyperLogLogを提案してるみたい http://blog.aggregateknowledge.com/2013/01/24/hyperloglog-googles-take-on-
sh19910711 2017/05/27
*algorithm

近似

NLP

*program

c*
リンク
乱択データ構造の最新事情－MinHash と HyperLogLog の最近の進歩－
Several recent papers have explored self-supervised learning methods for vision transf ormers (ViT). Key approaches include: 1. Masked prediction tasks that predict masked patches of the input image. 2. Contrastive learning using techniques like MoCo to learn representations by contrasting augmented views of the same image. 3. Self-distillation methods like DINO that distill a teacher ViT into a st
sh19910711 2017/05/27
*data

*algorithm

近似
リンク
HyperLogLog - Wikipedia
HyperLogLog is an algorithm for the count-distinct probl em, approximating the number of distinct elements in a multiset.[1] Calculating the exact cardinality of the distinct elements of a multiset requires an amount of memory proportional to the cardinality, which is impractical for very large data sets. Probabilistic cardinality estimators, such as the HyperLogLog algorithm, use significantly les
sh19910711 2017/05/26
HyperLogLogってツールかと思ったらアルゴリズムのことなのか

*algorithm

近似
リンク
1

お知らせ

もっと読む

公式Twitter

@HatenaBookmark
リリース、障害情報などのサービスのお知らせ
@hatebu
最新の人気エントリーの配信

キーボードショートカット一覧

j次のブックマーク

k前のブックマーク

lあとで読む

eコメント一覧を開く

oページを開く

設定を変更しましたx