Jump to content
 







Main menu
   


Navigation  



Main page
Contents
Current events
Random article
About Wikipedia
Contact us
Donate
 




Contribute  



Help
Learn to edit
Community portal
Recent changes
Upload file
 








Search  

































Create account

Log in
 









Create account
 Log in
 




Pages for logged out editors learn more  



Contributions
Talk
 



















Contents

   



(Top)
 


1 Introduction  





2 My Thesis  





3 User List  





4 Updates (before 2008/4/22)  





5 Updates (2008/4/22)  





6 Updates (2008/5/27)  














User:Abhaac

















User page
Talk
 

















Read
Edit
View history
 








Tools
   


Actions  



Read
Edit
View history
 




General  



What links here
Related changes
User contributions
User logs
View user groups
Upload file
Special pages
Permanent link
Page information
Get shortened URL
Download QR code
 




Print/export  



Download as PDF
Printable version
 
















Appearance
   

 






From Wikipedia, the free encyclopedia
 


Introduction

[edit]

Hello everyone, I am a postgraduate of institute of technology management in Taiwan Tsing-Hua University. My domain knowledge is about information management, knowledge management, information retrieval and data mining.

abhaac

My Thesis

[edit]

My thesis topic is about the distributed knowledge management. A good data source of distributed knowledge is Wikipedia. That's why I am here and looking forward to some exciting discovery.

A more exact topic is Constructing a Knowledge Evolution Map System on Wikipedia. My thesis proposal was just on 1/14. An important reason to build such a system on Wikipedia is that the knowledge resource is rich and the quality of knowledge is good on Wiki.

User List

[edit]

I need a list of users to keep track the history which can be a good data source cause the knowledge evolution map system will take individuals as subjects. A map for a user.

The criterion to select a user to be a subject are as below:


user link to contribution Note
Ronz Special:Contributions/Ronz
Fmccown Special:Contributions/Fmccown See also User:Fmccown, there is a list of topics the user have made main contributions.
JackyR Special:Contributions/JackyR
Qwfp Special:Contributions/Qwfp
Michael Hardy Special:Contributions/Michael_Hardy
Angelo.romano Special:Contributions/Angelo.romano
Warut Special:Contributions/Warut
Mav Special:Contributions/Mav
Acalamari Special:Contributions/Acalamari
Hoary Special:Contributions/Hoary
Greekboy Special:Contributions/Greekboy
El_Greco Special:Contributions/El_Greco
Grk1011 Special:Contributions/Grk1011

Updates (before 2008/4/22)

[edit]

It has come to an idea: using the concepts of n-gram and hierarchical clustering (HAC). N-gram Clustering by date can find out the periods when the user has edited the similar pages, while hierarchical clustering can find out the similar periods which may not be in the sequent time.

The experiment results look not bad. It works to identify the different knowledge periods in time line.

Updates (2008/4/22)

[edit]

Now we've collected the data mentioned above and clustered them by date. There comes some problems:

  1. Every cluster hasn't been clustered by knowledge domain. This would cause the ambiguous knowledge structure in a cluster. The idea which taking the categories of Wikipedia seems not good because Wiki's categories are also defined by users, and intermingle with some categories which are not well-defined or not related to domain knowledge;
  2. We use bottom-up hierarchical clustering to classify the data. The threshold of merging two clusters in every hierarchy is totally the same, i.e. 0.8. I wounder why this would work in hierarchical clustering. It should be less similar when the hierarchy is getting higher;
  3. With TFxIDF value implemented, the computing time increases exponentially when the hierarchy is higher.

JnWtalk 12:41, 22 April 2008 (UTC)

Updates (2008/5/27)

[edit]

Now it has been the end of May. The progress of thesis is still going. We have conducted the mechanism to find out the knowledge evolution map. We argue that a user may have accessed the similar topics on Wiki, so we firstly implement n-gram algorithm to identify the periods with similar knowledge structure. After that, we use HAC to cluster these periods. In order to decide a good clustering result, we use Minmax to determine the final clustering result in HAC.

Now the problem has come to visualize the clustering result. My classmate had suggested me a java-based visualization tooltip: JFreeChart, and I have produced some charts. But it looks like not so user-friendly...

Time Series KEM
Time Series KEM
Histogram KEM
Histogram KEM

JnWtalk 08:40, 27 May 2008 (UTC)


Retrieved from "https://en.wikipedia.org/w/index.php?title=User:Abhaac&oldid=1142407309"

Categories: 
Taiwanese Wikipedians
User zh-N
User en-2
User java-2
User c++-2
User php
User html
User mysql
User sql
Chinese Wikipedians
 



This page was last edited on 2 March 2023, at 06:06 (UTC).

Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.



Privacy policy

About Wikipedia

Disclaimers

Contact Wikipedia

Code of Conduct

Developers

Statistics

Cookie statement

Mobile view



Wikimedia Foundation
Powered by MediaWiki