Authors: Sungwoo Lee, Jaedong Lee, Jaekwang Kim, Jee-Hyong Lee
Conference: International Conference on Robust Statistics
Conference (abbr.): ICORS 2012
Conference start date: 2012-08-05
Conference end date: 2012-08-10

We propose a method for estimating blog topic variation using Term Frequency Smoothing (TFS) and Probabilistic Latent Semantic Analysis (PLSA).

In the earliest blogging services, many bloggers published their own content about daily life.

Over time, blog services as a business model came under the spotlight.

However, social networking services (SNS) began to replace the blog's role as an outlet for personal life.

Bloggers who depended on their blogs for profit became concerned about maintaining a steady influx of visitors.

In response to these changes, the purpose of blogs shifted to accumulating specialized information.

When bloggers want to run a specialized and profitable blog, the most important problem is how to select a blog topic.

Many related studies have addressed this problem, but they did not consider the temporal features of blogs.

As a result, the counts of the extracted words were inaccurate.

We present a TFS method that reflects the temporal features of blogs.

Blog content is updated in chronological order.

Moreover, posts that are added continuously over time are highly likely to share similar subjects.

We therefore propose the following formula.

First, the term frequencies of documents posted on a given date are added to the term frequencies of documents posted before and after that date.

Then, the term frequencies of the documents are smoothed for that date.

The result is a document-term matrix that reflects the temporal features of the blog.
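The abstract does not reproduce the formula itself, so the following is only a minimal sketch of the idea under stated assumptions: term frequencies are summed over a symmetric window of neighboring dates and divided by the number of dates in that window. The function name `smooth_term_frequencies`, the `window` parameter, and the unweighted average are all hypothetical choices for illustration, not the authors' definition.

```python
def smooth_term_frequencies(tf_by_date, window=1):
    """Smooth per-date term frequencies over neighboring dates.

    tf_by_date: list of dicts (term -> count), one per date, in
    chronological order. `window` is an assumed parameter: how many
    dates before and after each date contribute to its smoothed value.
    """
    n = len(tf_by_date)
    smoothed = []
    for i in range(n):
        lo, hi = max(0, i - window), min(n, i + window + 1)
        totals = {}
        # Add the term frequencies of the date itself and its neighbors.
        for j in range(lo, hi):
            for term, count in tf_by_date[j].items():
                totals[term] = totals.get(term, 0) + count
        # Assumed smoothing step: unweighted average over the window.
        span = hi - lo
        smoothed.append({term: total / span for term, total in totals.items()})
    return smoothed


# Hypothetical toy data: three consecutive posting dates.
tf_by_date = [
    {"camera": 2},
    {"camera": 1, "lens": 3},
    {"recipe": 4},
]
smoothed = smooth_term_frequencies(tf_by_date, window=1)
```

With this toy input, the term "lens" appears in the smoothed row for the first date even though it was only used on the second date, which is the kind of temporal carry-over the smoothing is meant to capture.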

By combining this method with PLSA, we can estimate more accurately which documents and terms belong to each subject.

We extract the documents and terms belonging to each subject according to the formula.
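For readers unfamiliar with PLSA, a minimal sketch of the standard EM procedure may help show how documents and terms are assigned to subjects. This is the textbook algorithm written in plain Python, not the authors' implementation; the function names, the iteration count, the random initialization, and the small hypothetical count matrix are assumptions made for illustration.

```python
import random


def normalized(row):
    """Scale a row so it sums to 1 (uniform if the row is all zeros)."""
    total = sum(row)
    if total == 0:
        return [1.0 / len(row)] * len(row)
    return [v / total for v in row]


def plsa(counts, n_topics, n_iter=50, seed=0):
    """Fit PLSA by EM. counts[d][w] is the (possibly smoothed) frequency
    of term w in document d. Returns P(z|d) and P(w|z) as nested lists."""
    rng = random.Random(seed)
    n_docs, n_terms = len(counts), len(counts[0])
    p_z_d = [normalized([rng.random() + 1e-3 for _ in range(n_topics)])
             for _ in range(n_docs)]
    p_w_z = [normalized([rng.random() + 1e-3 for _ in range(n_terms)])
             for _ in range(n_topics)]
    for _ in range(n_iter):
        acc_z_d = [[0.0] * n_topics for _ in range(n_docs)]
        acc_w_z = [[0.0] * n_terms for _ in range(n_topics)]
        for d in range(n_docs):
            for w in range(n_terms):
                if counts[d][w] == 0:
                    continue
                # E-step: P(z|d,w) is proportional to P(z|d) * P(w|z).
                joint = [p_z_d[d][z] * p_w_z[z][w] for z in range(n_topics)]
                norm = sum(joint)
                if norm == 0:
                    continue
                for z in range(n_topics):
                    weight = counts[d][w] * joint[z] / norm
                    acc_z_d[d][z] += weight
                    acc_w_z[z][w] += weight
        # M-step: renormalize the accumulated expected counts.
        p_z_d = [normalized(row) for row in acc_z_d]
        p_w_z = [normalized(row) for row in acc_w_z]
    return p_z_d, p_w_z


# Hypothetical document-term matrix: 3 documents (rows) by 3 terms (columns).
counts = [[1.5, 1.5, 0.0], [1.0, 1.0, 4 / 3], [0.5, 1.5, 2.0]]
p_z_d, p_w_z = plsa(counts, n_topics=2)
# Assign each document to its most probable subject.
doc_topics = [max(range(2), key=lambda z: row[z]) for row in p_z_d]
```

Each row of `p_z_d` gives a document's distribution over subjects, and each row of `p_w_z` gives a subject's distribution over terms, so the highest-probability entries indicate which documents and terms belong to which subject.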

    2024

      Learning with Structural Labels for Learning with Noisy Labels
      2024.02.27
      Authors: 김누리*, 이진섭*, 이지형 (*: equal contribution)     Conference: IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024     Conference (abbr.): CVPR 2024     Conference start date: 2024-06-17     Conference end date: 2024-06-19
      STAGE: Simple Text Data Augmentation by Graph Exploration
      2024.02.28
      Authors: 김호승, 강용훈, 이지형     Conference: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation     Conference (abbr.): LREC-COLING 2024     Conference start date: 2024-05-20     Conference end date: 2024-05-25
      TF-EDA: Efficient and Effective Text Data Augmentation
      2024.02.29
      Authors: 김호승, 이지형, 김한별     Conference: The 24th International Symposium on Advanced Intelligent Systems     Conference (abbr.): ISIS 2023     Conference start date: 2023-12-06     Conference end date: 2023-12-09

    2023

    2022

      Reducing computational cost in federated ensemble learning via rank-one matrix
      2022.12.05
      Authors: YongHoon Kang, HoSeung Kim, Jee-Hyong Lee     Conference: Joint 12th International Conference on Soft Computing and Intelligent Systems and 23rd International Symposium on Advanced Intelligent Systems     Conference (abbr.): SCIS-ISIS 2022     Conference start date: 2022-11-29     Conference end date: 2022-12-02

    2021