i-manager Publications

Information Extraction with Semantic Clustering

Halavath*, Dr. A. Govardhan**

* Associate Professor and Head, Department of Computer Science and Engineering, Noble College of Engineering and Technology for Women, Hyderabad, India.

** Director, School of Information Technology, Jawaharlal Nehru Technical University, Hyderabad, India.

Periodicity:March - May'2015
DOI : https://doi.org/10.26634/jcom.3.1.3437

Abstract

Generally, Information Extraction (IE) is concentrated on fulfilling exact, restricted, pre-specified solicitations from homogeneous corpora (e.g., extract the area and time of courses from a set of declarations). Information Extraction (IE) customarily relied with comprehensive individual engagement by means of hand crafted extraction guidelines as well as hand-tagged instruction illustrations. Furthermore, the user is required to clearly pre specify about every relative associated with attention. Data extraction in information control is usually associated with the artificial thinking ability throughout, and the progress associated with methods as well as algorithms for those aspects of words evaluation, as well as their laptop or computer enactment. Moving to another space requires the client to name the target relations and to physically make new extraction tenets or hand-label new preparing cases. This difficult work scales directly with the quantity of target relations. This paper, explains extraction strategy for site data focused around DOM to enhance the seeking proficiency, which is to safeguard the topic data, and to channel out the commotion data that the clients are not inspired by. The experiments are done by taking different data sets. The proposed semantic clustering gives the best way to extract the information from web than existing techniques. The experimental results clearly show that the proposed technique gives better results when compared to existing techniques.

Keywords

Information Extraction, Handcrafted Extraction, Data Extraction, Semantic Clustering.

How to Cite this Article?

Balaji, H., and Govardhan, A. (2015). Information Extraction with Semantic Clustering, i-manager’s Journal on Computer Science, 3(1), 15-20. https://doi.org/10.26634/jcom.3.1.3437

References

[1]. D. M. Bikel, R. Schwartz, and R. M. Weischedel, (1999). “An algorithm that learns what's in a name”, Machine Learning, Vol.34(8), pp.211–232.

[2].B.Couasnon,(2003). “Dmos, a Generic Document Recognition Method: Application to Table Structure Analysis in a General and in a Specific Way,” International Journal on Decission Analysis, Vol.35(1), pp.129-147.

[3]. Ribeiro Neto. B. A, Laender. A. H. F. and DA Silva. A. S, (1999). “Extracting Semi-structured Data through th Examples,” Proceedings of the 8 ACM International Conference on Information and Knowledge Management (CIKM), Kansas City, Missouri, pp.94-101.

[4]. Muslea. I., Minton. S., and Knoblock. C, (1999). “A Hierarchical Approach to Wrapper Induction,” rd Proceedings of the 3 International Conference on Autonomous Agents (AA-99), pp.145-169.

[5]. Kushmerick. N., (2003). Adaptive Information Extraction: Core Technologies for Information Agents. In Intelligent Information Agents R&D in Europe: An AgentLink perspective (Klusch, Bergamaschi, Edwards & Petta, eds.). Lecture Notes in Computer Science 2586, Springer.

[6]. The Open Group. TOGAF Version 9: The Open Group Architecture Frame-work 2009.

[7]. Robert Cooley, (2003). “The use of Web structure and Content to Identify Subjectively Interesting Web Usage Patterns,” ACM Transactions on Internet Technology, Vol.3(2).

[8]. Califf. M. and Mooney. R, (1998). “Relational Learning of Pattern-match Rules for Information Extraction,” Proceedings of AAAI Spring Symposium on Applying Machine Learning to Discourse Processing, Stanford, California.

[9]. Agichtein, Eugene and Gravano, Luis,(2002). “Snowball: Extracting Relations From Large Plain-Text Collections,” ACM International Conference on Digital Libraries, pp.85–94.

[10]. A. K. Jain and R. C. Dubes, (1988). “Algorithms for Clustering Data,” Englewood Cliffs, NJ: Prentice-Hall.

[11]. S. Z. Selim and M. A. Ismail, (1984). “K-means Type Algorithms: a Generalized Convergence Theorem and Characterization of Local Optimality,” In IEEE Transaction on Pattern Analysis and Machine Intelligence, Vol.6(1), pp.81-87.

Information Extraction with Semantic Clustering

Abstract

Keywords

How to Cite this Article?

References

If you have access to this article please login to view the article or kindly login to purchase the article

Purchase Instant Access

Options for accessing this content:

	North Americas,UK, Middle East,Europe		India	Rest of world
	USD	EUR	INR	USD-ROW
Pdf	35	35	200	20
Online	15	15	200	15
Pdf & Online	35	35	400	25