CN109299167B - A visualization method to display family migration history and family development - Google Patents
A visualization method to display family migration history and family development Download PDFInfo
- Publication number
- CN109299167B CN109299167B CN201811158830.1A CN201811158830A CN109299167B CN 109299167 B CN109299167 B CN 109299167B CN 201811158830 A CN201811158830 A CN 201811158830A CN 109299167 B CN109299167 B CN 109299167B
- Authority
- CN
- China
- Prior art keywords
- family
- character
- influence
- migration
- characters
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/213—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
- G06F18/2135—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on approximation criteria, e.g. principal component analysis
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
Landscapes
- Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Theoretical Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention relates to a visualization method for displaying family migration history and family development conditions, which comprises the following steps: arranging the character information; calculating the influence of the person; establishing a figure family spectrogram; visualization of human influence: the method comprises the following steps of adopting a rose diagram as a visual diagram of the influence of all families in the same dynasty and with the same ancestor of people in the same place as a household, wherein the area of each petal of the rose diagram represents the influence of each person, the total area of roses represents the influence of all the families living in the same dynasty, and the sum of the influences of all the families in the same place represents the prosperity degree and social prestige of the family of the people; visualizing family migration process; predicting the cause of family migration; family migration and spatio-temporal visualization of social influence.
Description
Technical Field
The invention relates to a machine learning-based visualization method.
Background
The development of data mining technology and visualization brings a new way for traditional human history research. Historians provide computer students with good research materials by electronically documenting historical literature. The computer student can better show the historical rules through the visual electronic documents, and meanwhile, the historian can be better judged through the machine learning related algorithm.
Disclosure of Invention
The invention provides an effective visualization method, which can better show family migration and family development processes of people. The technical scheme is as follows:
a visualization method for displaying family migration history and family development conditions comprises the following steps:
(1) collating character information
Preprocessing the data to form the information containing the names of the figures, the longitude and latitude of the places where the family members of the figures are located, the life dynasties of the figures, the parents of the figures and the achievement information of the figures, wherein the achievement information of the figures comprises the following steps: literary works of people, occupation of people, people's manner of entering, social relations of people, relative relations of people, and major events of people's participation;
(2) calculating human influence
(3) Establishing a family spectrogram of a character
By a recursion algorithm, continuously recursively searching the parent-child relationship of the figure for each person, and matching rules, wherein the rules are as follows: 0< father birth time-son birth time <100, until the father-son relationship cannot be found, recording that the current person is an ancestor of the same family; all the people with ancestors of the same person are the same family;
(4) visualization of human influence
The method comprises the following steps of adopting a rose diagram as a visual diagram of the influence of all families in the same dynasty and with the same ancestor of people in the same place as a household, wherein the area of each petal of the rose diagram represents the influence of each person, the total area of roses represents the influence of all the families living in the same dynasty, and the sum of the influences of all the families in the same place represents the prosperity degree and social prestige of the family of the people;
(5) family migration process visualization
The dynamic straight line with an arrow represents the migration direction and the migration scale of the character family, the generation of each migration is recorded, the arrow of the migration straight line represents the migration direction of the character family, the two ends of the straight line respectively represent the home address and the migration destination of the family, and the width of the straight line represents the number of the migrating people;
(6) prediction of cause of family migration
Sorting various factors which may influence family migration in a database, setting 1 and setting 0 to respectively represent existence and nonexistence, firstly adopting a PCA dimension reduction algorithm to reduce dimension of features and prevent overfitting, then adopting a multi-classification logistic regression algorithm to carry out model training on the data after dimension reduction, balancing variance and deviation in a cross validation mode, continuously debugging the dimension after PCA dimension reduction and training parameters of the logistic regression model to enable the accuracy of the model to reach the maximum value, storing the model, and finally generating the model to predict each migration which has been recorded;
(7) family migration and spatio-temporal visualization of social influence
The two-dimensional map marks the geographic position of the life of the person by taking the time coordinate axis as the substitute, simultaneously displays the reason of the predicted migration, and dynamically displays the family migration process of the person and the social influence change process of the family of the person with time change.
In the step (2), the crawler technology is utilized to obtain the attention of the entry of the search engine of the stored literature, and the influence factor of the character literature is calculated according to the formula (1); scoring the occupation of the figure according to the ancient Chinese occupation status, and calculating a figure occupation influence factor according to a formula (2); counting the attention of the entries of the search engine on the major events participated by the characters, standardizing the influence factors of the major events participated by the characters to 0-100, adding the influence factors of the major events participated by the characters, calculating a formula (3), counting the social relation influence of the characters according to the attached degree of the characters, wherein the formula (4) is shown, and finally, carrying out overall scoring on the social influence of the characters, wherein the overall scoring is shown as a formula (5);
I1influence factor, P, representing human literature1iInfluence factor representing the i-th work of a person participating in authoring, E1The influence factors of any works stored in the database at present;
I2influencing factor, P, representing the occupation of a person2iAn influence factor representing an i-th occupation in which the person is engaged;
I3the impact factor, P, representing a significant event in which a person participates3iInfluence factor representing the i-th significant event in which the person participates, E3Influence factors of any significant event stored in the database at present;
I4influential factor, P, representing the social relationship of a person4iRepresenting the number of persons attached to the character, E4The number of people to which any person is attached is stored in the current database;
Isum=I1+I2+I3+I4formula (5)
IsumRepresenting the influence of the character.
Drawings
FIG. 1 is a flow chart of the method
FIG. 2 flow chart of person family generation
FIG. 3 is a flow chart of character family migration cause model training and prediction
FIG. 4 visualization of family migration and influence of a person
Detailed Description
The invention provides a comprehensive and effective visualization mode for a family influence change process and a family migration process by combining with space-time information, a figure family relation graph is obtained through a data mining technology, a figure influence value is obtained through statistics, all data are comprehensively visualized on a two-dimensional map by combining with the time and space information of the life of all people in the whole family of a figure, and a valuable auxiliary speculation is given for a migration reason through a machine learning algorithm. The visualization method can effectively display the family migration history of the person and the influence conditions of the families at different times and can effectively analyze the family migration reasons. The method comprises the following specific steps:
1. collating character information
Preprocessing the data to form the information containing the names of the figures, the longitude and latitude of the places where the family members of the figures are located, the life dynasties of the figures, the parents of the figures and the achievement information of the figures, wherein the achievement information of the figures comprises the following steps: literary works of people, occupation of people, manner of entering people, social relationship of people, relative relationship of people, major events of people participation, and the like.
2. Calculating human influence
The method comprises the steps of firstly, obtaining the attention of the Baidu vocabulary entry of existing literature works by utilizing a crawler technology, calculating influence factors of the character literature works according to a formula (1), grading the occupation of characters according to the ancient occupation status of China, calculating the occupation influence factors of the characters according to a formula (2), counting the attention of the Baidu vocabulary entry to major events in which the characters participate, standardizing the influence factors to be 0-100, adding the influence factors of the major events in which the characters participate, wherein the calculation formula is a formula (3), counting the social relation influence of the characters according to the degree to which the characters are attached, namely a formula (4), and finally, counting the overall grade of the social influence of the characters, namely a formula (5).
Isum=I1+I2+I3+I4Formula (5)
3. Establishing a family spectrogram of a character
By a recursion algorithm, continuously recursively searching the parent-child relationship of the figure for each person, and matching rules, wherein the rules are as follows: 0< father birth time-son birth time <100, until the father-son relationship cannot be found, the process is as shown in fig. 2, and the current person is recorded as an ancestor of the same family. All the people with the ancestors of the same person are the same family.
4. Visualization of human influence
The rose flower picture is used as a visual influence picture of all people in the same dynasty and in the same place and in the same ancestor with the characters, wherein the area of each petal of the rose picture represents the influence of each character, the total area of roses represents the influence of all people living in the same dynasty, and the total sum of the influences of all the people in the same place represents the prosperity degree and social prestige of the family in which the characters are located.
5. Family migration process visualization
The dynamic straight line with an arrow represents the migration direction and the migration scale of the character family, the generation of each migration is recorded, the arrow of the migration straight line represents the migration direction of the character family, the two ends of the straight line respectively represent the home address and the migration destination of the family, and the width of the straight line represents the number of the migrating people.
6. Prediction of cause of family migration
Sorting various factors which may influence family migration in a database, setting 1 and setting 0 to respectively represent existence and nonexistence, firstly adopting a PCA dimension reduction algorithm to reduce dimension of features and prevent overfitting, then adopting a multi-classification logistic regression algorithm to train a model of the data after dimension reduction, balancing variance and deviation in a cross validation mode, continuously debugging the dimension after PCA dimension reduction and training parameters of the logistic regression model to enable the accuracy of the model to reach the maximum value, storing the model, and finally generating the model to predict each recorded migration, wherein the specific flow is shown in FIG. 3.
7. Family migration and spatio-temporal visualization of social influence
The two-dimensional map marks the geographic position of the life of the person by taking the time coordinate axis as the substitute, simultaneously displays the reason of the predicted migration, and dynamically displays the family migration process of the person and the social influence change process of the family of the person with time change. Fig. 4 shows the influence and migration effect of a family of people in a certain generation.
And fitting the migration cause prediction model by adopting a PCA algorithm and a logistic regression algorithm in a sklern module of python. The method comprises the steps of taking a map component in an echarts frame as a geographic information platform, taking a java language spring MVC frame and a mybatis frame as background technical frames, dynamically displaying the character family migration and the change process of social influence by taking a two-dimensional display plane as a substitute time axis and taking a Chinese map as a two-dimensional display plane, and giving a migration reason prediction result. By adopting the method, a good visualization effect can be obtained.
Claims (1)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201811158830.1A CN109299167B (en) | 2018-09-30 | 2018-09-30 | A visualization method to display family migration history and family development |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201811158830.1A CN109299167B (en) | 2018-09-30 | 2018-09-30 | A visualization method to display family migration history and family development |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN109299167A CN109299167A (en) | 2019-02-01 |
| CN109299167B true CN109299167B (en) | 2021-08-13 |
Family
ID=65161507
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201811158830.1A Expired - Fee Related CN109299167B (en) | 2018-09-30 | 2018-09-30 | A visualization method to display family migration history and family development |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN109299167B (en) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110110270B (en) * | 2019-04-25 | 2021-01-15 | 武汉大学 | A method and device for generating large-scale genealogy graphs with parallel processing |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101988119A (en) * | 2009-07-31 | 2011-03-23 | 刘晓明 | Method for calculating family branch of family name and tracing pedigree by using DNA |
| CN106540448A (en) * | 2016-09-30 | 2017-03-29 | 浙江大学 | The visual analysis method affected on its consuming behavior is exchanged between a kind of game player |
| CN107346337A (en) * | 2017-06-30 | 2017-11-14 | 福州大学 | A kind of family tree with history age mark and ancestral hall information linkage method for visualizing |
Family Cites Families (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH10250261A (en) * | 1997-03-12 | 1998-09-22 | Noritsu Koki Co Ltd | Family tree and family tree generation system |
| US6416325B2 (en) * | 2000-04-14 | 2002-07-09 | Jeffrey J. Gross | Genealogical analysis tool |
-
2018
- 2018-09-30 CN CN201811158830.1A patent/CN109299167B/en not_active Expired - Fee Related
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101988119A (en) * | 2009-07-31 | 2011-03-23 | 刘晓明 | Method for calculating family branch of family name and tracing pedigree by using DNA |
| CN106540448A (en) * | 2016-09-30 | 2017-03-29 | 浙江大学 | The visual analysis method affected on its consuming behavior is exchanged between a kind of game player |
| CN107346337A (en) * | 2017-06-30 | 2017-11-14 | 福州大学 | A kind of family tree with history age mark and ancestral hall information linkage method for visualizing |
Also Published As
| Publication number | Publication date |
|---|---|
| CN109299167A (en) | 2019-02-01 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Karthikeyan et al. | RETRACTED ARTICLE: Towards developing hybrid educational data mining model (HEDM) for efficient and accurate student performance evaluation | |
| US20150050637A1 (en) | System and method for early warning and recognition for student achievement in schools | |
| CN111709575A (en) | Academic achievement prediction method based on C-LSTM | |
| CN118396795B (en) | Campus life recording method integrating large models | |
| CN119988735B (en) | Talent culture recommendation method based on federal learning and natural language processing | |
| CN108897750B (en) | Personalized location recommendation method and device integrating multiple contextual information | |
| Gupta et al. | Machine learning approaches for student performance prediction | |
| Peng | Research on online learning behavior analysis model in big data environment | |
| CN120069258A (en) | Online education learning path optimization system based on big data and intelligent analysis | |
| Tobey et al. | Interpretable models for the automated detection of human trafficking in illicit massage businesses | |
| Hassan et al. | Identification of Technical and Vocational Education and Training (TVET) trainee’s personality attributes which impact skills learning | |
| Kırdar et al. | A design proposal of integrated smart mobility application for travel behavior change towards sustainable mobility | |
| Holding et al. | Quantifying the mover’s advantage: transatlantic migration, employment prestige, and scientific performance | |
| Saranya et al. | ENHANCED PREDICTION OF STUDENT DROPOUTS USING FUZZY INFERENCE SYSTEM AND LOGISTIC REGRESSION. | |
| CN112800210A (en) | Crowd portrait algorithm based on massive bus data | |
| CN109299167B (en) | A visualization method to display family migration history and family development | |
| Chiang et al. | Linear correlation discovery in databases: a data mining approach | |
| Langan et al. | Benchmarking factor selection and sensitivity: a case study with nursing courses | |
| Govindarajan | Educational data mining techniques and applications | |
| Yuan | Recommended Teaching Resources for Ideological and Political Courses Based on Normalized Discounted Cumulative Gain | |
| CN113051469A (en) | Subject selection recommendation method based on K-clustering algorithm | |
| Jenitha et al. | Prediction of Students' Performance based on Academic, Behaviour, Extra and Co-Curricular Activities. | |
| Kubegenova et al. | Using the data mining tool to analyze student performance | |
| Slomczynski et al. | On the Future of Survey Data Harmonization | |
| US20200104799A1 (en) | Identifying fake positions |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant | ||
| CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20210813 |
|
| CF01 | Termination of patent right due to non-payment of annual fee |