General "Knowledge Representation and Sharing" and complementary techniques I designed to support that


Ph. Martin

LIM, University of La Réunion, France


www.phmartin.info/slides/lim2021/

Preliminary definitions (for this presentation)

(Formal) Identifier (ID), alias "unique identifier" (UID), "formal lexical term" or, in logics, "symbol": lexical object declared as having a unique meaning → not just a "name" (informal ID). (In logics, a sequence of symbols (IDs) is used to create either  a term (formal expression) that denotes an object, or  a formula (sentence) that denotes a fact.) Formal syntax: syntax (grammar) composed of terms: terminal ones (IDs: lexical terms) and non-terminal ones (directly or indirectly) defined into terminal ones. Model-theoretic semantic/interpretation (1 formal way to define semantics): function that maps terms to individual(object)s, and sentences to truth values.


Information:  Information Object(s). Sequence of IDs that conforms to a formal/informal syntax and that expresses something. Knowledge or Data   (↔ Info base: Knowledge base or Database) Not here: - "knowledge: (information: data + semantic/types) + context/understanding/entailment/strategy" - "knowledge: thing known to be true by/given observations/beliefs/assertions + deductions " as in Epistemology or Epistemic logic.


Knowledge:  Knowledge Representations (KRs). Information that is, at least partially, represented and organized - in some logics, and - via semantic relations (subtype, part, result, instrument, time, place, author, ...), and - in a way known by the interpreting agent (person or software). Example 1 and 2 in Section 2.1. Semi-formal KR: KR relating informal IDs/sentences by semantic (→ formal) relations.


Data:  information that is not knowledge.

Plan


1.  Need for general Knowledge Sharing (KS)

      1.1.  Need for knowledge bases (KBs) to be used instead of databases

      1.2.  Need to distinguish present-time KS and general KS



2.  Some complementary techniques I designed to support general knowledge sharing

      2.1.  Unrestricted/"à la carte" formal languages

      2.2.  General-purpose ontologies aligning top-level ones and lexical ones

      2.3.  KB servers that support non-restricting KB Sharing by Web users

      2.4.  KB servers that support networked KBs

1.1. Need for KBs to be used instead of databases – Definitions


Knowledge Base (KB):  set of IDs and of KRs using these IDs. 


Database (DB):  information base that is not a KB.

1.1. Need for KBs to be used instead of databases – Notes

Note 1. As in programming, the more expressive the used formal language, the more objects (functions, relations, ...) can be represented explicitly (→ as "1st class" objects) and hence named, quantified, related, ... and then exploited. With most DB systems, the end-user cannot define nor set relations, the schema is mostly a tree → implicit redundancies (e.g. those that form-normalizing a DB, from 1NF to 6NF, tries to remove) → arbitrary choices (e.g. making a class Employee part of a class Company, or conversely, or choosing one plan/structure in an article, web site, wiki, ...). Note 2. The implementation of a KB system can reuse a DB system with a schema of mainly two tables/classes: one for "explicit relation nodes", one for explicit concept nodes.

1.1. Need for KBs to be used instead of databases


Possibility+need to


BUT

* manually representing information is difficult (especially for large KBs and general KS)

* automatically representing the content of a document requires the software to "understand" it

* automatically merging a set of KBs not unified into a "networked KB" by the knowledge authors does not lead to good results if only because the required information is only in the head of the knowledge authors

1.1. ... – Example of KR organization by generalization/implication/exclusion

Entity £ Process Bird↗ ↖Counting //or: Flight Tweety

`no Bird can be agent of a Counting´ //false belief £ £ £ `at least 1 Bird `at least 50% of Bird `every Clever Bird can be agent of can be agent of can be agent of a Counting´ a Counting´ a Counting´ ⇖↘ ⇗↙ `1 Bird `Tweety can be `every Bird can be agent of a Counting can be agent of that has for duration agent of a Counting´ at least 0.5 Hour´ a Counting´ //if ... `Tweety has been agent of a Counting `every Bird|Bat can with duration at least 0.5 Hour´ be agent of a Counting´

Legend. "": generalization that is not an implication, e.g. subtypeOf, instanceOf; "": implication; "£": exclusion ( x £ y <=> ((x ⇒ ¬y) ∨ (x → ¬y)) ); "can": is able to; every sentence is in FE; relation types are in italics; concept types begin by an uppercase; the authors of IDs, sentences and relations are not represented (unlike in this example); in FE, "every" (alias 100%) and "%" are for "observations" and hence imply "at least 1", whereas "any" is for a "definition" and hence does not imply "at least 1"; the distinction is important since observations may be false while definitions cannot (since agents can give any identifier they want to the types they create) and thus cannot be corrected or contradicted

1.2. Need to distinguish present-time KS and general KS


Knowledge sharing as commonly currently thought (since 2000 / the W3C's "Linked Data"):
- for a known application and, generally, for efficient+complete knowledge inferencing
- between agents that can discuss to solve ambiguities: B2B KS.


KS in the general sense (or pre-W3C):  KR in explicit ways to maximize knowledge reuse.
KR creators do not make exploitation-dependent choices → each application can make them.
This goal was forgotten/abandoned since the "Linked Data" Web in the same way as advanced
researches on hypertext systems were forgotten/abandoned after the beginning of the Web.


The W3C (WWW Consortium),  researchers in knowledge inferencing,  and
for hype/grant/publication/... reasons, most researchers since 2000

→ misleading + unscalable "KS supporting" techniques + poorly reusable KRs.

1.2. Need to distinguish present-time KS and general KS


General KS is necessary for

1.2. Need to distinguish present-time KS and general KS

(... General KS is necessary for ...)

2.  Underlying ideas of some complementary techniques that I designed to support general knowledge sharing


2.1.  Unrestricted/"à la carte" formal languages


2.2.  General-purpose ontologies aligning top-level ones and lexical ones


2.3.  KB servers that support non-restricting KB Sharing by Web users


2.4.  KB servers that support networked KBs

2.1. Unrestricted/"à la carte" formal languages


2.1. ... – Example 1

En: By definition, a flying_bird_with_2_wings is a bird that flies and has two wings. LP: Flying_bird_with_2_wings (b) := Bird(b) ∧ ∃f Flight(f) ∧ agent(f,b) ∧ ∃w1,w2 Wing(w1) ∧ part(b,w1) ∧ part(b,w2) ∧ w1!=w2 FE: any Flying_bird_with_2_wings is a Bird that is agent of a Flight and has for part 2 Wing. FL: Flying_bird_with_2_wings = ^(Bird agent of: a Flight, part: 2 Wing). RDF+OWL2 / Turtle: :Flying_bird_with_2_wings owl:intersectionOf (:Bird [a owl:Restriction; owl:onProperty :agent; owl:someValuesFrom :Flight] [a owl:Restriction; owl:onProperty :wingPart; owl:qualifiedCardinality 2]);

UML_model / UML_concise_notation:   

2.1. ... – Example 2

En: On March 21st 2016, John Doe believed that in 2015 and in the USA, at least 78% of adult healthy carinate birds were able to fly. FL: [ [ [ [ [at least 78% of Adult Healthy Carinate_bird is able to be agent of: a Flight ] place: USA ] time: 2015 ] believer: John_Doe ] time: 2016-03-21 ]. FE: ` ` ` ` `at least 78% of Adult Healthy Carinate_bird is able to be agent of: a Flight´ at place USA´ at time 2015´ for believer John_Doe´ at time 2016-03-21´. IKLmE / Turtle: [rdf:value [rdf:value [rdf:value [rdf:value [rdf:value [rdf:value [:agent_of [a :Flight] ]; pm:q_ctxt [quantifier "78to100pc"; rdf:type :Adult, :Healthy, :Carinate_bird ] ]; pm:ctxt [:modality :Physical_possibility] ]; pm:ctxt [:place :USA] ]; pm:ctxt [:time "2015"] ]; pm:ctxt [:believer :John_Doe] ]; pm:ctxt [:time 2016-03-21] ].

2.2. General-purpose ontologies aligning top-level ones and lexical ones


2.3. KB servers that support non-restricting KB Sharing by Web users

2.3. KB servers that support non-restricting KB Sharing by Web users

2.3. ... – Examples of additive corrections

u1#`every bird is agent of a flight´ | \c=> _[u3] | ↘ u3#`at least 75% of healthy flying_bird can be agent of a flight´ | |c=>/^ _[u2] |c=> _[u3] ... | u2#`every bird can be agent of a flight´


Legend. "------(typeID) _[userID]----→": relation of type typeID, created by userID "u1#...": u1 is the author of the prefixed statement; "c=>/^": correction and implication and semantic/structural generalization; "c=>": correction and implication (no specialization/generalization); "": implication relation with destination on the left; "every": 100%

2.4. KB servers that support networked KBs

2.4. KB servers that support networked KBs – Solution details – Part 1

2.4. KB servers that support networked KBs – Solution details – Part 2

3. Conclusion


General KS