Commit e3d64e2
Updates the doc Quickstart
1 parent 03db974

File tree

3 files changed (+96 -61 lines)

docs/source/conf.py

Lines changed: 1 addition & 1 deletion

@@ -46,7 +46,7 @@
 # The theme to use for HTML and HTML Help pages. See the documentation for
 # a list of builtin themes.
 #
-html_theme = 'classic'
+html_theme = 'sphinx_rtd_theme'

 # Add any paths that contain custom static files (such as style sheets) here,
 # relative to this directory. They are copied after the builtin static files,

docs/source/index.rst

Lines changed: 2 additions & 5 deletions

@@ -1,9 +1,6 @@
-.. pyldl documentation master file, created by
-   sphinx-quickstart on Mon Nov 15 19:11:20 2021.
-   You can adapt this file completely to your liking, but it should at least
-   contain the root `toctree` directive.
+.. discriminative_lexicon_model documentation master file

-.. automodule:: pyldl
+.. automodule:: discriminative_lexicon_model

 .. toctree::
    :maxdepth: 3

docs/source/quickstart.rst

Lines changed: 93 additions & 55 deletions

@@ -17,97 +17,135 @@ Installation
 Quick overview of the theory "Discriminative Lexicon Model (DLM)"
 =================================================================

-In DLM, language processing is modelled as linear mappings between word-forms and word-meanings. Word-forms and word-meanings can be defined in any way, as long as each word form/meaning is expressed in the form of a vector (i.e., an array of numbers). Word-forms are stacked up to be a matrix called the *C* matrix. Word-meanings are stacked up to be another matrix called the *S* matrix. The comprehension process can be modelled as receiving word-forms (i.e., C) and predicting word-meanings (i.e., S). Such a matrix that approximates S as closely as possible based on C can be estimated either analytically or computationally (see [1]_ for more detail), and it is called the *F* matrix. With C and F, the approximation (prediction) of S can be derived, and it is called the :math:`\hat{S}` matrix. Similarly, the production process can be modelled as receiving word-meanings (i.e., S) and predicting word-forms (i.e., C). Such a matrix that approximates C based on S is called the *G* matrix. With S and G, the model's predictions about word-forms are obtained as yet another matrix. The matrix is called the :math:`\hat{C}` matrix. It is shown below how to set up and estimate these matrices.
+Short summary
+-------------
+DLM is a single model of language processing (covering both comprehension and production) consisting of 4 + 2 components (i.e., matrices): :math:`\mathbf{C}` (word-forms), :math:`\mathbf{S}` (word-meanings), :math:`\mathbf{F}` (form-meaning associations), :math:`\mathbf{G}` (meaning-form associations), :math:`\mathbf{\hat{C}}` (predicted word-forms), and :math:`\mathbf{\hat{S}}` (predicted word-meanings).
+
+A little bit more detail
+------------------------
+DLM is a language processing model based on learning. DLM usually consists of four components (matrices): :math:`\mathbf{C}` (word-forms), :math:`\mathbf{S}` (word-meanings), :math:`\mathbf{F}` (form-meaning associations), and :math:`\mathbf{G}` (meaning-form associations). DLM models comprehension as a mapping from forms to meanings: it estimates :math:`\mathbf{F}` so that the product of :math:`\mathbf{C}` and :math:`\mathbf{F}`, namely :math:`\mathbf{CF}` (i.e., the mapping of forms onto meanings), becomes as close as possible to :math:`\mathbf{S}`. :math:`\mathbf{CF}` is also called :math:`\mathbf{\hat{S}}`; it holds the model's predictions about word-meanings, while :math:`\mathbf{S}` holds the gold-standard "correct" meanings of those words. Similarly, DLM models speech production as a mapping from meanings to forms: it estimates :math:`\mathbf{G}` so that :math:`\mathbf{SG}` (also called :math:`\mathbf{\hat{C}}`) becomes as close as possible to :math:`\mathbf{C}` (i.e., the gold-standard correct form matrix). Conceptually, DLM is a single model containing these six components (:math:`\mathbf{C}`, :math:`\mathbf{S}`, :math:`\mathbf{F}`, :math:`\mathbf{G}`, :math:`\mathbf{\hat{C}}`, and :math:`\mathbf{\hat{S}}`). To reflect this conceptualization, *discriminative_lexicon_model* provides a class that has these matrices as attributes: ``discriminative_lexicon_model.ldl.LDL``.
+
+
+Create a model object
+=====================
+
+``discriminative_lexicon_model.ldl.LDL`` creates a model of DLM.
+
+.. code-block:: python
+
+   >>> import discriminative_lexicon_model as dlm
+   >>> mdl = dlm.ldl.LDL()
+   >>> print(type(mdl))
+   <class 'discriminative_lexicon_model.ldl.LDL'>
+   >>> mdl.__dict__
+   {}
+
+With no arguments, ``discriminative_lexicon_model.ldl.LDL`` creates an empty model (of DLM), which is then populated with the class methods described below.
+
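The comprehension and production mappings described above can be sketched in plain numpy, using made-up toy matrices (this illustrates only the underlying linear algebra, not the package API):

```python
import numpy as np

# Toy matrices for illustration: 3 words, 4 form cues, 2 semantic dimensions.
C = np.array([[1., 0., 1., 0.],   # word-forms (one row per word)
              [0., 1., 1., 0.],
              [1., 0., 0., 1.]])
S = np.array([[1., 0.],           # word-meanings (one row per word)
              [0., 1.],
              [1., 1.]])

# Comprehension: estimate F so that C @ F approximates S as closely as possible.
F = np.linalg.pinv(C) @ S
S_hat = C @ F                      # predicted meanings (S-hat)

# Production: estimate G so that S @ G approximates C as closely as possible.
G = np.linalg.pinv(S) @ C
C_hat = S @ G                      # predicted forms (C-hat)

# C has linearly independent rows here, so the comprehension mapping is exact.
print(np.allclose(S_hat, S))       # True
```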
 
 
 Set up the basis matrices C and S
 =================================
 
+In order to estimate the association matrices and create predictions based on them, :math:`\mathbf{C}` and :math:`\mathbf{S}` must be set up first.
+
+
 C-matrix
 --------
 
-The C matrix is a collection of form-vectors of words. You can create a C-matrix from a list of words by using discriminative_lexicon_model.mapping.gen_cmat.
+:math:`\mathbf{C}` is a collection of form-vectors of words. :math:`\mathbf{C}` can be created from a list of words by ``discriminative_lexicon_model.ldl.LDL.gen_cmat``.
+
 
 .. code-block:: python
 
-   >>> import discriminative_lexicon_model as dlm
-   >>> words = ['walk','walked','walks']
-   >>> cmat = dlm.mapping.gen_cmat(words)
-   >>> cmat
-   <xarray.DataArray (word: 3, cues: 9)>
-   array([[1, 1, 1, 1, 0, 0, 0, 0, 0],
-          [1, 1, 1, 0, 1, 1, 1, 0, 0],
-          [1, 1, 1, 0, 0, 0, 0, 1, 1]])
-   Coordinates:
-     * word  (word) <U6 'walk' 'walked' 'walks'
-     * cues  (cues) <U3 '#wa' 'wal' 'alk' 'lk#' 'lke' 'ked' 'ed#' 'lks' 'ks#'
+   >>> mdl.gen_cmat(['walk','walked','walks'])
+   >>> print(mdl.cmat)
+   <xarray.DataArray (word: 3, cues: 9)>
+   array([[1, 1, 1, 1, 0, 0, 0, 0, 0],
+          [1, 1, 1, 0, 1, 1, 1, 0, 0],
+          [1, 1, 1, 0, 0, 0, 0, 1, 1]])
+   Coordinates:
+     * word  (word) <U6 'walk' 'walked' 'walks'
+     * cues  (cues) <U3 '#wa' 'wal' 'alk' 'lk#' 'lke' 'ked' 'ed#' 'lks' 'ks#'
+
 
 
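The cue inventory in the output above ('#wa', 'wal', ...) shows that the form vectors are built from letter trigrams of the #-padded word. For intuition, here is a package-independent sketch that reproduces this C matrix in plain Python (the helper ``trigrams`` is ours, not part of the package):

```python
# Reproduce the binary C matrix above from #-padded letter trigrams.
words = ['walk', 'walked', 'walks']

def trigrams(word):
    """All overlapping 3-letter cues of '#' + word + '#'."""
    padded = '#' + word + '#'
    return [padded[i:i + 3] for i in range(len(padded) - 2)]

# Collect the cue inventory in order of first appearance.
cues = []
for w in words:
    for g in trigrams(w):
        if g not in cues:
            cues.append(g)

# One row per word, one column per cue: 1 if the word contains the cue.
cmat = [[int(c in trigrams(w)) for c in cues] for w in words]

print(cues)
# ['#wa', 'wal', 'alk', 'lk#', 'lke', 'ked', 'ed#', 'lks', 'ks#']
print(cmat[0])
# [1, 1, 1, 1, 0, 0, 0, 0, 0]
```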
 S-matrix
 --------
 
-The S matrix is a collection of semantic vectors of words. For one method, an S-matrix can be set up by defining semantic dimensions by hand. This can be achieved by discriminative_lexicon_model.mapping.gen_smat_from_df.
+:math:`\mathbf{S}` is a collection of semantic vectors of words. :math:`\mathbf{S}` can be set up by means of ``discriminative_lexicon_model.ldl.LDL.gen_smat``. For its argument, semantic vectors need to be set up as a ``pandas.core.frame.DataFrame`` with words as its indices and semantic dimensions as its columns. Semantic dimensions can be defined either by hand or by an embedding algorithm such as word2vec or fastText. Regardless of how the semantics are constructed, ``discriminative_lexicon_model.ldl.LDL.gen_smat`` sets up :math:`\mathbf{S}`, as long as the dataframe given as its (first) argument follows the right format (i.e., rows = words, columns = semantic dimensions). In the example below, semantic dimensions are set up by hand.
 
 
 .. code-block:: python
 
-   >>> import pandas as pd
-   >>> smat = pd.DataFrame({'WALK':[1,1,1], 'Present':[1,0,1], 'Past':[0,1,0], 'ThirdPerson':[0,0,1]}, index=['walk','walked','walks'])
-   >>> smat = dlm.mapping.gen_smat_from_df(smat)
-   <xarray.DataArray (word: 3, semantics: 4)>
-   array([[1, 1, 0, 0],
-          [1, 0, 1, 0],
-          [1, 1, 0, 1]])
-   Coordinates:
-     * word       (word) <U6 'walk' 'walked' 'walks'
-     * semantics  (semantics) object 'WALK' 'Present' 'Past' 'ThirdPerson'
+   >>> import pandas as pd
+   >>> semdf = pd.DataFrame({'WALK':[1,1,1], 'Present':[1,0,1], 'Past':[0,1,0], 'ThirdPerson':[0,0,1]}, index=['walk','walked','walks'])
+   >>> print(semdf)
+           WALK  Present  Past  ThirdPerson
+   walk       1        1     0            0
+   walked     1        0     1            0
+   walks      1        1     0            1
+   >>> mdl.gen_smat(semdf)
+   >>> print(mdl.smat)
+   <xarray.DataArray (word: 3, semantics: 4)>
+   array([[1, 1, 0, 0],
+          [1, 0, 1, 0],
+          [1, 1, 0, 1]])
+   Coordinates:
+     * word       (word) <U6 'walk' 'walked' 'walks'
+     * semantics  (semantics) object 'WALK' 'Present' 'Past' 'ThirdPerson'
 
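As noted above, the semantic dimensions could instead come from pretrained embeddings. A sketch of what such a dataframe might look like, with invented embedding values standing in for real word2vec/fastText vectors:

```python
import pandas as pd

# Hypothetical embedding vectors (values invented for illustration);
# in practice these would be looked up in a word2vec or fastText model.
embeddings = {
    'walk':   [0.12, -0.40, 0.33],
    'walked': [0.10, -0.38, 0.35],
    'walks':  [0.11, -0.41, 0.30],
}
semdf = pd.DataFrame.from_dict(embeddings, orient='index',
                               columns=['dim0', 'dim1', 'dim2'])
# Rows = words, columns = semantic dimensions: the format gen_smat expects.
print(semdf.shape)  # (3, 3)
```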
 
 
-Estimation of the association matrices
-======================================
+
+Estimation of the association matrices F and G
+==============================================
 
 F-matrix
 --------
 
-With C and S established, the comprehension association matrix F can be estimated by discriminative_lexicon_model.mapping.gen_fmat.
+With :math:`\mathbf{C}` and :math:`\mathbf{S}` established, the comprehension association matrix :math:`\mathbf{F}` can be estimated by ``discriminative_lexicon_model.ldl.LDL.gen_fmat``. It requires no arguments, because :math:`\mathbf{C}` and :math:`\mathbf{S}` are already stored as attributes of the class and are therefore accessible to the model.
 
 .. code-block:: python
 
-   >>> fmat = dlm.mapping.gen_fmat(cmat, smat)
-   >>> fmat.round(2)
-   <xarray.DataArray (cues: 9, semantics: 4)>
-   array([[ 0.28,  0.23,  0.05,  0.08],
-          [ 0.28,  0.23,  0.05,  0.08],
-          [ 0.28,  0.23,  0.05,  0.08],
-          [ 0.15,  0.31, -0.15, -0.23],
-          [ 0.05, -0.23,  0.28, -0.08],
-          [ 0.05, -0.23,  0.28, -0.08],
-          [ 0.05, -0.23,  0.28, -0.08],
-          [ 0.08,  0.15, -0.08,  0.38],
-          [ 0.08,  0.15, -0.08,  0.38]])
-   Coordinates:
-     * cues       (cues) <U3 '#wa' 'wal' 'alk' 'lk#' 'lke' 'ked' 'ed#' 'lks' 'ks#'
-     * semantics  (semantics) object 'WALK' 'Present' 'Past' 'ThirdPerson'
+   >>> mdl.gen_fmat()
+   >>> print(mdl.fmat.round(2))
+   <xarray.DataArray (cues: 9, semantics: 4)>
+   array([[ 0.28,  0.23,  0.05,  0.08],
+          [ 0.28,  0.23,  0.05,  0.08],
+          [ 0.28,  0.23,  0.05,  0.08],
+          [ 0.15,  0.31, -0.15, -0.23],
+          [ 0.05, -0.23,  0.28, -0.08],
+          [ 0.05, -0.23,  0.28, -0.08],
+          [ 0.05, -0.23,  0.28, -0.08],
+          [ 0.08,  0.15, -0.08,  0.38],
+          [ 0.08,  0.15, -0.08,  0.38]])
+   Coordinates:
+     * cues       (cues) <U3 '#wa' 'wal' 'alk' 'lk#' 'lke' 'ked' 'ed#' 'lks' 'ks#'
+     * semantics  (semantics) object 'WALK' 'Present' 'Past' 'ThirdPerson'
 
 
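For intuition, this :math:`\mathbf{F}` can be reproduced outside the package: one standard analytic estimate is the least-squares solution :math:`\mathbf{F} = \mathbf{C}^{+}\mathbf{S}`, where :math:`\mathbf{C}^{+}` is the Moore-Penrose pseudoinverse. With the C and S from above, this sketch reproduces the fmat values shown (it is an illustration of the math, not the package's internals):

```python
import numpy as np

# C and S from the examples above (rows = words).
C = np.array([[1, 1, 1, 1, 0, 0, 0, 0, 0],
              [1, 1, 1, 0, 1, 1, 1, 0, 0],
              [1, 1, 1, 0, 0, 0, 0, 1, 1]], dtype=float)
S = np.array([[1, 1, 0, 0],
              [1, 0, 1, 0],
              [1, 1, 0, 1]], dtype=float)

# Least-squares estimate of the form-meaning associations.
F = np.linalg.pinv(C) @ S
print(F.round(2)[0])          # first cue '#wa': [0.28 0.23 0.05 0.08]

# C has full row rank here, so the mapping recovers S exactly.
print(np.allclose(C @ F, S))  # True
```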
 G-matrix
 --------
 
-The production association matrix G can be obtained by discriminative_lexicon_model.mapping.gen_gmat.
+Similarly, with :math:`\mathbf{C}` and :math:`\mathbf{S}` established, the production association matrix :math:`\mathbf{G}` can be estimated by ``discriminative_lexicon_model.ldl.LDL.gen_gmat``. It, too, requires no arguments, because :math:`\mathbf{C}` and :math:`\mathbf{S}` are already stored as attributes of the class and are therefore accessible to the model.
 
 .. code-block:: python
 
-   >>> gmat = dlm.mapping.gen_gmat(cmat, smat)
-   >>> gmat.round(2)
-   <xarray.DataArray (semantics: 4, cues: 9)>
-   array([[ 0.67,  0.67,  0.67,  0.33,  0.33,  0.33,  0.33, -0.  , -0.  ],
-          [ 0.33,  0.33,  0.33,  0.67, -0.33, -0.33, -0.33, -0.  , -0.  ],
-          [ 0.33,  0.33,  0.33, -0.33,  0.67,  0.67,  0.67, -0.  , -0.  ],
-          [ 0.  ,  0.  ,  0.  , -1.  ,  0.  ,  0.  ,  0.  ,  1.  ,  1.  ]])
-   Coordinates:
-     * semantics  (semantics) object 'WALK' 'Present' 'Past' 'ThirdPerson'
-     * cues       (cues) <U3 '#wa' 'wal' 'alk' 'lk#' 'lke' 'ked' 'ed#' 'lks' 'ks#'
+   >>> mdl.gen_gmat()
+   >>> print(mdl.gmat.round(2))
+   <xarray.DataArray (semantics: 4, cues: 9)>
+   array([[ 0.67,  0.67,  0.67,  0.33,  0.33,  0.33,  0.33, -0.  , -0.  ],
+          [ 0.33,  0.33,  0.33,  0.67, -0.33, -0.33, -0.33, -0.  , -0.  ],
+          [ 0.33,  0.33,  0.33, -0.33,  0.67,  0.67,  0.67, -0.  , -0.  ],
+          [ 0.  ,  0.  ,  0.  , -1.  ,  0.  ,  0.  ,  0.  ,  1.  ,  1.  ]])
+   Coordinates:
+     * semantics  (semantics) object 'WALK' 'Present' 'Past' 'ThirdPerson'
+     * cues       (cues) <U3 '#wa' 'wal' 'alk' 'lk#' 'lke' 'ked' 'ed#' 'lks' 'ks#'
 
 
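By symmetry, the production associations admit the analogous least-squares estimate :math:`\mathbf{G} = \mathbf{S}^{+}\mathbf{C}`. The sketch below (again an illustration of the math, not the package's internals) reproduces the gmat values above:

```python
import numpy as np

# S and C from the examples above (rows = words).
S = np.array([[1, 1, 0, 0],
              [1, 0, 1, 0],
              [1, 1, 0, 1]], dtype=float)
C = np.array([[1, 1, 1, 1, 0, 0, 0, 0, 0],
              [1, 1, 1, 0, 1, 1, 1, 0, 0],
              [1, 1, 1, 0, 0, 0, 0, 1, 1]], dtype=float)

# Least-squares estimate of the meaning-form associations.
G = np.linalg.pinv(S) @ C

# The 'ThirdPerson' row weights the 3rd-person cues 'lks' and 'ks#'
# positively and the bare-stem cue 'lk#' negatively.
print(np.allclose(G[3], [0, 0, 0, -1, 0, 0, 0, 1, 1]))  # True
```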
@@ -117,7 +155,7 @@ Prediction of the form and semantic matrices
 S-hat matrix
 ------------
 
-The S-hat matrix (:math:`\mathbf{\hat{S}}`) can be obtained by discriminative_lexicon_model.mapping.gen_shat.
+The model's predictions about word-meanings based on word-forms (i.e., :math:`\mathbf{\hat{S}}`) can be obtained by ``discriminative_lexicon_model.ldl.LDL.gen_shat``, given that :math:`\mathbf{C}` and :math:`\mathbf{F}` are already set up and stored as attributes of the class instance.
 
 .. code-block:: python
 
@@ -135,7 +173,7 @@ The S-hat matrix (:math:`\mathbf{\hat{S}}`) can be obtained by discriminative_le
 C-hat matrix
 ------------
 
-The C-hat matrix (:math:`\mathbf{\hat{C}}`) can be obtained with discriminative_lexicon_model.mapping.gen_chat.
+Similarly, the model's predictions about word-forms based on word-meanings (i.e., :math:`\mathbf{\hat{C}}`) can be obtained with ``discriminative_lexicon_model.ldl.LDL.gen_chat``, given that :math:`\mathbf{S}` and :math:`\mathbf{G}` are already set up and stored as attributes of the class instance.
 
 .. code-block:: python
 
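One way to check the predicted matrices is to ask, for each word, which gold-standard row its predicted row resembles most (here by correlation). This evaluation sketch is our own illustration, not a package function:

```python
import numpy as np

words = ['walk', 'walked', 'walks']
C = np.array([[1, 1, 1, 1, 0, 0, 0, 0, 0],
              [1, 1, 1, 0, 1, 1, 1, 0, 0],
              [1, 1, 1, 0, 0, 0, 0, 1, 1]], dtype=float)
S = np.array([[1, 1, 0, 0],
              [1, 0, 1, 0],
              [1, 1, 0, 1]], dtype=float)

# Comprehension predictions: S-hat = CF, with F estimated by least squares.
F = np.linalg.pinv(C) @ S
S_hat = C @ F

# Correlate every predicted row with every gold-standard row.
n = len(words)
corr = np.corrcoef(S_hat, S)[:n, n:]
nearest = [words[i] for i in corr.argmax(axis=1)]
print(nearest)  # ['walk', 'walked', 'walks']: each word maps to its own meaning
```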
