Object

com.enriquegrodrigo.spark.crowd.methods

DawidSkene

Related Doc: package methods

Permalink

object DawidSkene

Provides functions for transforming an annotation dataset into a standard label dataset using the DawidSkene algorithm.

This algorithm only works with types.MulticlassAnnotation datasets although one can easily use it for types.BinaryAnnotation through Spark Dataset as method

It returns a types.DawidSkeneModel with information about the estimation of the true class, as well as the annotator quality and the log-likelihood obtained by the model.

Example:
  1. import com.enriquegrodrigo.spark.crowd.methods.DawidSkene
    import com.enriquegrodrigo.spark.crowd.types._
    val exampleFile = "data/multi-ann.parquet"
    val exampleData = spark.read.parquet(exampleFile).as[MulticlassAnnotation]
    //Applying the learning algorithm
    val mode = DawidSkene(exampleData)
    //Get MulticlassLabel with the class predictions
    val pred = mode.getMu().as[MulticlassLabel]
    //Annotator precision matrices
    val annprec = mode.getAnnotatorPrecision()
    //Annotator likelihood
    val like = mode.getLogLikelihood()
Version

0.1.5

See also

Dawid, Alexander Philip, and Allan M. Skene. "Maximum likelihood estimation of observer error-rates using the EM algorithm." Applied statistics (1979): 20-28.

Linear Supertypes
AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. DawidSkene
  2. AnyRef
  3. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. def apply(dataset: Dataset[MulticlassAnnotation], eMIters: Int = 10, eMThreshold: Double = 0.001): DawidSkeneModel

    Permalink

    Applies learning algorithm.

    Applies learning algorithm.

    dataset

    The dataset over which the algorithm will execute (spark Dataset of type types.MulticlassAnnotation

    eMIters

    Number of iterations for the EM algorith

    eMThreshold

    LogLikelihood variability threshold for the EM algorithm

    returns

    types.DawidSkeneModel

    Version

    0.2.0

  5. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  6. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  7. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  8. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  9. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  10. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  11. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  12. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  13. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  14. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  15. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  16. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  17. def toString(): String

    Permalink
    Definition Classes
    AnyRef → Any
  18. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  19. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  20. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from AnyRef

Inherited from Any

Ungrouped