Skip to content

Bias path is not working correctly #52

@umarbutler

Description

@umarbutler

Hi,
I have observed that including bias seems to cause the loss produced by my models to diverge significantly from the losses produced without cut cross-entropy. They tend to be much, much lower, often in the negative ranges. Omitting bias seems to improve things dramatically, such that losses are within a range of +/-0.05 of their originals.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions