Skip to content

Why should the number of SAP representation protein sequence file lines and the number of Canonical compound SMILE file lines match?  #9

@CallMeDek

Description

@CallMeDek

Hi,

I am trying to get results of my own data with your model.

(1) According to the file "DeepAffinity_inference.sh", it seems that the number of lines for input protein sequences file and compound file must matches like below.
스크린샷, 2022-09-22 10-46-42
Is it mean that the number of each entity in both files have to be matched or literally the the number of lines of both files have to be matched?

(2) I got two files for my own data after following your manual.
Could you tell me if their entities' structure are correct for model input?

  • CID_Smi_Feature:
    스크린샷, 2022-09-22 11-31-44
  • protein_grouped_finalPresentation
    스크린샷, 2022-09-22 11-33-55

Thank you,
CallMeDek

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions