; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC10G192940 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC10G192940
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionProtein of unknown function (DUF1997)
Genome locationCiama_Chr10:27699166..27705286
RNA-Seq ExpressionCaUC10G192940
SyntenyCaUC10G192940
Gene Ontology termsGO:0070300 - phosphatidic acid binding (molecular function)
InterPro domainsIPR018971 - Protein of unknown function DUF1997


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136694.1 uncharacterized protein LOC101213732 isoform X1 [Cucumis sativus]9.9e-10374.81Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNFMCFAVKHNNSN-NNHQSPPIISLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGK
        M H++VAVS Q PQL+INPN +  SKCYVH KKK+    YSNF+CFA+K NNSN N  Q+PPI SL+FSSF PLSESPQASFDDYIEDEARLLRATF GK
Subjt:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNFMCFAVKHNNSN-NNHQSPPIISLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGK

Query:  SEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALYAERTESKSVLTNNLLL
        SEK+NQD+WRVEMPSFQ+L +KVSPVADVRL+C+  SST+D PIHIP +VSKFIDLQLM WE+KGL  DFK  K  INV GA+YAERT+SKSVLTNNLLL
Subjt:  SEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALYAERTESKSVLTNNLLL

Query:  NLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANY
        NL+N A   P+DFFAQD  QPL EKGLKGMMEE M EFTENLLLDY+KYKKE Q+NEVP+NY
Subjt:  NLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANY

XP_008443384.1 PREDICTED: uncharacterized protein LOC103486982 [Cucumis melo]4.2e-10173.86Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNFMCFAVKHN------NSNNNHQSPPIISLRFSSFHPLSESPQASFDDYIEDEARLLRA
        M HNLVAVS Q PQLIINPN +  SKCYVH KKK+    YSNF+CFA+K N      N+NN +Q+PPI SL+FSSFHPLSESPQASFDDYIEDE RLLRA
Subjt:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNFMCFAVKHN------NSNNNHQSPPIISLRFSSFHPLSESPQASFDDYIEDEARLLRA

Query:  TFVGKSEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALYAERTESKSVLT
        TF GKSEK++QD WRVEMP+FQ+L +KVSPVADVRL+C+SC  T+D PIHIPH+VSKFIDLQLM WE+KGL  DFK  K  INV GA+YAERT+SKSVL 
Subjt:  TFVGKSEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALYAERTESKSVLT

Query:  NNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ-ENEV
        NNLLLNL+N A P P+DFFAQD  QPLAEKGLKGMMEE M EF ENLLLDY+KYKKEKQ +NEV
Subjt:  NNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ-ENEV

XP_022934711.1 uncharacterized protein LOC111441814 isoform X2 [Cucurbita moschata]1.1e-8266.67Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKHNNSNNNHQSPPIISLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD
        M HNL AVSF FPQLIIN     KC  HQ++     F  FAVK  N+NNNHQ+PPI SLRFS+FHPL ESP ASFD+YI DE RLLRATF GKSEKLN+ 
Subjt:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKHNNSNNNHQSPPIISLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD

Query:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALYAERT--ESKSVLTNNLLLNLHNF
        EWRVEMPSFQLL +K+SPV DVRL+C+  SST+DYPIHIP HVSKF+DLQ+MRWEV+G+G DFKP+ F I+V G +YA RT  ESKS+L N+L+L+LH+F
Subjt:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALYAERT--ESKSVLTNNLLLNLHNF

Query:  AAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ
         +P P DF      QP AEKGL+GMM+E+M +FT+NL+LDY+KYKKEKQ
Subjt:  AAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ

XP_038905853.1 uncharacterized protein LOC120091799 isoform X1 [Benincasa hispida]4.4e-11982.81Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKHNNSNNNHQSPPIISLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD
        M HNLVAVSFQ PQLIIN N RSKCYVH KKK+YS+F+CFAVK+NNSN++HQ+PPI SL+FSSFHPLSESPQASFDDYIEDEARLLR TF GKSEK+NQD
Subjt:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKHNNSNNNHQSPPIISLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD

Query:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALYAERTESKSVLTNNLLLNLHNFAA
        EWR++MPSFQL   +VS VADVRLNCRS ++ QDYPIHIPHHVSKFIDLQLMRWE+KGLGT+FKP++FTINV GALYAERTESKS+LTNN +LNLHNFAA
Subjt:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALYAERTESKSVLTNNLLLNLHNFAA

Query:  PTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANYG
        PTP DFFAQD  QP AEKGLKGMMEETMNEFTE LLLDYSKYKKEKQ+NEV AN G
Subjt:  PTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANYG

XP_038905855.1 uncharacterized protein LOC120091799 isoform X2 [Benincasa hispida]3.2e-8566.8Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKHNNSNNNHQSPPIISLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD
        M HNLVAVSFQ PQLIIN N RSKCYVH KKK+YS+F+CFAVK+NNSN++HQ+PPI SL+FSSFHPLSESPQASFDDYIEDEARLLR TF GKSEK+NQ 
Subjt:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKHNNSNNNHQSPPIISLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD

Query:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALYAERTESKSVLTNNLLLNLHNFAA
                                                           MRWE+KGLGT+FKP++FTINV GALYAERTESKS+LTNN +LNLHNFAA
Subjt:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALYAERTESKSVLTNNLLLNLHNFAA

Query:  PTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANYG
        PTP DFFAQD  QP AEKGLKGMMEETMNEFTE LLLDYSKYKKEKQ+NEV AN G
Subjt:  PTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANYG

TrEMBL top hitse value%identityAlignment
A0A0A0LC26 Uncharacterized protein4.8e-10374.81Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNFMCFAVKHNNSN-NNHQSPPIISLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGK
        M H++VAVS Q PQL+INPN +  SKCYVH KKK+    YSNF+CFA+K NNSN N  Q+PPI SL+FSSF PLSESPQASFDDYIEDEARLLRATF GK
Subjt:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNFMCFAVKHNNSN-NNHQSPPIISLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGK

Query:  SEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALYAERTESKSVLTNNLLL
        SEK+NQD+WRVEMPSFQ+L +KVSPVADVRL+C+  SST+D PIHIP +VSKFIDLQLM WE+KGL  DFK  K  INV GA+YAERT+SKSVLTNNLLL
Subjt:  SEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALYAERTESKSVLTNNLLL

Query:  NLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANY
        NL+N A   P+DFFAQD  QPL EKGLKGMMEE M EFTENLLLDY+KYKKE Q+NEVP+NY
Subjt:  NLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANY

A0A1S3B8N8 uncharacterized protein LOC1034869822.0e-10173.86Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNFMCFAVKHN------NSNNNHQSPPIISLRFSSFHPLSESPQASFDDYIEDEARLLRA
        M HNLVAVS Q PQLIINPN +  SKCYVH KKK+    YSNF+CFA+K N      N+NN +Q+PPI SL+FSSFHPLSESPQASFDDYIEDE RLLRA
Subjt:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNFMCFAVKHN------NSNNNHQSPPIISLRFSSFHPLSESPQASFDDYIEDEARLLRA

Query:  TFVGKSEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALYAERTESKSVLT
        TF GKSEK++QD WRVEMP+FQ+L +KVSPVADVRL+C+SC  T+D PIHIPH+VSKFIDLQLM WE+KGL  DFK  K  INV GA+YAERT+SKSVL 
Subjt:  TFVGKSEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALYAERTESKSVLT

Query:  NNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ-ENEV
        NNLLLNL+N A P P+DFFAQD  QPLAEKGLKGMMEE M EF ENLLLDY+KYKKEKQ +NEV
Subjt:  NNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ-ENEV

A0A6J1F2K4 uncharacterized protein LOC111441814 isoform X25.5e-8366.67Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKHNNSNNNHQSPPIISLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD
        M HNL AVSF FPQLIIN     KC  HQ++     F  FAVK  N+NNNHQ+PPI SLRFS+FHPL ESP ASFD+YI DE RLLRATF GKSEKLN+ 
Subjt:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKHNNSNNNHQSPPIISLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD

Query:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALYAERT--ESKSVLTNNLLLNLHNF
        EWRVEMPSFQLL +K+SPV DVRL+C+  SST+DYPIHIP HVSKF+DLQ+MRWEV+G+G DFKP+ F I+V G +YA RT  ESKS+L N+L+L+LH+F
Subjt:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALYAERT--ESKSVLTNNLLLNLHNF

Query:  AAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ
         +P P DF      QP AEKGL+GMM+E+M +FT+NL+LDY+KYKKEKQ
Subjt:  AAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ

A0A6J1F3D2 uncharacterized protein LOC111441814 isoform X11.3e-7961.4Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKHNNSNNNHQSPPIISLRFSSFHPLSESP-----------------------QASFDD
        M HNL AVSF FPQLIIN     KC  HQ++     F  FAVK  N+NNNHQ+PPI SLRFS+FHPL ESP                       QASFD+
Subjt:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKHNNSNNNHQSPPIISLRFSSFHPLSESP-----------------------QASFDD

Query:  YIEDEARLLRATFVGKSEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALY
        YI DE RLLRATF GKSEKLN+ EWRVEMPSFQLL +K+SPV DVRL+C+  SST+DYPIHIP HVSKF+DLQ+MRWEV+G+G DFKP+ F I+V G +Y
Subjt:  YIEDEARLLRATFVGKSEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALY

Query:  AERT--ESKSVLTNNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ
        A RT  ESKS+L N+L+L+LH+F +P P DF      QP AEKGL+GMM+E+M +FT+NL+LDY+KYKKEKQ
Subjt:  AERT--ESKSVLTNNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ

A0A6J1J0I3 uncharacterized protein LOC1114823527.2e-8366.01Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKHNNSNNNHQSPPIISLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD
        M HNL AVSF FPQLII+     KC  HQK+    +F  FAVK  N+NNNHQ+PPI SLRFS+FHPL ESP ASFD+YI DE RLLRATF GKSEKLN+ 
Subjt:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKHNNSNNNHQSPPIISLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD

Query:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALYAERT--ESKSVLTNNLLLNLHNF
        EWRVEMPSFQLL +K+SP+ DVRL+CRSC+  +DYPIHIP HVSKF+DLQ+MRWEV+G+G DFK + F I+V GA YA RT  ESKSVL N+L+L+LH+F
Subjt:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALYAERT--ESKSVLTNNLLLNLHNF

Query:  AAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEV
         +  P DF      QP AEKGLKGMM+E+M +FT+NL+LDY+KYKKEKQ   V
Subjt:  AAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G39520.1 Protein of unknown function (DUF1997)9.1e-3034.02Show/hide
Query:  SLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSE--KLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWE
        S + S+   L ESPQA FD+Y+ED++R+  A F  K +  +LN++EWR++M   +   +   PV  +R+ C+  S+ QDYP  +P H++K ++L + +WE
Subjt:  SLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSE--KLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWE

Query:  VKGLGTDFKPKKFTINVNGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQEN
        ++GL    +P  FT+ V GALY +R    + L   L   + +F  P+ L    +D+ + +A   L G+++   +   E+L+ DYSK+K E++++
Subjt:  VKGLGTDFKPKKFTINVNGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQEN

AT5G39530.1 Protein of unknown function (DUF1997)1.2e-3438.27Show/hide
Query:  PPIISLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGK--SEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQL
        P   S R S+  PL+ESPQA FD+Y+ED++R+  A F  K  S +LN++EWR++M     L + V PV D+RL C+  S+ QDYP  +P  ++K ++L +
Subjt:  PPIISLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGK--SEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQL

Query:  MRWEVKGLGTDFKPKKFTINVNGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ
        MRW+++GL    +P  F++ V GALY +R    + L   L +N+ +F  P  L+   +D+ + LA   L G++E   ++   +LL DYS++K E++
Subjt:  MRWEVKGLGTDFKPKKFTINVNGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCATAATTTGGTTGCTGTTTCTTTTCAATTCCCACAGCTTATTATCAATCCAAACAACAGATCAAAATGTTATGTTCATCAGAAAAAGAAGAACTATTCTAATTT
TATGTGTTTTGCAGTGAAACATAATAATAGTAATAATAATCATCAAAGCCCTCCAATTATCTCTCTCAGATTCTCAAGTTTCCATCCACTTTCTGAGTCTCCTCAGGCTT
CCTTTGATGATTACATTGAAGATGAAGCTAGATTGTTGAGAGCCACTTTTGTTGGAAAAAGTGAAAAATTAAACCAGGATGAATGGAGAGTTGAAATGCCATCTTTCCAA
TTGCTTTTGGTCAAGGTCAGCCCGGTAGCTGACGTAAGATTAAATTGTAGAAGCTGTAGCTCTACTCAAGATTACCCTATTCATATTCCTCACCATGTCTCCAAATTTAT
TGACCTTCAATTGATGAGATGGGAGGTGAAGGGATTGGGCACAGATTTCAAACCAAAAAAGTTCACAATCAATGTAAATGGAGCTTTGTATGCTGAAAGAACAGAATCAA
AAAGTGTTCTCACAAATAATTTACTTCTCAATCTTCACAATTTTGCTGCCCCAACACCCCTTGATTTCTTTGCACAAGATCTTTTTCAACCTCTTGCAGAAAAGGGATTG
AAGGGAATGATGGAGGAAACAATGAATGAATTTACAGAAAATTTGCTCTTGGATTACAGCAAATACAAGAAGGAGAAGCAAGAGAATGAAGTTCCAGCCAATTATGGCTA
A
mRNA sequenceShow/hide mRNA sequence
ATGGGTCATAATTTGGTTGCTGTTTCTTTTCAATTCCCACAGCTTATTATCAATCCAAACAACAGATCAAAATGTTATGTTCATCAGAAAAAGAAGAACTATTCTAATTT
TATGTGTTTTGCAGTGAAACATAATAATAGTAATAATAATCATCAAAGCCCTCCAATTATCTCTCTCAGATTCTCAAGTTTCCATCCACTTTCTGAGTCTCCTCAGGCTT
CCTTTGATGATTACATTGAAGATGAAGCTAGATTGTTGAGAGCCACTTTTGTTGGAAAAAGTGAAAAATTAAACCAGGATGAATGGAGAGTTGAAATGCCATCTTTCCAA
TTGCTTTTGGTCAAGGTCAGCCCGGTAGCTGACGTAAGATTAAATTGTAGAAGCTGTAGCTCTACTCAAGATTACCCTATTCATATTCCTCACCATGTCTCCAAATTTAT
TGACCTTCAATTGATGAGATGGGAGGTGAAGGGATTGGGCACAGATTTCAAACCAAAAAAGTTCACAATCAATGTAAATGGAGCTTTGTATGCTGAAAGAACAGAATCAA
AAAGTGTTCTCACAAATAATTTACTTCTCAATCTTCACAATTTTGCTGCCCCAACACCCCTTGATTTCTTTGCACAAGATCTTTTTCAACCTCTTGCAGAAAAGGGATTG
AAGGGAATGATGGAGGAAACAATGAATGAATTTACAGAAAATTTGCTCTTGGATTACAGCAAATACAAGAAGGAGAAGCAAGAGAATGAAGTTCCAGCCAATTATGGCTA
A
Protein sequenceShow/hide protein sequence
MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKHNNSNNNHQSPPIISLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQDEWRVEMPSFQ
LLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPKKFTINVNGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGL
KGMMEETMNEFTENLLLDYSKYKKEKQENEVPANYG