; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc10G14640 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc10G14640
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein of unknown function (DUF1997)
Genome locationClcChr10:28318206..28322853
RNA-Seq ExpressionClc10G14640
SyntenyClc10G14640
Gene Ontology termsGO:0070300 - phosphatidic acid binding (molecular function)
InterPro domainsIPR018971 - Protein of unknown function DUF1997


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136694.1 uncharacterized protein LOC101213732 isoform X1 [Cucumis sativus]2.6e-10375.19Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNFMCFAVKNNNSN-NNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGK
        M H++VAVS Q PQL+INPN +  SKCYVH KKK+    YSNF+CFA+K NNSN N  Q+PPIFSL+FSSF PLSESPQASFDDYIEDEARLLRATF GK
Subjt:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNFMCFAVKNNNSN-NNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGK

Query:  SEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALYAERTESKSVLTNNLLL
        SEK+NQD+WRVEMPSFQ+L +KVSPVADVRL+C+  SST+D PIHIP +VSKFIDLQLM WE+KGL  DFK  K  INV GA+YAERT+SKSVLTNNLLL
Subjt:  SEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALYAERTESKSVLTNNLLL

Query:  NLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANY
        NL+N A   P+DFFAQD  QPL EKGLKGMMEE M EFTENLLLDY+KYKKE Q+NEVP+NY
Subjt:  NLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANY

XP_008443384.1 PREDICTED: uncharacterized protein LOC103486982 [Cucumis melo]8.4e-10274.24Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNFMCFAVKNN------NSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRA
        M HNLVAVS Q PQLIINPN +  SKCYVH KKK+    YSNF+CFA+K N      N+NN +Q+PPIFSL+FSSFHPLSESPQASFDDYIEDE RLLRA
Subjt:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNFMCFAVKNN------NSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRA

Query:  TFVGKSEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALYAERTESKSVLT
        TF GKSEK++QD WRVEMP+FQ+L +KVSPVADVRL+C+SC  T+D PIHIPH+VSKFIDLQLM WE+KGL  DFK  K  INV GA+YAERT+SKSVL 
Subjt:  TFVGKSEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALYAERTESKSVLT

Query:  NNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ-ENEV
        NNLLLNL+N A P P+DFFAQD  QPLAEKGLKGMMEE M EF ENLLLDY+KYKKEKQ +NEV
Subjt:  NNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ-ENEV

XP_022934711.1 uncharacterized protein LOC111441814 isoform X2 [Cucurbita moschata]3.5e-8467.87Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD
        M HNL AVSF FPQLIIN     KC  HQ++     F  FAVKNN  NNNHQ+PPIFSLRFS+FHPL ESP ASFD+YI DE RLLRATF GKSEKLN+ 
Subjt:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD

Query:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALYAERT--ESKSVLTNNLLLNLHNF
        EWRVEMPSFQLL +K+SPV DVRL+C+  SST+DYPIHIP HVSKF+DLQ+MRWEV+G+G DFKPQ F I+V G +YA RT  ESKS+L N+L+L+LH+F
Subjt:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALYAERT--ESKSVLTNNLLLNLHNF

Query:  AAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ
         +P P DF      QP AEKGL+GMM+E+M +FT+NL+LDY+KYKKEKQ
Subjt:  AAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ

XP_038905853.1 uncharacterized protein LOC120091799 isoform X1 [Benincasa hispida]1.1e-12083.98Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD
        M HNLVAVSFQ PQLIIN N RSKCYVH KKK+YS+F+CFAVKNNNSN++HQ+PPIFSL+FSSFHPLSESPQASFDDYIEDEARLLR TF GKSEK+NQD
Subjt:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD

Query:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALYAERTESKSVLTNNLLLNLHNFAA
        EWR++MPSFQL   +VS VADVRLNCRS ++ QDYPIHIPHHVSKFIDLQLMRWE+KGLGT+FKPQ+FTINV GALYAERTESKS+LTNN +LNLHNFAA
Subjt:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALYAERTESKSVLTNNLLLNLHNFAA

Query:  PTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANYG
        PTP DFFAQD  QP AEKGLKGMMEETMNEFTE LLLDYSKYKKEKQ+NEV AN G
Subjt:  PTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANYG

XP_038905855.1 uncharacterized protein LOC120091799 isoform X2 [Benincasa hispida]7.6e-8767.97Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD
        M HNLVAVSFQ PQLIIN N RSKCYVH KKK+YS+F+CFAVKNNNSN++HQ+PPIFSL+FSSFHPLSESPQASFDDYIEDEARLLR TF GKSEK+NQ 
Subjt:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD

Query:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALYAERTESKSVLTNNLLLNLHNFAA
                                                           MRWE+KGLGT+FKPQ+FTINV GALYAERTESKS+LTNN +LNLHNFAA
Subjt:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALYAERTESKSVLTNNLLLNLHNFAA

Query:  PTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANYG
        PTP DFFAQD  QP AEKGLKGMMEETMNEFTE LLLDYSKYKKEKQ+NEV AN G
Subjt:  PTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANYG

TrEMBL top hitse value%identityAlignment
A0A0A0LC26 Uncharacterized protein1.3e-10375.19Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNFMCFAVKNNNSN-NNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGK
        M H++VAVS Q PQL+INPN +  SKCYVH KKK+    YSNF+CFA+K NNSN N  Q+PPIFSL+FSSF PLSESPQASFDDYIEDEARLLRATF GK
Subjt:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNFMCFAVKNNNSN-NNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGK

Query:  SEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALYAERTESKSVLTNNLLL
        SEK+NQD+WRVEMPSFQ+L +KVSPVADVRL+C+  SST+D PIHIP +VSKFIDLQLM WE+KGL  DFK  K  INV GA+YAERT+SKSVLTNNLLL
Subjt:  SEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALYAERTESKSVLTNNLLL

Query:  NLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANY
        NL+N A   P+DFFAQD  QPL EKGLKGMMEE M EFTENLLLDY+KYKKE Q+NEVP+NY
Subjt:  NLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEVPANY

A0A1S3B8N8 uncharacterized protein LOC1034869824.1e-10274.24Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNFMCFAVKNN------NSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRA
        M HNLVAVS Q PQLIINPN +  SKCYVH KKK+    YSNF+CFA+K N      N+NN +Q+PPIFSL+FSSFHPLSESPQASFDDYIEDE RLLRA
Subjt:  MGHNLVAVSFQFPQLIINPNNR--SKCYVHQKKKN----YSNFMCFAVKNN------NSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRA

Query:  TFVGKSEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALYAERTESKSVLT
        TF GKSEK++QD WRVEMP+FQ+L +KVSPVADVRL+C+SC  T+D PIHIPH+VSKFIDLQLM WE+KGL  DFK  K  INV GA+YAERT+SKSVL 
Subjt:  TFVGKSEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALYAERTESKSVLT

Query:  NNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ-ENEV
        NNLLLNL+N A P P+DFFAQD  QPLAEKGLKGMMEE M EF ENLLLDY+KYKKEKQ +NEV
Subjt:  NNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ-ENEV

A0A6J1F2K4 uncharacterized protein LOC111441814 isoform X21.7e-8467.87Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD
        M HNL AVSF FPQLIIN     KC  HQ++     F  FAVKNN  NNNHQ+PPIFSLRFS+FHPL ESP ASFD+YI DE RLLRATF GKSEKLN+ 
Subjt:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD

Query:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALYAERT--ESKSVLTNNLLLNLHNF
        EWRVEMPSFQLL +K+SPV DVRL+C+  SST+DYPIHIP HVSKF+DLQ+MRWEV+G+G DFKPQ F I+V G +YA RT  ESKS+L N+L+L+LH+F
Subjt:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALYAERT--ESKSVLTNNLLLNLHNF

Query:  AAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ
         +P P DF      QP AEKGL+GMM+E+M +FT+NL+LDY+KYKKEKQ
Subjt:  AAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ

A0A6J1F3D2 uncharacterized protein LOC111441814 isoform X13.9e-8162.5Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESP-----------------------QASFDD
        M HNL AVSF FPQLIIN     KC  HQ++     F  FAVKNN  NNNHQ+PPIFSLRFS+FHPL ESP                       QASFD+
Subjt:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESP-----------------------QASFDD

Query:  YIEDEARLLRATFVGKSEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALY
        YI DE RLLRATF GKSEKLN+ EWRVEMPSFQLL +K+SPV DVRL+C+  SST+DYPIHIP HVSKF+DLQ+MRWEV+G+G DFKPQ F I+V G +Y
Subjt:  YIEDEARLLRATFVGKSEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALY

Query:  AERT--ESKSVLTNNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ
        A RT  ESKS+L N+L+L+LH+F +P P DF      QP AEKGL+GMM+E+M +FT+NL+LDY+KYKKEKQ
Subjt:  AERT--ESKSVLTNNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ

A0A6J1J0I3 uncharacterized protein LOC1114823522.2e-8467.19Show/hide
Query:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD
        M HNL AVSF FPQLII+     KC  HQK+    +F  FAVKNN  NNNHQ+PPIFSLRFS+FHPL ESP ASFD+YI DE RLLRATF GKSEKLN+ 
Subjt:  MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQD

Query:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALYAERT--ESKSVLTNNLLLNLHNF
        EWRVEMPSFQLL +K+SP+ DVRL+CRSC+  +DYPIHIP HVSKF+DLQ+MRWEV+G+G DFK Q F I+V GA YA RT  ESKSVL N+L+L+LH+F
Subjt:  EWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALYAERT--ESKSVLTNNLLLNLHNF

Query:  AAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEV
         +  P DF      QP AEKGLKGMM+E+M +FT+NL+LDY+KYKKEKQ   V
Subjt:  AAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQENEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G39520.1 Protein of unknown function (DUF1997)4.1e-3033.85Show/hide
Query:  FSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSE--KLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRW
        +S + S+   L ESPQA FD+Y+ED++R+  A F  K +  +LN++EWR++M   +   +   PV  +R+ C+  S+ QDYP  +P H++K ++L + +W
Subjt:  FSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSE--KLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRW

Query:  EVKGLGTDFKPQKFTINVNGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQEN
        E++GL    +P  FT+ V GALY +R    + L   L   + +F  P+ L    +D+ + +A   L G+++   +   E+L+ DYSK+K E++++
Subjt:  EVKGLGTDFKPQKFTINVNGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQEN

AT5G39530.1 Protein of unknown function (DUF1997)5.5e-3538.27Show/hide
Query:  PPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGK--SEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQL
        P  +S R S+  PL+ESPQA FD+Y+ED++R+  A F  K  S +LN++EWR++M     L + V PV D+RL C+  S+ QDYP  +P  ++K ++L +
Subjt:  PPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGK--SEKLNQDEWRVEMPSFQLLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQL

Query:  MRWEVKGLGTDFKPQKFTINVNGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ
        MRW+++GL    +P  F++ V GALY +R    + L   L +N+ +F  P  L+   +D+ + LA   L G++E   ++   +LL DYS++K E++
Subjt:  MRWEVKGLGTDFKPQKFTINVNGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGLKGMMEETMNEFTENLLLDYSKYKKEKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCATAATTTGGTTGCTGTTTCTTTCCAATTCCCACAGCTTATTATCAATCCCAACAACAGATCAAAATGTTATGTTCATCAGAAAAAGAAGAACTATTCTAATTT
TATGTGTTTTGCAGTGAAAAATAATAATAGTAATAATAATCATCAAAGCCCTCCAATTTTCTCTCTCAGATTCTCAAGTTTCCATCCACTTTCTGAGTCTCCTCAGGCTT
CCTTTGATGATTACATTGAAGATGAAGCTAGATTGTTGAGAGCCACTTTTGTTGGAAAAAGTGAAAAATTAAACCAGGATGAATGGAGAGTTGAAATGCCATCTTTCCAA
TTGCTTTTGGTCAAGGTCAGCCCGGTAGCTGACGTAAGATTAAATTGTAGAAGCTGTAGCTCTACTCAAGATTACCCTATTCATATTCCTCACCATGTCTCCAAATTTAT
TGACCTTCAATTGATGAGATGGGAGGTGAAGGGATTGGGCACAGATTTCAAACCACAAAAGTTCACAATCAATGTAAATGGAGCTTTGTATGCTGAAAGAACAGAATCAA
AAAGTGTTCTCACAAATAATTTACTTCTCAATCTTCACAATTTTGCTGCCCCAACACCCCTTGATTTCTTTGCACAAGATCTTTTTCAACCTCTTGCAGAAAAGGGATTG
AAGGGAATGATGGAGGAAACAATGAATGAATTTACAGAAAATTTGCTCTTGGATTACAGCAAATACAAGAAGGAGAAGCAAGAGAATGAAGTTCCAGCCAATTATGGCTA
A
mRNA sequenceShow/hide mRNA sequence
CGTGGTTGCTCAAACATAGTGTTTTTATTTGAACAATCTTCCCATTTATATATTTCAGCCTCTTCTACGTAGTAAAAAGTAAAGTAAAGTGTATGCACTTTTTCTTCCTC
TCATTGCAAAAAGCACTTTGGGGGATCGATATTGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAGCTATGGGTCATAATTTGGTTGCTGTTTCTTTC
CAATTCCCACAGCTTATTATCAATCCCAACAACAGATCAAAATGTTATGTTCATCAGAAAAAGAAGAACTATTCTAATTTTATGTGTTTTGCAGTGAAAAATAATAATAG
TAATAATAATCATCAAAGCCCTCCAATTTTCTCTCTCAGATTCTCAAGTTTCCATCCACTTTCTGAGTCTCCTCAGGCTTCCTTTGATGATTACATTGAAGATGAAGCTA
GATTGTTGAGAGCCACTTTTGTTGGAAAAAGTGAAAAATTAAACCAGGATGAATGGAGAGTTGAAATGCCATCTTTCCAATTGCTTTTGGTCAAGGTCAGCCCGGTAGCT
GACGTAAGATTAAATTGTAGAAGCTGTAGCTCTACTCAAGATTACCCTATTCATATTCCTCACCATGTCTCCAAATTTATTGACCTTCAATTGATGAGATGGGAGGTGAA
GGGATTGGGCACAGATTTCAAACCACAAAAGTTCACAATCAATGTAAATGGAGCTTTGTATGCTGAAAGAACAGAATCAAAAAGTGTTCTCACAAATAATTTACTTCTCA
ATCTTCACAATTTTGCTGCCCCAACACCCCTTGATTTCTTTGCACAAGATCTTTTTCAACCTCTTGCAGAAAAGGGATTGAAGGGAATGATGGAGGAAACAATGAATGAA
TTTACAGAAAATTTGCTCTTGGATTACAGCAAATACAAGAAGGAGAAGCAAGAGAATGAAGTTCCAGCCAATTATGGCTAATTAGTGTGTCTCAAATCAACAACATCAAA
ATTACCAATTATATTATTTTAGAACATTACTTGAAAAGTATCAGTGAATTAA
Protein sequenceShow/hide protein sequence
MGHNLVAVSFQFPQLIINPNNRSKCYVHQKKKNYSNFMCFAVKNNNSNNNHQSPPIFSLRFSSFHPLSESPQASFDDYIEDEARLLRATFVGKSEKLNQDEWRVEMPSFQ
LLLVKVSPVADVRLNCRSCSSTQDYPIHIPHHVSKFIDLQLMRWEVKGLGTDFKPQKFTINVNGALYAERTESKSVLTNNLLLNLHNFAAPTPLDFFAQDLFQPLAEKGL
KGMMEETMNEFTENLLLDYSKYKKEKQENEVPANYG