CuGenDBv2

CSPI03G38770 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI03G38770
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Protein of unknown function (DUF1997)
Genome location: Chr3:33408493..33410822
RNA-Seq Expression: CSPI03G38770
Synteny: CSPI03G38770
Gene Ontology terms: GO:0070300 - phosphatidic acid binding (molecular function)
InterPro domains: IPR018971 - Protein of unknown function DUF1997
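The genome location field above packs a chromosome name and two coordinates into one string. As a minimal sketch, the snippet below splits that string and computes the span of the locus. It assumes the coordinates are 1-based and inclusive, which the page does not state; `Location` and `parse_location` are hypothetical names introduced only for this illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (assumed 1-based, inclusive on both ends)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'Chr:start..end' string such as 'Chr3:33408493..33410822'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(chrom, start, end)


loc = parse_location("Chr3:33408493..33410822")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans 2330 bp on Chr3.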


Homology
GenBank top hits (e value, % identity, alignment)
XP_004136694.1 uncharacterized protein LOC101213732 isoform X1 [Cucumis sativus] | e value: 1.5e-143 | % identity: 100
Query:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGK
        MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGK
Subjt:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGK

Query:  SEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERTKSKSVLTNNLLLNL
        SEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERTKSKSVLTNNLLLNL
Subjt:  SEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERTKSKSVLTNNLLLNL

Query:  YNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQKNEVPSNYI
        YNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQKNEVPSNYI
Subjt:  YNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQKNEVPSNYI

XP_008443384.1 PREDICTED: uncharacterized protein LOC103486982 [Cucumis melo] | e value: 1.0e-123 | % identity: 88.55
Query:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKK-NNSNC----NTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRA
        MVH++VAVSLQLPQL+INPNYKL+SKCYVHHKKKHYYYYYSNFICFALKK NNSNC    N  QNPPIFSLKFSSF PLSESPQASFDDYIEDE RLLRA
Subjt:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKK-NNSNC----NTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRA

Query:  TFSGKSEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERTKSKSVLTNN
        TF+GKSEKI+QD WRVEMP+FQVLFLKVSPVADVRLSCKS TKD+PIHIP NVSKFIDLQLMGWELKGLSKDFK  KI+INVKGAMYAERTKSKSVL NN
Subjt:  TFSGKSEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERTKSKSVLTNN

Query:  LLLNLYNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQ-KNEV
        LLLNLYNLAP KPIDFFAQDFLQPL EKGLKGMMEE+MKEF ENLLLDYNKYKKE Q KNEV
Subjt:  LLLNLYNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQ-KNEV

XP_011652227.1 uncharacterized protein LOC101213732 isoform X2 [Cucumis sativus] | e value: 2.9e-118 | % identity: 86.97
Query:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGK
        MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGK
Subjt:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGK

Query:  SEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERTKSKSVLTNNLLLNL
        SEKINQDDWRVEMPSFQVLFLK                                  MGWELKGLSKDFKASKIKINVKGAMYAERTKSKSVLTNNLLLNL
Subjt:  SEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERTKSKSVLTNNLLLNL

Query:  YNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQKNEVPSNYI
        YNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQKNEVPSNYI
Subjt:  YNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQKNEVPSNYI

XP_022983867.1 uncharacterized protein LOC111482352 [Cucurbita maxima] | e value: 1.4e-72 | % identity: 59.3
Query:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGK
        M H++ AVS   PQL+I      S KC   H+K+H      +F  FA+K NN+N    QNPPIFSL+FS+F PL ESP ASFD+YI DE RLLRATFSGK
Subjt:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGK

Query:  SEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERTK--SKSVLTNNLLL
        SEK+N+ +WRVEMPSFQ+LFLK+SP+ DVRLSC+S  KD PIHIP++VSKF+DLQ+M WE++G+ KDFK+   +I+VKGA YA RTK  SKSVL N+L+L
Subjt:  SEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERTK--SKSVLTNNLLL

Query:  NLYNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQKNEV
        +L++     P      DFLQP  EKGLKGMM+E M++FT+NL+LDY KYKKE Q   V
Subjt:  NLYNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQKNEV

XP_038905853.1 uncharacterized protein LOC120091799 isoform X1 [Benincasa hispida] | e value: 4.1e-96 | % identity: 72.41
Query:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGK
        M H++VAVS QLPQL+IN N +  SKCYVHHKKK     YS+F+CFA+K NNSN +  QNPPIFSLKFSSF PLSESPQASFDDYIEDEARLLR TFSGK
Subjt:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGK

Query:  SEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSST--KDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERTKSKSVLTNNLLL
        SEKINQD+WR++MPSFQ+ F +VS VADVRL+C+S T  +D PIHIP +VSKFIDLQLM WELKGL  +FK  +  INV+GA+YAERT+SKS+LTNN +L
Subjt:  SEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSST--KDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERTKSKSVLTNNLLL

Query:  NLYNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQKNEVPSN
        NL+N A   P DFFAQDFLQP  EKGLKGMMEE M EFTE LLLDY+KYKKE QKNEV +N
Subjt:  NLYNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQKNEVPSN

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LC26 Uncharacterized protein | e value: 7.5e-144 | % identity: 100
Query:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGK
        MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGK
Subjt:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGK

Query:  SEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERTKSKSVLTNNLLLNL
        SEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERTKSKSVLTNNLLLNL
Subjt:  SEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERTKSKSVLTNNLLLNL

Query:  YNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQKNEVPSNYI
        YNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQKNEVPSNYI
Subjt:  YNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQKNEVPSNYI

A0A1S3B8N8 uncharacterized protein LOC103486982 | e value: 5.0e-124 | % identity: 88.55
Query:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKK-NNSNC----NTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRA
        MVH++VAVSLQLPQL+INPNYKL+SKCYVHHKKKHYYYYYSNFICFALKK NNSNC    N  QNPPIFSLKFSSF PLSESPQASFDDYIEDE RLLRA
Subjt:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKK-NNSNC----NTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRA

Query:  TFSGKSEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERTKSKSVLTNN
        TF+GKSEKI+QD WRVEMP+FQVLFLKVSPVADVRLSCKS TKD+PIHIP NVSKFIDLQLMGWELKGLSKDFK  KI+INVKGAMYAERTKSKSVL NN
Subjt:  TFSGKSEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERTKSKSVLTNN

Query:  LLLNLYNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQ-KNEV
        LLLNLYNLAP KPIDFFAQDFLQPL EKGLKGMMEE+MKEF ENLLLDYNKYKKE Q KNEV
Subjt:  LLLNLYNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQ-KNEV

A0A6J1F2K4 uncharacterized protein LOC111441814 isoform X2 | e value: 3.4e-72 | % identity: 59.84
Query:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGK
        M H++ AVS   PQL+IN       KC   H+++H       F  FA+K NN+N    QNPPIFSL+FS+F PL ESP ASFD+YI DE RLLRATFSGK
Subjt:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGK

Query:  SEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERT--KSKSVLTNNLLL
        SEK+N+ +WRVEMPSFQ+LFLK+SPV DVRLSCKSSTKD PIHIP++VSKF+DLQ+M WE++G+ KDFK    +I+VKG MYA RT  +SKS+L N+L+L
Subjt:  SEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERT--KSKSVLTNNLLL

Query:  NLYNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQ
        +L++     P      DFLQP  EKGL+GMM+E M++FT+NL+LDY KYKKE Q
Subjt:  NLYNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQ

A0A6J1F3D2 uncharacterized protein LOC111441814 isoform X1 | e value: 7.9e-69 | % identity: 55.23
Query:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESP----------------------
        M H++ AVS   PQL+IN       KC   H+++H       F  FA+K NN+N    QNPPIFSL+FS+F PL ESP                      
Subjt:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESP----------------------

Query:  -QASFDDYIEDEARLLRATFSGKSEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINV
         QASFD+YI DE RLLRATFSGKSEK+N+ +WRVEMPSFQ+LFLK+SPV DVRLSCKSSTKD PIHIP++VSKF+DLQ+M WE++G+ KDFK    +I+V
Subjt:  -QASFDDYIEDEARLLRATFSGKSEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINV

Query:  KGAMYAERT--KSKSVLTNNLLLNLYNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQ
        KG MYA RT  +SKS+L N+L+L+L++     P      DFLQP  EKGL+GMM+E M++FT+NL+LDY KYKKE Q
Subjt:  KGAMYAERT--KSKSVLTNNLLLNLYNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQ

A0A6J1J0I3 uncharacterized protein LOC111482352 | e value: 6.9e-73 | % identity: 59.3
Query:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGK
        M H++ AVS   PQL+I      S KC   H+K+H      +F  FA+K NN+N    QNPPIFSL+FS+F PL ESP ASFD+YI DE RLLRATFSGK
Subjt:  MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGK

Query:  SEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERTK--SKSVLTNNLLL
        SEK+N+ +WRVEMPSFQ+LFLK+SP+ DVRLSC+S  KD PIHIP++VSKF+DLQ+M WE++G+ KDFK+   +I+VKGA YA RTK  SKSVL N+L+L
Subjt:  SEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERTK--SKSVLTNNLLL

Query:  NLYNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQKNEV
        +L++     P      DFLQP  EKGLKGMM+E M++FT+NL+LDY KYKKE Q   V
Subjt:  NLYNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQKNEV

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G39520.1 Protein of unknown function (DUF1997) | e value: 8.2e-26 | % identity: 31.09
Query:  FSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGKSE--KINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWEL
        +S K S+   L ESPQA FD+Y+ED++R+  A F  K +  ++N+++WR++M   +  FL   PV  +R+ CKS+ +D P  +P +++K ++L +  WEL
Subjt:  FSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGKSE--KINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWEL

Query:  KGLSKDFKASKIKINVKGAMYAERTKSKSVLTNNLLLNLYNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQKN
        +GL +  + +   + VKGA+Y +R    + L   L   +  + P   +    +D  + +    L G+++ +     E+L+ DY+K+K E +K+
Subjt:  KGLSKDFKASKIKINVKGAMYAERTKSKSVLTNNLLLNLYNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKETQKN

AT5G39530.1 Protein of unknown function (DUF1997) | e value: 6.5e-31 | % identity: 34.9
Query:  PPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGK--SEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMG
        P  +S + S+  PL+ESPQA FD+Y+ED++R+  A F  K  S ++N+++WR++M     LFL V PV D+RL CKS+ +D P  +P +++K ++L +M 
Subjt:  PPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGK--SEKINQDDWRVEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMG

Query:  WELKGLSKDFKASKIKINVKGAMYAERTKSKSVLTNNLLLNLYNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKE
        W+L+GL +  + +   + VKGA+Y +R    + L   L +N+  + P   ++   +D  + L    L G++E +  +   +LL DY+++K E
Subjt:  WELKGLSKDFKASKIKINVKGAMYAERTKSKSVLTNNLLLNLYNLAPQKPIDFFAQDFLQPLVEKGLKGMMEEIMKEFTENLLLDYNKYKKE
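
In the alignments above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch of how a percent identity can be derived from that middle line, the hypothetical helper `percent_identity` below counts letter columns over the displayed alignment length. This is an illustration only: the values reported on this page come from the search tool and may be computed over a different alignment length, so they need not match this calculation.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    `query` is the aligned query row (gaps as '-'); `midline` is the
    match line shown beneath it. Trailing spaces lost from the midline
    are restored by padding to the query length.
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment")
    padded = midline.ljust(columns)[:columns]
    # Letters mark identities; '+' and ' ' are not counted.
    identities = sum(ch.isalpha() for ch in padded)
    return 100.0 * identities / columns


# Short excerpt: 9 columns, 7 of them identical.
print(round(percent_identity("KETQ-KNEV", "KE Q KNEV"), 2))
```

The excerpt passed in the usage line is the tail of the Cucumis melo alignment; a full calculation would concatenate all rows of a hit before calling the function.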


Sequences
CDS sequence
ATGGTTCACAGTATGGTTGCTGTTTCTCTTCAACTCCCACAGCTTGTTATCAATCCAAATTACAAGCTTAGCTCAAAGTGTTATGTTCATCACAAAAAGAAGCATTATTA
TTATTATTATTCTAATTTCATTTGCTTTGCATTGAAAAAGAATAACAGTAATTGCAATACTATTCAAAATCCTCCAATTTTCTCTCTCAAGTTCTCCAGTTTCAGTCCAC
TTTCTGAATCTCCTCAGGCTTCCTTTGATGATTACATTGAAGATGAAGCTAGATTGTTGAGAGCCACTTTTTCTGGAAAAAGTGAAAAAATCAACCAGGATGACTGGAGA
GTTGAAATGCCATCTTTCCAAGTGCTTTTCTTGAAGGTGAGCCCAGTAGCTGATGTAAGATTAAGTTGCAAAAGCAGCACAAAAGATAGCCCTATTCATATTCCTCAAAA
TGTCTCCAAATTTATTGACCTTCAATTGATGGGATGGGAATTGAAAGGATTGAGCAAAGATTTCAAAGCATCAAAGATCAAAATCAATGTAAAAGGAGCTATGTATGCAG
AGAGAACAAAATCAAAAAGTGTTCTCACAAACAATTTGCTTCTCAATCTTTACAACTTGGCTCCCCAAAAGCCCATTGATTTCTTTGCACAAGATTTCCTTCAACCTCTT
GTGGAAAAGGGATTGAAGGGAATGATGGAAGAAATAATGAAAGAATTTACAGAAAATTTGCTATTGGATTACAACAAATACAAGAAGGAAACACAAAAGAACGAAGTTCC
TTCCAATTATATATAA
mRNA sequence
GTACTTTTCTTCTTCTTCTCATTACAGAAAGCACTTTGGAGGATATTGAAGAAAAAGAAGAGCAATGGTTCACAGTATGGTTGCTGTTTCTCTTCAACTCCCACAGCTTG
TTATCAATCCAAATTACAAGCTTAGCTCAAAGTGTTATGTTCATCACAAAAAGAAGCATTATTATTATTATTATTCTAATTTCATTTGCTTTGCATTGAAAAAGAATAAC
AGTAATTGCAATACTATTCAAAATCCTCCAATTTTCTCTCTCAAGTTCTCCAGTTTCAGTCCACTTTCTGAATCTCCTCAGGCTTCCTTTGATGATTACATTGAAGATGA
AGCTAGATTGTTGAGAGCCACTTTTTCTGGAAAAAGTGAAAAAATCAACCAGGATGACTGGAGAGTTGAAATGCCATCTTTCCAAGTGCTTTTCTTGAAGGTGAGCCCAG
TAGCTGATGTAAGATTAAGTTGCAAAAGCAGCACAAAAGATAGCCCTATTCATATTCCTCAAAATGTCTCCAAATTTATTGACCTTCAATTGATGGGATGGGAATTGAAA
GGATTGAGCAAAGATTTCAAAGCATCAAAGATCAAAATCAATGTAAAAGGAGCTATGTATGCAGAGAGAACAAAATCAAAAAGTGTTCTCACAAACAATTTGCTTCTCAA
TCTTTACAACTTGGCTCCCCAAAAGCCCATTGATTTCTTTGCACAAGATTTCCTTCAACCTCTTGTGGAAAAGGGATTGAAGGGAATGATGGAAGAAATAATGAAAGAAT
TTACAGAAAATTTGCTATTGGATTACAACAAATACAAGAAGGAAACACAAAAGAACGAAGTTCCTTCCAATTATATATAAGCTAATTACACTCTGCCCTATTTATCTCAA
CATTTTTCCAATATTATTGTAAAACATTATTTGCAATGTTAAATAAAATTTTCAAATATATATTGAAAATTCCTTCGTTTTTGCTTTATTTATTTTATAAAGTATAATAT
TGTTGTGTT
Protein sequence
MVHSMVAVSLQLPQLVINPNYKLSSKCYVHHKKKHYYYYYSNFICFALKKNNSNCNTIQNPPIFSLKFSSFSPLSESPQASFDDYIEDEARLLRATFSGKSEKINQDDWR
VEMPSFQVLFLKVSPVADVRLSCKSSTKDSPIHIPQNVSKFIDLQLMGWELKGLSKDFKASKIKINVKGAMYAERTKSKSVLTNNLLLNLYNLAPQKPIDFFAQDFLQPL
VEKGLKGMMEEIMKEFTENLLLDYNKYKKETQKNEVPSNYI
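
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly; `translate` and `CODON_TABLE` are names introduced here for illustration, and unrecognised codons are rendered as `X`.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First seven codons and last eight codons of the CDS shown above.
print(translate("ATGGTTCACAGTATGGTTGCT"))
print(translate("GAAGTTCCTTCCAATTATATATAA"))
```

The two excerpts translate to `MVHSMVA` and `EVPSNYI*`, matching the start and end of the listed protein sequence; pasting the full CDS into `translate` performs the same check over the whole record.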