CuGenDBv2

Sgr000877 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr000877
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: UPF0430 protein CG31712 isoform X2
Genome location: tig00000589:101438..107863
RNA-Seq Expression: Sgr000877
Synteny: Sgr000877
Gene Ontology terms: NA
InterPro domains: NA
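
The genome location field above follows a `contig:start..end` pattern. The following is a minimal sketch of splitting such a string into its parts; the function names are hypothetical, and the span length assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

# Assumed pattern: <contig>:<start>..<end>, as in "tig00000589:101438..107863".
LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (contig, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return match["contig"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00000589:101438..107863")
print(contig, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 6426 bp, which is longer than the CDS listed further down.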


Homology
GenBank top hits (e value, %identity, alignment)
XP_022157281.1 uncharacterized protein LOC111024023 isoform X1 [Momordica charantia]  e value: 1.8e-86  %identity: 76.35
Query:  LSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISSTHCSDQVTKEMLSSQVDAARILVDKLITENAELIGKVNE
        L +NSSRET+V+KNNDISRLQA+VV LE+H+RDLL ENKQLMENV DYQSK++NLERRI S H SD +TKEMLSSQVDAARILVDKLITENAELIGKVNE
Subjt:  LSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISSTHCSDQVTKEMLSSQVDAARILVDKLITENAELIGKVNE

Query:  LFVELQRVTKTELSSAREPNQMVGASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDNDLLAPTSSVPIEAGEIVQIPLHENEDRDKDWDR
        LFVELQRVTKTELSSA EP +  GA+ D PE   IQ+T     RL+ALES+ +H+HS  SNIV+LDNDLLAPTSS+PIEAGEIVQIPL ENED       
Subjt:  LFVELQRVTKTELSSAREPNQMVGASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDNDLLAPTSSVPIEAGEIVQIPLHENEDRDKDWDR

Query:  ELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL
             +AESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL
Subjt:  ELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL

XP_022157289.1 uncharacterized protein LOC111024023 isoform X2 [Momordica charantia]  e value: 1.8e-86  %identity: 76.35
Query:  LSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISSTHCSDQVTKEMLSSQVDAARILVDKLITENAELIGKVNE
        L +NSSRET+V+KNNDISRLQA+VV LE+H+RDLL ENKQLMENV DYQSK++NLERRI S H SD +TKEMLSSQVDAARILVDKLITENAELIGKVNE
Subjt:  LSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISSTHCSDQVTKEMLSSQVDAARILVDKLITENAELIGKVNE

Query:  LFVELQRVTKTELSSAREPNQMVGASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDNDLLAPTSSVPIEAGEIVQIPLHENEDRDKDWDR
        LFVELQRVTKTELSSA EP +  GA+ D PE   IQ+T     RL+ALES+ +H+HS  SNIV+LDNDLLAPTSS+PIEAGEIVQIPL ENED       
Subjt:  LFVELQRVTKTELSSAREPNQMVGASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDNDLLAPTSSVPIEAGEIVQIPLHENEDRDKDWDR

Query:  ELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL
             +AESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL
Subjt:  ELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL

XP_022941666.1 uncharacterized protein LOC111446955 [Cucurbita moschata]  e value: 6.0e-82  %identity: 71.81
Query:  LYSSRSGFCLSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISS--THCSDQVTKEMLSSQVDAARILVDKLIT
        L   ++    ++NSS E +VDKN DISRLQA+VV LEE RRDLL ENKQL ENVADYQSKI  LER+ISS  TH SD+VTKEMLSSQVDAARILVDKLIT
Subjt:  LYSSRSGFCLSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISS--THCSDQVTKEMLSSQVDAARILVDKLIT

Query:  ENAELIGKVNELFVELQRVTKTELSSAREPNQMV-----GASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDND-LLAPTSSV-PIEAGE
        ENAELIGKVNEL+VELQRVTK E++S  EP+QMV      A+F++PEP LI N VTS K LDALESVPIH+HS   NIVD+DND LL+PTS + P+E GE
Subjt:  ENAELIGKVNELFVELQRVTKTELSSAREPNQMV-----GASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDND-LLAPTSSV-PIEAGE

Query:  IVQIPLHENEDRDKDWDRELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL
        I QIP  ENEDR++  +REL  + AESDE+DVLLSDAPLIGAPYRLISFMAKYVSGADL
Subjt:  IVQIPLHENEDRDKDWDRELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL

XP_022971477.1 UPF0430 protein CG31712 isoform X1 [Cucurbita maxima]  e value: 3.9e-81  %identity: 71.04
Query:  LYSSRSGFCLSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISSTHC--SDQVTKEMLSSQVDAARILVDKLIT
        L   ++    ++NSS E +VDKN DISRLQA+VV LEE RRDLL ENKQL ENVADY+SKI  LER+ISSTH   SD+VTKEMLSSQVDAARILVDKLIT
Subjt:  LYSSRSGFCLSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISSTHC--SDQVTKEMLSSQVDAARILVDKLIT

Query:  ENAELIGKVNELFVELQRVTKTELSSAREPNQMV-----GASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDND-LLAPTSSV-PIEAGE
        ENAELIGKVN L+VELQRVTK E++S  EP+QMV      A+F+DP+P LI N VTS K LDALESVPIH+HS   N+VD+DND LL+PTS + P+E GE
Subjt:  ENAELIGKVNELFVELQRVTKTELSSAREPNQMV-----GASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDND-LLAPTSSV-PIEAGE

Query:  IVQIPLHENEDRDKDWDRELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL
        I QIP  ENEDR++  +REL  + AESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL
Subjt:  IVQIPLHENEDRDKDWDRELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL

XP_038906167.1 myosin-6 [Benincasa hispida]  e value: 1.7e-84  %identity: 73.15
Query:  LYSSRSGFCLSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISST--HCSDQVTKEMLSSQVDAARILVDKLIT
        L   ++   L++ SS ET+VDKN DISRLQA+VV LEE RRDLL ENKQL ENVADYQSK+LNLER++SST  H S +VTKEMLSSQVDAARILVDKLIT
Subjt:  LYSSRSGFCLSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISST--HCSDQVTKEMLSSQVDAARILVDKLIT

Query:  ENAELIGKVNELFVELQRVTKTELSSAREPNQMV-----GASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDNDLLAPTSSVPIEAGEIV
        ENAELIGKVN LFVELQRVTKTELSS  EP+QM       A+F+DPEP LI N+VTS K LDALESVPIH+HS  S+ VDLDND LA  SS+P+ AGEI 
Subjt:  ENAELIGKVNELFVELQRVTKTELSSAREPNQMV-----GASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDNDLLAPTSSVPIEAGEIV

Query:  QIPLHENEDRDKDWDRELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL
        Q PLHE EDR++  DRELP     SDE+DVLLSDAPLIGAPYRLISFMAKYVSGADL
Subjt:  QIPLHENEDRDKDWDRELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL

TrEMBL top hits (e value, %identity, alignment)
A0A6J1DU42 uncharacterized protein LOC111024023 isoform X1  e value: 8.8e-87  %identity: 76.35
Query:  LSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISSTHCSDQVTKEMLSSQVDAARILVDKLITENAELIGKVNE
        L +NSSRET+V+KNNDISRLQA+VV LE+H+RDLL ENKQLMENV DYQSK++NLERRI S H SD +TKEMLSSQVDAARILVDKLITENAELIGKVNE
Subjt:  LSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISSTHCSDQVTKEMLSSQVDAARILVDKLITENAELIGKVNE

Query:  LFVELQRVTKTELSSAREPNQMVGASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDNDLLAPTSSVPIEAGEIVQIPLHENEDRDKDWDR
        LFVELQRVTKTELSSA EP +  GA+ D PE   IQ+T     RL+ALES+ +H+HS  SNIV+LDNDLLAPTSS+PIEAGEIVQIPL ENED       
Subjt:  LFVELQRVTKTELSSAREPNQMVGASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDNDLLAPTSSVPIEAGEIVQIPLHENEDRDKDWDR

Query:  ELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL
             +AESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL
Subjt:  ELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL

A0A6J1DW29 uncharacterized protein LOC111024023 isoform X2  e value: 8.8e-87  %identity: 76.35
Query:  LSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISSTHCSDQVTKEMLSSQVDAARILVDKLITENAELIGKVNE
        L +NSSRET+V+KNNDISRLQA+VV LE+H+RDLL ENKQLMENV DYQSK++NLERRI S H SD +TKEMLSSQVDAARILVDKLITENAELIGKVNE
Subjt:  LSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISSTHCSDQVTKEMLSSQVDAARILVDKLITENAELIGKVNE

Query:  LFVELQRVTKTELSSAREPNQMVGASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDNDLLAPTSSVPIEAGEIVQIPLHENEDRDKDWDR
        LFVELQRVTKTELSSA EP +  GA+ D PE   IQ+T     RL+ALES+ +H+HS  SNIV+LDNDLLAPTSS+PIEAGEIVQIPL ENED       
Subjt:  LFVELQRVTKTELSSAREPNQMVGASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDNDLLAPTSSVPIEAGEIVQIPLHENEDRDKDWDR

Query:  ELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL
             +AESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL
Subjt:  ELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL

A0A6J1FUD8 uncharacterized protein LOC111446955  e value: 2.9e-82  %identity: 71.81
Query:  LYSSRSGFCLSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISS--THCSDQVTKEMLSSQVDAARILVDKLIT
        L   ++    ++NSS E +VDKN DISRLQA+VV LEE RRDLL ENKQL ENVADYQSKI  LER+ISS  TH SD+VTKEMLSSQVDAARILVDKLIT
Subjt:  LYSSRSGFCLSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISS--THCSDQVTKEMLSSQVDAARILVDKLIT

Query:  ENAELIGKVNELFVELQRVTKTELSSAREPNQMV-----GASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDND-LLAPTSSV-PIEAGE
        ENAELIGKVNEL+VELQRVTK E++S  EP+QMV      A+F++PEP LI N VTS K LDALESVPIH+HS   NIVD+DND LL+PTS + P+E GE
Subjt:  ENAELIGKVNELFVELQRVTKTELSSAREPNQMV-----GASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDND-LLAPTSSV-PIEAGE

Query:  IVQIPLHENEDRDKDWDRELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL
        I QIP  ENEDR++  +REL  + AESDE+DVLLSDAPLIGAPYRLISFMAKYVSGADL
Subjt:  IVQIPLHENEDRDKDWDRELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL

A0A6J1I220 UPF0430 protein CG31712 isoform X1  e value: 1.9e-81  %identity: 71.04
Query:  LYSSRSGFCLSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISSTHC--SDQVTKEMLSSQVDAARILVDKLIT
        L   ++    ++NSS E +VDKN DISRLQA+VV LEE RRDLL ENKQL ENVADY+SKI  LER+ISSTH   SD+VTKEMLSSQVDAARILVDKLIT
Subjt:  LYSSRSGFCLSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISSTHC--SDQVTKEMLSSQVDAARILVDKLIT

Query:  ENAELIGKVNELFVELQRVTKTELSSAREPNQMV-----GASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDND-LLAPTSSV-PIEAGE
        ENAELIGKVN L+VELQRVTK E++S  EP+QMV      A+F+DP+P LI N VTS K LDALESVPIH+HS   N+VD+DND LL+PTS + P+E GE
Subjt:  ENAELIGKVNELFVELQRVTKTELSSAREPNQMV-----GASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDND-LLAPTSSV-PIEAGE

Query:  IVQIPLHENEDRDKDWDRELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL
        I QIP  ENEDR++  +REL  + AESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL
Subjt:  IVQIPLHENEDRDKDWDRELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL

A0A6J1I8P0 UPF0430 protein CG31712 isoform X2  e value: 1.9e-81  %identity: 71.04
Query:  LYSSRSGFCLSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISSTHC--SDQVTKEMLSSQVDAARILVDKLIT
        L   ++    ++NSS E +VDKN DISRLQA+VV LEE RRDLL ENKQL ENVADY+SKI  LER+ISSTH   SD+VTKEMLSSQVDAARILVDKLIT
Subjt:  LYSSRSGFCLSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISSTHC--SDQVTKEMLSSQVDAARILVDKLIT

Query:  ENAELIGKVNELFVELQRVTKTELSSAREPNQMV-----GASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDND-LLAPTSSV-PIEAGE
        ENAELIGKVN L+VELQRVTK E++S  EP+QMV      A+F+DP+P LI N VTS K LDALESVPIH+HS   N+VD+DND LL+PTS + P+E GE
Subjt:  ENAELIGKVNELFVELQRVTKTELSSAREPNQMV-----GASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDND-LLAPTSSV-PIEAGE

Query:  IVQIPLHENEDRDKDWDRELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL
        I QIP  ENEDR++  +REL  + AESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL
Subjt:  IVQIPLHENEDRDKDWDRELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL

SwissProt top hits (e value, %identity, alignment)
No hits found
Arabidopsis top hits (e value, %identity, alignment)
AT2G38580.1 Mitochondrial ATP synthase D chain-related protein  e value: 1.3e-26  %identity: 36.78
Query:  QNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISSTHCSDQVTKEMLSSQVDAARILVDKLITENAELIGKVNELF
        + SSRE +   NN+I+RL+A+V  LE+ + +LL +N+ L E +++ Q +  N +            ++E L+SQ++AA  LV+KLITENA+L+ KVNEL 
Subjt:  QNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISSTHCSDQVTKEMLSSQVDAARILVDKLITENAELIGKVNELF

Query:  VELQRVTKTELSSAREPNQMVGASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDNDLLAPTSSVP--IEAGEIVQ-IPLHENEDRDKDWD
        ++L              NQ   AS   PE   I+      ++ ++LE +PIH       ++ +DN     T+S+      GEI + +PL  N + + D +
Subjt:  VELQRVTKTELSSAREPNQMVGASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDNDLLAPTSSVP--IEAGEIVQ-IPLHENEDRDKDWD

Query:  RELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL
         ++ VA  +     V L+DAPLIGAP+RL+SF+A+YVSGADL
Subjt:  RELPVAVAESDEKDVLLSDAPLIGAPYRLISFMAKYVSGADL
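
The unlabelled middle line of each alignment block above appears to follow the usual BLAST pairwise convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Assuming that convention holds here, a rough sketch for tallying a midline could look like this. The function name is hypothetical, and the excerpt is the first 21 columns of the first GenBank hit.

```python
def midline_counts(midline):
    """Return (identities, positives, columns) for a BLAST-style midline.

    ASSUMPTION: letters are identities, '+' is a conservative substitution,
    and spaces are mismatches or gaps.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# First 21 alignment columns of the XP_022157281.1 hit.
query = "LSQNSSRETMVDKNNDISRLQ"
midline = "L +NSSRET+V+KNNDISRLQ"

identities, positives, columns = midline_counts(midline)
print(f"{identities}/{columns} identical, {positives}/{columns} positive")
```

The %identity values on this page come from the search tool over the full alignment, so a tally over a short excerpt is not expected to reproduce them.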


Sequences
CDS sequence
ATGCCTGCCTCCGACATTGCTGCTCGATCATGCCCATGCCTTAACCTCGGCGATACTTCGGCTGAAGACCTTCTGCTCTCTTTACGTCCTTGCCACCTCCCTCCCTTCCT
TCTTGAAACTTCCTCCATTATAGAGGGATATGACCTTCTTTTTAGCTTCACTTTTAGCATTCCCTTCAGATGCTGTTGGCGTGTTCGTCTTCTCTCCCGTCGAGATTCGT
CATTCTTAAATCACGAGGGTTTCCATTTCAGTGGCAGGGATTTTTCTTCATCTAATCTGGTGGTTTCCCGCTTGTTCATGTCTTCTCCAGTAGGGGGGGGGTTATATTCT
TCAAGAAGTGGGTTCTGTCTTTCGCAGAATTCAAGCAGGGAGACGATGGTAGATAAGAACAACGACATCTCTAGGTTACAGGCACGGGTTGTGGGGCTGGAAGAACATAG
ACGTGATCTGTTGCTAGAAAACAAACAACTGATGGAAAATGTTGCTGATTATCAGTCAAAAATTCTGAATCTTGAGAGGAGAATCTCTTCCACTCACTGTTCAGATCAAG
TCACAAAGGAGATGTTGAGTTCACAAGTGGATGCAGCTCGTATTCTGGTTGATAAATTGATTACAGAAAATGCAGAGCTTATTGGGAAGGTGAATGAGTTATTTGTTGAG
CTTCAAAGAGTTACAAAAACCGAGCTATCTTCAGCCAGGGAGCCTAACCAGATGGTTGGAGCTAGTTTCGATGATCCGGAGCCTTCATTGATTCAGAATACGGTCACATC
AGATAAAAGGTTGGACGCATTAGAATCTGTTCCAATCCACAGCCATAGCAATGCTAGTAATATTGTGGACCTGGACAATGATTTATTGGCCCCAACATCTTCAGTACCTA
TCGAGGCGGGAGAAATCGTACAAATTCCCTTGCATGAAAATGAAGATAGGGATAAGGATTGGGACCGAGAACTGCCAGTTGCAGTTGCAGAGAGCGATGAGAAGGATGTG
CTGCTGTCAGATGCTCCTCTCATTGGGGCTCCTTATCGGTTGATATCATTTATGGCCAAATACGTAAGCGGTGCTGACCTGGGCGACGTTGAACTGGCAAATGTTGAAGC
ACTGCCAAGCACTAACGGAAGGGTTTTGAAAAGGTCGATTGAATACTGTGCGGGAACAGTGTTGTCAAATGCTCCTCGTCCCGTTTCATATCCTATAAAGGCCAACAGAA
GCATCATGATGATAGATAAAACACATTATAAAATCATTCAGTGTGAAACAACGTTTTGGTTGAACATCAAGAAATTCTTATTGACGCATGAACTAGAACAGGGAAGTTTG
CACGAATGTGTAGCCTTGGAACTAGGAGTTAAGGACCTTGAGCTTGCGTTTGTCTGGTTTTGTCAACAGACGCGCTG
mRNA sequence
ATGCCTGCCTCCGACATTGCTGCTCGATCATGCCCATGCCTTAACCTCGGCGATACTTCGGCTGAAGACCTTCTGCTCTCTTTACGTCCTTGCCACCTCCCTCCCTTCCT
TCTTGAAACTTCCTCCATTATAGAGGGATATGACCTTCTTTTTAGCTTCACTTTTAGCATTCCCTTCAGATGCTGTTGGCGTGTTCGTCTTCTCTCCCGTCGAGATTCGT
CATTCTTAAATCACGAGGGTTTCCATTTCAGTGGCAGGGATTTTTCTTCATCTAATCTGGTGGTTTCCCGCTTGTTCATGTCTTCTCCAGTAGGGGGGGGGTTATATTCT
TCAAGAAGTGGGTTCTGTCTTTCGCAGAATTCAAGCAGGGAGACGATGGTAGATAAGAACAACGACATCTCTAGGTTACAGGCACGGGTTGTGGGGCTGGAAGAACATAG
ACGTGATCTGTTGCTAGAAAACAAACAACTGATGGAAAATGTTGCTGATTATCAGTCAAAAATTCTGAATCTTGAGAGGAGAATCTCTTCCACTCACTGTTCAGATCAAG
TCACAAAGGAGATGTTGAGTTCACAAGTGGATGCAGCTCGTATTCTGGTTGATAAATTGATTACAGAAAATGCAGAGCTTATTGGGAAGGTGAATGAGTTATTTGTTGAG
CTTCAAAGAGTTACAAAAACCGAGCTATCTTCAGCCAGGGAGCCTAACCAGATGGTTGGAGCTAGTTTCGATGATCCGGAGCCTTCATTGATTCAGAATACGGTCACATC
AGATAAAAGGTTGGACGCATTAGAATCTGTTCCAATCCACAGCCATAGCAATGCTAGTAATATTGTGGACCTGGACAATGATTTATTGGCCCCAACATCTTCAGTACCTA
TCGAGGCGGGAGAAATCGTACAAATTCCCTTGCATGAAAATGAAGATAGGGATAAGGATTGGGACCGAGAACTGCCAGTTGCAGTTGCAGAGAGCGATGAGAAGGATGTG
CTGCTGTCAGATGCTCCTCTCATTGGGGCTCCTTATCGGTTGATATCATTTATGGCCAAATACGTAAGCGGTGCTGACCTGGGCGACGTTGAACTGGCAAATGTTGAAGC
ACTGCCAAGCACTAACGGAAGGGTTTTGAAAAGGTCGATTGAATACTGTGCGGGAACAGTGTTGTCAAATGCTCCTCGTCCCGTTTCATATCCTATAAAGGCCAACAGAA
GCATCATGATGATAGATAAAACACATTATAAAATCATTCAGTGTGAAACAACGTTTTGGTTGAACATCAAGAAATTCTTATTGACGCATGAACTAGAACAGGGAAGTTTG
CACGAATGTGTAGCCTTGGAACTAGGAGTTAAGGACCTTGAGCTTGCGTTTGTCTGGTTTTGTCAACAGACGCGCTG
Protein sequence
MPASDIAARSCPCLNLGDTSAEDLLLSLRPCHLPPFLLETSSIIEGYDLLFSFTFSIPFRCCWRVRLLSRRDSSFLNHEGFHFSGRDFSSSNLVVSRLFMSSPVGGGLYS
SRSGFCLSQNSSRETMVDKNNDISRLQARVVGLEEHRRDLLLENKQLMENVADYQSKILNLERRISSTHCSDQVTKEMLSSQVDAARILVDKLITENAELIGKVNELFVE
LQRVTKTELSSAREPNQMVGASFDDPEPSLIQNTVTSDKRLDALESVPIHSHSNASNIVDLDNDLLAPTSSVPIEAGEIVQIPLHENEDRDKDWDRELPVAVAESDEKDV
LLSDAPLIGAPYRLISFMAKYVSGADLGDVELANVEALPSTNGRVLKRSIEYCAGTVLSNAPRPVSYPIKANRSIMMIDKTHYKIIQCETTFWLNIKKFLLTHELEQGSL
HECVALELGVKDLELAFVWFCQQTRX
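
As a sanity check on the sequences above, here is a minimal sketch of translating a CDS with the standard genetic code. The page does not say which tool produced the protein sequence, so the codon table choice, the function name, and the use of `X` for any incomplete or unrecognised codon are all assumptions.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1.

    Whitespace and line breaks are ignored. Any codon that is incomplete or
    not in the table is reported as 'X' (an assumption for this sketch).
    """
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq), 3):
        residues.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(residues)


# First 30 bases and last 11 bases of the CDS listed above.
print(translate("ATGCCTGCCTCCGACATTGCTGCTCGATCA"))
print(translate("CAGACGCGCTG"))
```

The CDS ends with two bases that do not form a full codon, which is consistent with the trailing `X` in the protein sequence.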