; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0005948 (gene) of Chayote v1 genome

Gene IDSed0005948
OrganismSechium edule (Chayote v1)
DescriptionBEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein .
Genome locationLG06:36140370..36147882
RNA-Seq ExpressionSed0005948
SyntenySed0005948
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581290.1 hypothetical protein SDJN03_21292, partial [Cucurbita argyrosperma subsp. sororia]7.1e-12889.3Show/hide
Query:  MGVESNSSAPP-----SSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDD
        MGVESNS  PP     SSS STPSPSGKRARDPDDEVYLDNFHS KRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMS QYSPMSEDSDD
Subjt:  MGVESNSSAPP-----SSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDD

Query:  CRFCETSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGP
        CRFCETSTNLFPSQSDSSVPTSPVSPYRY RPFSGM PSTGT+TSLGC+TGP+TSL PHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GP
Subjt:  CRFCETSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGP

Query:  SPMELPYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSMGVTEPEYNEQKSCEDLNRDMKDSESGG
        S MELPYCSMPEPGP+I+AEER    IK+LVDERVYQL ECSSMGV+EPEYNEQKSC+DLNR+MKDSESGG
Subjt:  SPMELPYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSMGVTEPEYNEQKSCEDLNRDMKDSESGG

XP_008454305.1 PREDICTED: uncharacterized protein LOC103494744 isoform X1 [Cucumis melo]6.6e-12688.15Show/hide
Query:  MGVESNSS--APPSSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDDCRF
        MGVESNS+   PP+SS STPSPSGKRARDP+DEVYLDNFHS KRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLY RDEMS QYSPMSEDSDDCRF
Subjt:  MGVESNSS--APPSSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPSPM
        CETSTNLFPSQSDSSVPTSPVSPYRY RPFSGM PS GT+TSLGCST P+TSL PHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPS M
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPSPM

Query:  ELPYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSM--GVTEPEYNEQKSCEDLNRDMKDSESGG
        ELPYCSMPEPGP+I+AE+RP  CIK+LVDERVYQLEECSSM  GV+E EYNEQKSC+DLNRDMKDS+SGG
Subjt:  ELPYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSM--GVTEPEYNEQKSCEDLNRDMKDSESGG

XP_022934215.1 uncharacterized protein LOC111441454 isoform X1 [Cucurbita moschata]5.4e-12889.63Show/hide
Query:  MGVESNSSAPP----SSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDDC
        MGVESNS  PP    SSS STPSPSGKRARDPDDEVYLDNFHS KRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMS QYSPMSEDSDDC
Subjt:  MGVESNSSAPP----SSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDDC

Query:  RFCETSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPS
        RFCETSTNLFPSQSDSSVPTSPVSPYRY RPFSGM PSTGT+TSLGC+TGP+TSL PHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPS
Subjt:  RFCETSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPS

Query:  PMELPYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSMGVTEPEYNEQKSCEDLNRDMKDSESGG
         MELPYCSMPEPGP+I+AEER    IK+LVDERVYQL ECSSMGV+EPEYNEQKSC+DLNR+MKDSESGG
Subjt:  PMELPYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSMGVTEPEYNEQKSCEDLNRDMKDSESGG

XP_023526160.1 uncharacterized protein LOC111789720 [Cucurbita pepo subsp. pepo]7.8e-12787.96Show/hide
Query:  MGVESNSSAPP--------SSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSED
        MGVESNS+ PP        SSS STPSPSGKRARDPDDEVYLDNFHS KRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMS QYSPMSED
Subjt:  MGVESNSSAPP--------SSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSED

Query:  SDDCRFCETSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP
        SDDCRFCETSTNLFPSQSDSSVPTSPVSPYRY RPFSGM PSTGT+TSLGC+TGP+TSL PHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP
Subjt:  SDDCRFCETSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP

Query:  LGPSPMELPYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSMGVTEPEYNEQKSCEDLNRDMKDSESGG
         GPS MELPYCSMPEPGP+I+AEER    IK+LVDERVYQL ECSSMGV+EPEYNEQKSC+DLN +MKDSESGG
Subjt:  LGPSPMELPYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSMGVTEPEYNEQKSCEDLNRDMKDSESGG

XP_038904570.1 uncharacterized protein LOC120090942 [Benincasa hispida]3.7e-12990.23Show/hide
Query:  MGVESNSSAPPSSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDDCRFCE
        MGVESNS+ PPSSS STPSPSGKRARDPDDEVYLDNFHS KRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLY R+EMS QYSPMSEDSDDCRFCE
Subjt:  MGVESNSSAPPSSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDDCRFCE

Query:  TSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPSPMEL
        TSTNLFP+QSDSSVPTSPVSPYRY RPFSG+ PSTGT+TSLGCST P+TSL PHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPS MEL
Subjt:  TSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPSPMEL

Query:  PYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSMGVTEPEYNEQKSCEDLNRDMKDSESGG
        PYCSMPEPGP+I+AEERP  CIK+LVDER +QLEECSSMGV+EPEYNE+KSC+DLNRDMKDSESGG
Subjt:  PYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSMGVTEPEYNEQKSCEDLNRDMKDSESGG

TrEMBL top hitse value%identityAlignment
A0A0A0KWN0 Uncharacterized protein1.6e-12588.89Show/hide
Query:  MGVESNSS-APPSSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDDCRFC
        MGVESNS+  PP+SS STPSPSGKRARDPDDEVYLDNFHS KRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLY RDEMS QYSPMSEDSDDCRFC
Subjt:  MGVESNSS-APPSSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDDCRFC

Query:  ETSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCS-TGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPSPM
        ETSTNLFPSQSDSSVPTSPVSPYRY RPFSG+ PSTGT+TSLGCS T P+TSL PHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPS M
Subjt:  ETSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCS-TGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPSPM

Query:  ELPYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSM--GVTEPEYNEQKSCEDLNRDMKDSESGG
        ELPYCSMPEPGP+I+AE+RP  CIK+LVDERVYQLEECSSM  GV+E EYNEQKSC+DLNRDMKDS SGG
Subjt:  ELPYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSM--GVTEPEYNEQKSCEDLNRDMKDSESGG

A0A1S3BXT2 uncharacterized protein LOC103494744 isoform X13.2e-12688.15Show/hide
Query:  MGVESNSS--APPSSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDDCRF
        MGVESNS+   PP+SS STPSPSGKRARDP+DEVYLDNFHS KRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLY RDEMS QYSPMSEDSDDCRF
Subjt:  MGVESNSS--APPSSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPSPM
        CETSTNLFPSQSDSSVPTSPVSPYRY RPFSGM PS GT+TSLGCST P+TSL PHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPS M
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPSPM

Query:  ELPYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSM--GVTEPEYNEQKSCEDLNRDMKDSESGG
        ELPYCSMPEPGP+I+AE+RP  CIK+LVDERVYQLEECSSM  GV+E EYNEQKSC+DLNRDMKDS+SGG
Subjt:  ELPYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSM--GVTEPEYNEQKSCEDLNRDMKDSESGG

A0A5A7TRC2 Uncharacterized protein3.2e-12688.15Show/hide
Query:  MGVESNSS--APPSSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDDCRF
        MGVESNS+   PP+SS STPSPSGKRARDP+DEVYLDNFHS KRYLSEIMASSLNGLTVG+PLSENLMDSPARSESMLY RDEMS QYSPMSEDSDDCRF
Subjt:  MGVESNSS--APPSSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDDCRF

Query:  CETSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPSPM
        CETSTNLFPSQSDSSVPTSPVSPYRY RPFSGM PS GT+TSLGCST P+TSL PHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPS M
Subjt:  CETSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPSPM

Query:  ELPYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSM--GVTEPEYNEQKSCEDLNRDMKDSESGG
        ELPYCSMPEPGP+I+AE+RP  CIK+LVDERVYQLEECSSM  GV+E EYNEQKSC+DLNRDMKDS+SGG
Subjt:  ELPYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSM--GVTEPEYNEQKSCEDLNRDMKDSESGG

A0A6J1F722 uncharacterized protein LOC111441454 isoform X12.6e-12889.63Show/hide
Query:  MGVESNSSAPP----SSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDDC
        MGVESNS  PP    SSS STPSPSGKRARDPDDEVYLDNFHS KRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMS QYSPMSEDSDDC
Subjt:  MGVESNSSAPP----SSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDDC

Query:  RFCETSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPS
        RFCETSTNLFPSQSDSSVPTSPVSPYRY RPFSGM PSTGT+TSLGC+TGP+TSL PHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPS
Subjt:  RFCETSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPS

Query:  PMELPYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSMGVTEPEYNEQKSCEDLNRDMKDSESGG
         MELPYCSMPEPGP+I+AEER    IK+LVDERVYQL ECSSMGV+EPEYNEQKSC+DLNR+MKDSESGG
Subjt:  PMELPYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSMGVTEPEYNEQKSCEDLNRDMKDSESGG

A0A6J1J464 uncharacterized protein LOC111482488 isoform X11.0e-12488.01Show/hide
Query:  MGVESNSS-APPSSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDDCRFC
        MGVESNS+  PP SS STPSPSGKRARDPDDEVYLDNFHS KRYLSEIMASSLNGLTVGD LSENLMDSPARSESMLYLRDEMS QYSPMSEDSDDCRFC
Subjt:  MGVESNSS-APPSSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDDCRFC

Query:  ETSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPSPME
        ETSTNLFPSQSD+SVPTSPVSPYRY RPFSGM PST T+TSLGC+T P+TSL PHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQP GPS ME
Subjt:  ETSTNLFPSQSDSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPSPME

Query:  LPYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSMGVTEPEYNEQKSCEDLNRDMKDSESGG
        LPYCSMPEPGP+I+AEER    IK+LVDERVYQL ECS+MGV+EPEYNEQKSC+DLNR+MKD ESGG
Subjt:  LPYCSMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSMGVTEPEYNEQKSCEDLNRDMKDSESGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G25920.1 BEST Arabidopsis thaliana protein match is: 3'-5' exonuclease domain-containing protein / K homology domain-containing protein / KH domain-containing protein (TAIR:AT2G25910.2)9.1e-5753.75Show/hide
Query:  STPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDDCRFCE--TSTNLFPSQSDSS
        S  SP GKR RDP+DEVYLDN  SQKRYLSEIMA SLNGLTVGD L  N+++SPARSES LY RD++S QYSPMSEDSD+ RFCE  T+T    S    S
Subjt:  STPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDDCRFCE--TSTNLFPSQSDSS

Query:  VPTSPVSPYRYPRPFSGMNPSTGTST----------SLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPSPMELPYC
         PTSPVSPYRY RP +  N    + T          S+  +    T+    QRGSD+EGRFPSSPSDICHS DLRR ALLRSVQMR QP G       Y 
Subjt:  VPTSPVSPYRYPRPFSGMNPSTGTST----------SLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPSPMELPYC

Query:  SMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSMGVTEPEYNEQKSCEDLN
        S   P  +ID EER   C K++ ++R Y   E           ++ KSC+ L+
Subjt:  SMPEPGPSIDAEERPYPCIKALVDERVYQLEECSSMGVTEPEYNEQKSCEDLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGTCGAATCGAACTCGTCGGCGCCGCCGTCATCGTCGCCTTCTACGCCGTCTCCGAGCGGGAAGAGAGCCAGAGATCCTGACGATGAAGTTTATCTCGACAATTT
CCACTCTCAGAAGCGCTACCTCAGTGAGATAATGGCGTCTAGTTTGAACGGATTGACGGTTGGGGATCCCCTTTCAGAGAACCTCATGGATTCCCCTGCTAGGTCGGAGT
CCATGCTTTATTTAAGGGATGAAATGTCCGGGCAATATTCTCCTATGTCGGAAGATTCAGATGACTGCCGATTTTGTGAGACATCGACAAACTTGTTTCCATCTCAGTCT
GATAGCAGTGTACCTACCAGTCCAGTCTCTCCATATCGATATCCAAGGCCATTCAGTGGAATGAATCCTTCAACAGGTACTAGTACTTCACTTGGATGTTCTACTGGTCC
TATCACTAGCTTGCATCCCCATCAACGTGGATCAGATTCCGAAGGTCGTTTCCCATCTTCTCCTAGTGATATATGCCACTCAGCAGACTTGAGAAGGGCTGCGCTCTTGC
GCTCGGTACAAATGAGAGCACAACCTCTTGGTCCATCACCTATGGAGTTGCCATATTGCTCAATGCCGGAGCCTGGACCTAGTATAGATGCCGAAGAGCGCCCTTATCCT
TGCATAAAAGCGTTGGTCGATGAAAGAGTTTATCAACTCGAGGAATGCTCCTCAATGGGAGTGACTGAGCCTGAATATAATGAACAAAAATCATGCGAGGACTTGAACAG
AGATATGAAAGACAGTGAGTCTGGAGGGTAG
mRNA sequenceShow/hide mRNA sequence
GCAATTTTGAAAACGAAAAGGAAAAACCAGAAGAAACCACTTTTGTGGTTCTTCTTCTACAATCCAAAAGAATGATGGTTCCCATTTCCCAAAATTGTTCGCATTATAAT
ATCCTCTTTTCCATACCAAATGATCGTTGAGTTTCAACTTTTGCTCTTTAATCTTTTGAAGCTGCATTGATGGGCGTCGAATCGAACTCGTCGGCGCCGCCGTCATCGTC
GCCTTCTACGCCGTCTCCGAGCGGGAAGAGAGCCAGAGATCCTGACGATGAAGTTTATCTCGACAATTTCCACTCTCAGAAGCGCTACCTCAGTGAGATAATGGCGTCTA
GTTTGAACGGATTGACGGTTGGGGATCCCCTTTCAGAGAACCTCATGGATTCCCCTGCTAGGTCGGAGTCCATGCTTTATTTAAGGGATGAAATGTCCGGGCAATATTCT
CCTATGTCGGAAGATTCAGATGACTGCCGATTTTGTGAGACATCGACAAACTTGTTTCCATCTCAGTCTGATAGCAGTGTACCTACCAGTCCAGTCTCTCCATATCGATA
TCCAAGGCCATTCAGTGGAATGAATCCTTCAACAGGTACTAGTACTTCACTTGGATGTTCTACTGGTCCTATCACTAGCTTGCATCCCCATCAACGTGGATCAGATTCCG
AAGGTCGTTTCCCATCTTCTCCTAGTGATATATGCCACTCAGCAGACTTGAGAAGGGCTGCGCTCTTGCGCTCGGTACAAATGAGAGCACAACCTCTTGGTCCATCACCT
ATGGAGTTGCCATATTGCTCAATGCCGGAGCCTGGACCTAGTATAGATGCCGAAGAGCGCCCTTATCCTTGCATAAAAGCGTTGGTCGATGAAAGAGTTTATCAACTCGA
GGAATGCTCCTCAATGGGAGTGACTGAGCCTGAATATAATGAACAAAAATCATGCGAGGACTTGAACAGAGATATGAAAGACAGTGAGTCTGGAGGGTAGTTAATGCTTT
AACGAAGTGTCTGGAAAATGTCTTCTAATATTACAGATGATGCTGCTGTGGGACGGAAATTTATCCCTTTTGGTTCAATCACCTTGGAAAATTCTCTTTTTGAATATAGC
TGCCGAATTTACTTGCTTTCGCTTTTTTGAGAATGGTTTGGTCTTTCCCAAATTCGTGGTGGTTTGTGCCGTCTTCATTGAGCATGTGCGTGAGCTCTGTATCTTTTTTG
TCGTTCGAAGCACGAGATCGTGAAAGATGTTCGTTGTTGAGCATGTGTAGGAGTCTGTTGTATCTATAATGATTATATCAAATGTCGAATTCATTGGCGATCTTCGTTGT
CGAGCATGTGGTGAAGCTTTGTCTATGTATTTAATTCTACTAGAGATCTGTCTATCAAACTACCTTGGGAACTTGCAGGCAAAATTGTTGGGAGAACCAAAAGATGAATA
GAGAGATGAAATTATATTTTGAGTTTCATCACTTGCGT
Protein sequenceShow/hide protein sequence
MGVESNSSAPPSSSPSTPSPSGKRARDPDDEVYLDNFHSQKRYLSEIMASSLNGLTVGDPLSENLMDSPARSESMLYLRDEMSGQYSPMSEDSDDCRFCETSTNLFPSQS
DSSVPTSPVSPYRYPRPFSGMNPSTGTSTSLGCSTGPITSLHPHQRGSDSEGRFPSSPSDICHSADLRRAALLRSVQMRAQPLGPSPMELPYCSMPEPGPSIDAEERPYP
CIKALVDERVYQLEECSSMGVTEPEYNEQKSCEDLNRDMKDSESGG