CuGenDBv2

Spg005880 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg005880
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: protein CHUP1, chloroplastic
Genome location: scaffold11:782958..786470
RNA-Seq Expression: Spg005880
Synteny: Spg005880
Gene Ontology terms: GO:0009658 - chloroplast organization (biological process)
                     GO:0009707 - chloroplast outer membrane (cellular component)
InterPro domains: IPR040265 - Protein CHUP1-like


Homology
GenBank top hits | e value | % identity | Alignment
KAG6607325.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | 5.0e-179 | 81.01
Query:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ
        MP EEDEELAMEI + LK+ELEI+LQKSNFLEKENQEL+QEL R KS + SLK HNNDRKSILWKKFHNS+D  VAG DS PQSP    A+D+W+ TR+Q
Subjt:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ

Query:  KQSSWGFVKENQRM-AAAPAPAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHG
        KQS+W  VKENQRM AAAP PAPPPPPPLP KLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG
Subjt:  KQSSWGFVKENQRM-AAAPAPAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHG

Query:  EFVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRREC
        EFVN LIR+VE   PRDI EVERFVKWLDGELASLVDERAVLKHFP+WPEGKADALREAAFSY+DLK LE EVC FR+NPKEE N ++KRAQALQDR   
Subjt:  EFVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRREC

Query:  TICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIV
                +EQSVSNVERTREFNCKKYN FQIPCQWMLDS LP QMKLSSLRL KE MRRIT+E+Q NETPQ ENLFLQGVRFAYRVHQYAGGFDSEAIV
Subjt:  TICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIV

Query:  AFEGMKKVGLS-SQRK
        AFEGMK+VGL  +QRK
Subjt:  AFEGMKKVGLS-SQRK

XP_022948306.1 protein CHUP1, chloroplastic [Cucurbita moschata] | 3.8e-179 | 81.49
Query:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ
        MP EEDEELAMEI + LK+ELEI+LQKS FLEKENQEL+QEL R KS IHSLKAHNNDRKSILWKKFHNS+D  VAG DS PQSP    A+D+W+ TR+Q
Subjt:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ

Query:  KQSSWGFVKENQRM-AAAPAPAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHG
        KQS+W  VKENQRM AAAP PAPPPPPPLP KLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG
Subjt:  KQSSWGFVKENQRM-AAAPAPAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHG

Query:  EFVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRREC
        EFVN LIR+VE   PRDI EVERFVKWLDGELASLVDERAVLKHFP+WPEGKADALREAAFSY+DLK LE EVC FR+NPKEE N ++KRAQALQDR   
Subjt:  EFVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRREC

Query:  TICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIV
                +EQSVSNVERTREFNCKKYN FQIPCQWMLDS LP QMKLSSLRL KE MRRIT+E Q NETPQ ENLFLQGVRFAYRVHQYAGGFDSEAIV
Subjt:  TICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIV

Query:  AFEGMKKVGLS-SQRK
        AFEGMK+VGL  +QRK
Subjt:  AFEGMKKVGLS-SQRK

XP_022998607.1 protein CHUP1, chloroplastic [Cucurbita maxima] | 1.2e-180 | 81.25
Query:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ
        MP EEDEELAMEI + LK+ELEI+LQKSNFLEKENQEL+QEL R KS + SLK HNNDRKSILWKKFHNS+DV+VAG DS PQSP    A+D+W+ TR+Q
Subjt:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ

Query:  KQSSWGFVKENQRM-AAAPAPAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHG
        KQS+W  VKENQRM AAAP PAPPPPPPLP KLL GSKAVRRVPEVLELYR +TKRDAQKENKA +GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG
Subjt:  KQSSWGFVKENQRM-AAAPAPAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHG

Query:  EFVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRREC
        EFVN LIR+VE   PRDI EVERFVKWLDGELASLVDERAVLKHFP+WPEGKADALREAAFSY+DLK LE+EVC FR+NPKEE N ++KRAQALQDR   
Subjt:  EFVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRREC

Query:  TICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIV
                +EQSVSNVERTREFNC KYN FQIPCQWMLDS LP QMKLSSLRL KE MRRIT+ELQ NETPQ ENLFLQGVRFAYRVHQYAGGFDSEAIV
Subjt:  TICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIV

Query:  AFEGMKKVG-LSSQRK
        AFEGMK+VG L SQRK
Subjt:  AFEGMKKVG-LSSQRK

XP_023523072.1 protein CHUP1, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo] | 1.1e-181 | 81.49
Query:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ
        MP EEDEELAMEI + LK+ELEI+LQKSNFLEKENQEL+QEL R KS I SLKAHNNDRKSILWKKFHNS+DV+VAG DS PQSP    A+D+W+ TR+Q
Subjt:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ

Query:  KQSSWGFVKENQRM-AAAPAPAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHG
        KQS+W  VKENQRM AAAP PAPPPPPPLP KLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG
Subjt:  KQSSWGFVKENQRM-AAAPAPAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHG

Query:  EFVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRREC
        EFVN LIR+VE   PRDI EVERFVKWLDGEL SLVDERAVLKHFP+WPEGKADALREAAFSY+DLK LE+EVC FR+NPKEE N ++KRAQALQDR   
Subjt:  EFVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRREC

Query:  TICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIV
                +EQSVSNVERTREFNCKKYN FQIPCQWMLDS LP QMKLSSLRL KE MRRIT+E+Q NETPQ ENLFLQGVRFAYRVHQYAGGFDSEAIV
Subjt:  TICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIV

Query:  AFEGMKKVGLS-SQRK
        AFEGMK+VGL  +QRK
Subjt:  AFEGMKKVGLS-SQRK

XP_038896069.1 protein CHUP1, chloroplastic [Benincasa hispida] | 2.3e-179 | 81.64
Query:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ
        MPKEEDEELAMEI N LKKELEI+LQKSNFLE ENQELRQELGRLKSQI SLKAHNN+RKSILWKKFH+S+DV+VAGADS P SPAAAA   R + T+SQ
Subjt:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ

Query:  KQSSWGFVKENQRMAAAPAPAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE
        KQSSWG VKENQRM  APA APPPPPPLP KLL GSKAVRRVPEVLELYR+LTKRDAQKENKA HGG+P VAFTKNMIGEIENRSAYLSAIKSEVETHGE
Subjt:  KQSSWGFVKENQRMAAAPAPAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE

Query:  FVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRRECT
        FVNWLI++VE   PRDI+EVERFVKW+D +L SLVDERAVLKHFP+WPE KADALREAAFSYRDLKRLE+EVC FRDN KEE+NVV+KRAQALQDR    
Subjt:  FVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRRECT

Query:  ICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVA
               VEQSVSN+E+TREFN KKY  FQIP QWM DSALP QMKLSSLRL KE M RITRE++S ETPQAENLFLQGVRFAYRVHQ+AGGFDSEA V 
Subjt:  ICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVA

Query:  FEGMKKVGLSSQRK
        FE +KK GLSSQRK
Subjt:  FEGMKKVGLSSQRK

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LVK7 Uncharacterized protein | 5.4e-179 | 80.72
Query:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ
        MPKEEDE LAMEI N LKKELEI+LQKS FLEKENQELRQEL RL+SQI S KA NN+RKSILWKKFH+SID+SVAGADSPP SP A  A D+ + T+S 
Subjt:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ

Query:  KQSSWGFVKENQRMAAAPA-PAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHG
        KQSSW  VKE+ RM   PA P PPPPPPLP KLL GSKAVRRVPEVLELYR+LTKRDAQKENK AHGG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG
Subjt:  KQSSWGFVKENQRMAAAPA-PAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHG

Query:  EFVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRREC
        +FVNWLI++VE + PRDI+EVERFVKWLDG+LASLVDERAVLK+FP+WPE KADALREAAFSYRDLK LES+VC FRDNPKEEMNVV+KRAQALQDR   
Subjt:  EFVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRREC

Query:  TICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIV
                VEQSVSN+ERTREFNC+KY  FQIPCQWM DSALPTQ+K+S+LRLAKEYM RITRELQS ETPQ ENLFLQG RFAYRVHQYAGGFDSE I 
Subjt:  TICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIV

Query:  AFEGMKKVGLSSQRK
        AFEG+KK GLSSQRK
Subjt:  AFEGMKKVGLSSQRK

A0A1S3C4V9 protein CHUP1, chloroplastic isoform X1 | 7.3e-176 | 80
Query:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ
        MPKE+DEELAMEI + LKK+LEI+LQKS FLE+ENQELR EL RLKSQI SLKA NN+RKSILWKKFH+S+D++VAGADSPP +P A AA D+ + T+  
Subjt:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ

Query:  KQSSWGFVKENQRMAAAPAPA-PPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHG
        KQSSW  VKE+QRM A PA A PPPPPPLP KLL GSKAVRRVPEVL+LYR+LTKRDAQKENK AHGG P VAFTKNMIGEIENRSAYLSAIKSEVETHG
Subjt:  KQSSWGFVKENQRMAAAPAPA-PPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHG

Query:  EFVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRREC
        EFVNWLI++VE + PRDI+E E+FVKWLD +LASLVDERAVLKHFP+WPE KADALREAAFSYRDLK LES+VC FRDNPKEEMNVV+KRAQALQDR   
Subjt:  EFVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRREC

Query:  TICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIV
                VEQSVSN+ERTREFNCKKY  FQIPCQWM DSALPTQ+KLS+LRLAKEYM RITREL+S ET QAENLFLQGVRFAYRVHQYAGGFDSEAI 
Subjt:  TICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIV

Query:  AFEGMKKVGLSSQRK
        AFEG+KK GLSSQRK
Subjt:  AFEGMKKVGLSSQRK

A0A6J1DC83 protein CHUP1, chloroplastic | 7.8e-170 | 78.47
Query:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ
        MP+EEDEELAMEI  +L+KEL+IA+ KS+FLEKENQELRQELGRLKSQI SLKAHNNDRKS+LWKKF+NS+D     A+SPP       A+D+ + T+S 
Subjt:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ

Query:  -KQSSWGFVKENQRM-AAAPAPAP-PPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVET
         KQ  W  VKE+QRM   APAPAP PPPPPLP KLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGG PAVAFTKNMIGEIENRSAYL+AIKSEVET
Subjt:  -KQSSWGFVKENQRM-AAAPAPAP-PPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVET

Query:  HGEFVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRR
        HGEFVNWLI++VEG  PRDITEVERFV WLD EL SLVDERAVLKHFP+WPEGKADALREAAFSYRDLK LESEVC FRDNPKEEM VV+KRAQALQDR 
Subjt:  HGEFVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRR

Query:  ECTICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQS-NETPQAENLFLQGVRFAYRVHQYAGGFDSE
                  +EQSVSNVE+TREF+C KY NF+IPC+WM +S L  QMKLSSLRLAKEYMRRITRELQS + T QA+NL LQGVRFAYRVHQYAGGFDS+
Subjt:  ECTICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQS-NETPQAENLFLQGVRFAYRVHQYAGGFDSE

Query:  AIVAFEGMKKVGLSSQRK
        AI AFEG+KKVGLSSQRK
Subjt:  AIVAFEGMKKVGLSSQRK

A0A6J1G8X0 protein CHUP1, chloroplastic | 1.9e-179 | 81.49
Query:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ
        MP EEDEELAMEI + LK+ELEI+LQKS FLEKENQEL+QEL R KS IHSLKAHNNDRKSILWKKFHNS+D  VAG DS PQSP    A+D+W+ TR+Q
Subjt:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ

Query:  KQSSWGFVKENQRM-AAAPAPAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHG
        KQS+W  VKENQRM AAAP PAPPPPPPLP KLL GSKAVRRVPEVLELYR +TKRDAQKENKAA+GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG
Subjt:  KQSSWGFVKENQRM-AAAPAPAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHG

Query:  EFVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRREC
        EFVN LIR+VE   PRDI EVERFVKWLDGELASLVDERAVLKHFP+WPEGKADALREAAFSY+DLK LE EVC FR+NPKEE N ++KRAQALQDR   
Subjt:  EFVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRREC

Query:  TICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIV
                +EQSVSNVERTREFNCKKYN FQIPCQWMLDS LP QMKLSSLRL KE MRRIT+E Q NETPQ ENLFLQGVRFAYRVHQYAGGFDSEAIV
Subjt:  TICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIV

Query:  AFEGMKKVGLS-SQRK
        AFEGMK+VGL  +QRK
Subjt:  AFEGMKKVGLS-SQRK

A0A6J1K8G4 protein CHUP1, chloroplastic | 5.8e-181 | 81.25
Query:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ
        MP EEDEELAMEI + LK+ELEI+LQKSNFLEKENQEL+QEL R KS + SLK HNNDRKSILWKKFHNS+DV+VAG DS PQSP    A+D+W+ TR+Q
Subjt:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ

Query:  KQSSWGFVKENQRM-AAAPAPAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHG
        KQS+W  VKENQRM AAAP PAPPPPPPLP KLL GSKAVRRVPEVLELYR +TKRDAQKENKA +GG PAVAFTKNMIGEIENRSAYLSAIKSEVETHG
Subjt:  KQSSWGFVKENQRM-AAAPAPAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHG

Query:  EFVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRREC
        EFVN LIR+VE   PRDI EVERFVKWLDGELASLVDERAVLKHFP+WPEGKADALREAAFSY+DLK LE+EVC FR+NPKEE N ++KRAQALQDR   
Subjt:  EFVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRREC

Query:  TICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIV
                +EQSVSNVERTREFNC KYN FQIPCQWMLDS LP QMKLSSLRL KE MRRIT+ELQ NETPQ ENLFLQGVRFAYRVHQYAGGFDSEAIV
Subjt:  TICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIV

Query:  AFEGMKKVG-LSSQRK
        AFEGMK+VG L SQRK
Subjt:  AFEGMKKVG-LSSQRK

SwissProt top hits | e value | % identity | Alignment
Q9LI74 Protein CHUP1, chloroplastic | 2.1e-63 | 46.98
Query:  PAPAPPPPPPLPMKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIRQVEG
        P   PPPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV  L  +V  
Subjt:  PAPAPPPPPPLPMKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIRQVEG

Query:  VVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRRECTICCWIDLVEQS
            DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL +LE +V  F D+P       +K+   L           ++ VEQS
Subjt:  VVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRRECTICCWIDLVEQS

Query:  VSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQ----SNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
        V  + RTR+    +Y  F IP  W+ D+ +  ++KLSS++LAK+YM+R+  EL     S++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  VSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQ----SNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

Arabidopsis top hits | e value | % identity | Alignment
AT1G07120.1 FUNCTIONS IN: molecular_function unknown | 6.7e-97 | 46.97
Query:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ
        +P  ED+   +     L KEL+  L +++ LEKEN ELRQE+ RL++Q+ +LK+H N+RKS+LWKK  +S D S     S  ++P +  ++ +    R+ 
Subjt:  MPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPPQSPAAAAASDRWDPTRSQ

Query:  KQSSWGFVKENQRMAAAPAPAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE
                 +      + A  PPPPPPLP K   G ++VRR PEV+E YR+LTKR++   NK    G+ + AF +NMIGEIENRS YLS IKS+ + H +
Subjt:  KQSSWGFVKENQRMAAAPAPAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGE

Query:  FVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRRECT
         ++ LI +VE     DI+EVE FVKW+D EL+SLVDERAVLKHFP+WPE K D+LREAA +Y+  K L +E+  F+DNPK+ +   ++R Q+LQDR    
Subjt:  FVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRRECT

Query:  ICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVA
               +E+SV+N E+ R+   K+Y +FQIP +WMLD+ L  Q+K SSLRLA+EYM+RI +EL+SN + +  NL LQGVRFAY +HQ+AGGFD E +  
Subjt:  ICCWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVA

Query:  FEGMKKVGLSSQR
        F  +KK+     R
Subjt:  FEGMKKVGLSSQR

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein | 1.5e-64 | 46.98
Query:  PAPAPPPPPPLPMKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIRQVEG
        P   PPPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV  L  +V  
Subjt:  PAPAPPPPPPLPMKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIRQVEG

Query:  VVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRRECTICCWIDLVEQS
            DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL +LE +V  F D+P       +K+   L           ++ VEQS
Subjt:  VVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRRECTICCWIDLVEQS

Query:  VSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQ----SNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
        V  + RTR+    +Y  F IP  W+ D+ +  ++KLSS++LAK+YM+R+  EL     S++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  VSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQ----SNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein | 1.5e-64 | 46.98
Query:  PAPAPPPPPPLPMKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIRQVEG
        P   PPPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV  L  +V  
Subjt:  PAPAPPPPPPLPMKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIRQVEG

Query:  VVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRRECTICCWIDLVEQS
            DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL +LE +V  F D+P       +K+   L           ++ VEQS
Subjt:  VVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRRECTICCWIDLVEQS

Query:  VSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQ----SNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
        V  + RTR+    +Y  F IP  W+ D+ +  ++KLSS++LAK+YM+R+  EL     S++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  VSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQ----SNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein | 1.5e-64 | 46.98
Query:  PAPAPPPPPPLPMKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIRQVEG
        P   PPPPPP P  L     G   V R PE++E Y+SL KR+++KE   +  + G   + A   NMIGEIENRS +L A+K++VET G+FV  L  +V  
Subjt:  PAPAPPPPPPLPMKL---LAGSKAVRRVPEVLELYRSLTKRDAQKE---NKAAHGGLPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIRQVEG

Query:  VVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRRECTICCWIDLVEQS
            DI ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL +LE +V  F D+P       +K+   L           ++ VEQS
Subjt:  VVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRRECTICCWIDLVEQS

Query:  VSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQ----SNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
        V  + RTR+    +Y  F IP  W+ D+ +  ++KLSS++LAK+YM+R+  EL     S++ P  E L LQGVRFA+RVHQ+AGGFD+E++ AFE ++
Subjt:  VSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQ----SNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 4.4e-72 | 48.67
Query:  AAAPAPAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVA-------FTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIR
        + + AP PPPPPP P  L   S  VRRVPEV+E Y SL +RD+    + + GG  A A         ++MIGEIENRS YL AIK++VET G+F+ +LI+
Subjt:  AAAPAPAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVA-------FTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIR

Query:  QVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRRECTICCWIDL
        +V      DI +V  FVKWLD EL+ LVDERAVLKHF +WPE KADALREAAF Y DLK+L SE   FR++P++  +  +K+ QAL ++           
Subjt:  QVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRRECTICCWIDL

Query:  VEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNE--TPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK
        +E  V ++ R RE    K+ +FQIP  WML++ + +Q+KL+S++LA +YM+R++ EL++ E   P+ E L +QGVRFA+RVHQ+AGGFD+E + AFE ++
Subjt:  VEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNE--TPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMK


Sequences
CDS sequence
ATGAACCCAAAAAGCCAAGCAAATTCTACTTGGCTGGCTTCTCCTTCACCCCATTTCTTCAAAAACCCACAAAATCAAAGGAGAATGCCAAAGGAAGAAGATGAAGAACT
GGCCATGGAGATCAACAACACCCTCAAAAAAGAACTCGAAATCGCTCTGCAGAAATCTAATTTTCTCGAGAAAGAAAATCAAGAACTCCGGCAAGAATTGGGTCGATTGA
AATCCCAGATTCACTCTCTCAAGGCTCACAACAATGACAGAAAATCCATTCTCTGGAAGAAATTTCATAATTCCATCGACGTCTCCGTCGCCGGAGCCGACTCGCCGCCG
CAGAGTCCGGCGGCGGCGGCGGCGTCGGACAGATGGGACCCGACCAGATCGCAGAAACAGAGCAGCTGGGGCTTTGTGAAGGAGAATCAGAGAATGGCGGCGGCACCGGC
TCCGGCACCGCCGCCACCGCCGCCGCTTCCGATGAAGCTGCTCGCCGGATCGAAGGCAGTGCGGCGAGTCCCGGAAGTGTTGGAGCTGTACCGTTCACTGACGAAAAGGG
ATGCGCAGAAGGAAAACAAGGCCGCACACGGCGGACTTCCGGCGGTGGCGTTCACCAAAAATATGATCGGAGAAATTGAGAACCGATCGGCTTATCTGTCTGCAATAAAA
TCGGAGGTGGAGACACATGGAGAGTTTGTGAACTGGTTGATAAGACAAGTAGAGGGCGTAGTGCCGAGGGACATTACGGAGGTGGAGAGGTTTGTGAAGTGGCTGGACGG
GGAGCTGGCGTCGCTGGTGGACGAGAGGGCGGTGCTGAAGCACTTCCCGCAGTGGCCGGAGGGGAAGGCGGATGCGCTGAGGGAGGCGGCGTTCAGCTACAGGGACTTGA
AGAGGTTAGAAAGTGAAGTGTGTGGGTTCAGAGACAATCCAAAGGAGGAGATGAATGTGGTTATGAAGAGGGCTCAGGCGTTGCAAGACAGGCGAGAATGTACAATCTGT
TGTTGGATTGATTTGGTGGAGCAGAGCGTGAGCAATGTGGAGAGGACGAGGGAGTTCAATTGTAAAAAGTACAACAATTTTCAAATCCCCTGCCAATGGATGTTGGACTC
TGCCCTGCCCACTCAGATGAAGTTGAGCTCATTGAGACTGGCGAAGGAATACATGCGAAGGATAACAAGAGAACTACAATCAAACGAAACCCCACAAGCAGAAAACCTTT
TTCTTCAAGGGGTTCGCTTTGCTTACAGGGTTCACCAGTACGCAGGTGGTTTTGATTCGGAGGCTATAGTGGCTTTTGAAGGAATGAAGAAAGTTGGGCTTAGTAGTCAG
AGAAAATAG
mRNA sequence
ATGAACCCAAAAAGCCAAGCAAATTCTACTTGGCTGGCTTCTCCTTCACCCCATTTCTTCAAAAACCCACAAAATCAAAGGAGAATGCCAAAGGAAGAAGATGAAGAACT
GGCCATGGAGATCAACAACACCCTCAAAAAAGAACTCGAAATCGCTCTGCAGAAATCTAATTTTCTCGAGAAAGAAAATCAAGAACTCCGGCAAGAATTGGGTCGATTGA
AATCCCAGATTCACTCTCTCAAGGCTCACAACAATGACAGAAAATCCATTCTCTGGAAGAAATTTCATAATTCCATCGACGTCTCCGTCGCCGGAGCCGACTCGCCGCCG
CAGAGTCCGGCGGCGGCGGCGGCGTCGGACAGATGGGACCCGACCAGATCGCAGAAACAGAGCAGCTGGGGCTTTGTGAAGGAGAATCAGAGAATGGCGGCGGCACCGGC
TCCGGCACCGCCGCCACCGCCGCCGCTTCCGATGAAGCTGCTCGCCGGATCGAAGGCAGTGCGGCGAGTCCCGGAAGTGTTGGAGCTGTACCGTTCACTGACGAAAAGGG
ATGCGCAGAAGGAAAACAAGGCCGCACACGGCGGACTTCCGGCGGTGGCGTTCACCAAAAATATGATCGGAGAAATTGAGAACCGATCGGCTTATCTGTCTGCAATAAAA
TCGGAGGTGGAGACACATGGAGAGTTTGTGAACTGGTTGATAAGACAAGTAGAGGGCGTAGTGCCGAGGGACATTACGGAGGTGGAGAGGTTTGTGAAGTGGCTGGACGG
GGAGCTGGCGTCGCTGGTGGACGAGAGGGCGGTGCTGAAGCACTTCCCGCAGTGGCCGGAGGGGAAGGCGGATGCGCTGAGGGAGGCGGCGTTCAGCTACAGGGACTTGA
AGAGGTTAGAAAGTGAAGTGTGTGGGTTCAGAGACAATCCAAAGGAGGAGATGAATGTGGTTATGAAGAGGGCTCAGGCGTTGCAAGACAGGCGAGAATGTACAATCTGT
TGTTGGATTGATTTGGTGGAGCAGAGCGTGAGCAATGTGGAGAGGACGAGGGAGTTCAATTGTAAAAAGTACAACAATTTTCAAATCCCCTGCCAATGGATGTTGGACTC
TGCCCTGCCCACTCAGATGAAGTTGAGCTCATTGAGACTGGCGAAGGAATACATGCGAAGGATAACAAGAGAACTACAATCAAACGAAACCCCACAAGCAGAAAACCTTT
TTCTTCAAGGGGTTCGCTTTGCTTACAGGGTTCACCAGTACGCAGGTGGTTTTGATTCGGAGGCTATAGTGGCTTTTGAAGGAATGAAGAAAGTTGGGCTTAGTAGTCAG
AGAAAATAG
Protein sequence
MNPKSQANSTWLASPSPHFFKNPQNQRRMPKEEDEELAMEINNTLKKELEIALQKSNFLEKENQELRQELGRLKSQIHSLKAHNNDRKSILWKKFHNSIDVSVAGADSPP
QSPAAAAASDRWDPTRSQKQSSWGFVKENQRMAAAPAPAPPPPPPLPMKLLAGSKAVRRVPEVLELYRSLTKRDAQKENKAAHGGLPAVAFTKNMIGEIENRSAYLSAIK
SEVETHGEFVNWLIRQVEGVVPRDITEVERFVKWLDGELASLVDERAVLKHFPQWPEGKADALREAAFSYRDLKRLESEVCGFRDNPKEEMNVVMKRAQALQDRRECTIC
CWIDLVEQSVSNVERTREFNCKKYNNFQIPCQWMLDSALPTQMKLSSLRLAKEYMRRITRELQSNETPQAENLFLQGVRFAYRVHQYAGGFDSEAIVAFEGMKKVGLSSQ
RK