; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS013571 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS013571
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionprotein CHUP1, chloroplastic
Genome locationscaffold402:2522172..2524576
RNA-Seq ExpressionMS013571
SyntenyMS013571
Gene Ontology termsNA
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052630.1 protein CHUP1 [Cucumis melo var. makuwa]5.2e-12656.61Show/hide
Query:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM
        TELLR+VEELRDRE+RLK +LLE+K LKES AIVPVLEN I  K+ EIERA   I  L+AE ER++ ++EEV Q +E ERR S+E M  M+ E    K+M
Subjt:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM

Query:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQ--------------ELKAYAKNGAIIERPNYSECNSEELAESALFNPISDRFTIPESP
          DR   +   ++DE S S++F+ LMEVS  SNLI NLK                 E     K     ERP +S CNSEELAES L N  S    +P+ P
Subjt:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQ--------------ELKAYAKNGAIIERPNYSECNSEELAESALFNPISDRFTIPESP

Query:  ----------------KTTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSVPAPPPSKEEVDL---AKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPT
                         +T    D  K +   P VP KP+  P PP S S P PPP   +      AKVRR+PEVVEFYHSLMR DSR+D  S V D P+
Subjt:  ----------------KTTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSVPAPPPSKEEVDL---AKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPT

Query:  GANERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFE
         AN R+MIGE+EN SA+LLAIK DVET+G+FI  LIKEVENA  TDIE+VV F KWLDDELSYLVDERAVLKHF+W EQKAD +REAAFGY D+KKL  E
Subjt:  GANERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFE

Query:  ASSFGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTV-YGCSKEQLIIQGVR
        ASSF  D RQ     LKKMQA LEKLEHGVYNL RMRESA  RY+       WMLD+G +SQIKL S KLA++YM+RVSAEL+TV  G  +E+LI+QGVR
Subjt:  ASSFGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTV-YGCSKEQLIIQGVR

Query:  FGFRVHQ
        F FRVHQ
Subjt:  FGFRVHQ

KAG6581351.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.1e-12355.45Show/hide
Query:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM
        TELLR+VEELRDRE+RLK +LLE+K LKES AIVP+LEN I +K+ E+ERA   I  L+AE ER++ E+EEV Q  E +RR  +E +  M+ E    K+M
Subjt:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM

Query:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQEL-------------KAYAKNGAII-ERPNYSECNSEELAESALFNPISDRFTIPESP
          DR   +   ++DE S S++F+ LMEVS  SNLI NLK   +              +  AKN  ++ E P +S CNSEE AES L N  S    +P+ P
Subjt:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQEL-------------KAYAKNGAII-ERPNYSECNSEELAESALFNPISDRFTIPESP

Query:  K---------------TTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSVPAPPPSKEEVDL-AKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTGAN
        K               +TV   D  K +   P VP K +  PS       P PPP K +  + AKVRR+PEVVEFYHSLMR DSR+D  S VMD P+ A 
Subjt:  K---------------TTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSVPAPPPSKEEVDL-AKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTGAN

Query:  ERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEASS
         R+MIGE+EN SA+LLAIK DVET+G+FI  LIKEVENA  TDIE+VV F KWLDDELSYLVDERAVLKHF+W EQKAD +REAAFGY D+KKL  EASS
Subjt:  ERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEASS

Query:  FGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTV--YGCSKEQLIIQGVRFG
        F  D RQ     LKKMQA LEKLEHG+YNL R+RESAT RY+       WMLDTG +SQIKL   KLA++YM+RVSAEL+TV   G  +E+LI+QGVRF 
Subjt:  FGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTV--YGCSKEQLIIQGVRFG

Query:  FRVHQ
        FRVHQ
Subjt:  FRVHQ

KAG7034636.1 Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-12355.45Show/hide
Query:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM
        TELLR+VEELRDRE+RLK +LLE+K LKES AIVP+LEN I +K+ E+ERA   I  L+AE ER++ E+EEV Q  E +RR  +E +  M+ E    K+M
Subjt:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM

Query:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQELK-------------AYAKNGAII-ERPNYSECNSEELAESALFNPISDRFTIPESP
          DR   +   ++DE S S++F+ LMEVS  SNLI NLK   +                 AKN  ++ E P +S CNSEE AES L N  S    +P+ P
Subjt:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQELK-------------AYAKNGAII-ERPNYSECNSEELAESALFNPISDRFTIPESP

Query:  K---------------TTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSVPAPPPSKEEVDL-AKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTGAN
        K               +TV   D  K +   P VP K +  PS       P PPP K +  + AKVRR+PEVVEFYHSLMR DSR+D  S VMD P+ A 
Subjt:  K---------------TTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSVPAPPPSKEEVDL-AKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTGAN

Query:  ERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEASS
         R+MIGE+EN SA+LLAIK DVET+G+FI  LIKEVENA  TDIE+VV F KWLDDELSYLVDERAVLKHF+W EQKAD +REAAFGY D+KKL  EASS
Subjt:  ERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEASS

Query:  FGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTV--YGCSKEQLIIQGVRFG
        F  D RQ     LKKMQA LEKLEHG+YNL R+RESAT RY+       WMLDTG +SQIKL   KLA++YM+RVSAEL+TV   G  +E+LI+QGVRF 
Subjt:  FGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTV--YGCSKEQLIIQGVRFG

Query:  FRVHQ
        FRVHQ
Subjt:  FRVHQ

XP_008439756.1 PREDICTED: protein CHUP1, chloroplastic [Cucumis melo]2.6e-12556.52Show/hide
Query:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM
        TELLR+VEELRDRE+RLK +LLE+K LKES AIVPVLEN I  K+ EIERA   I  L+AE ER++ ++EEV Q +E ERR S+E +  M+ E    K+M
Subjt:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM

Query:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQ--------------ELKAYAKNGAIIERPNYSECNSEELAESALFNPISDRFTIPESP
          DR   +   ++DE S S++F+ LMEVS  SNLI NLK                 E     K     ERP +S CNSEELAES L N  S    +P  P
Subjt:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQ--------------ELKAYAKNGAIIERPNYSECNSEELAESALFNPISDRFTIPESP

Query:  ---------------KTTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSVPAPPPSKEEVDL---AKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTG
                        +T    D  K +   P VP KP+  P PP S S P PPP   +      AKVRR+PEVVEFYHSLMR DSR+D  S V D P+ 
Subjt:  ---------------KTTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSVPAPPPSKEEVDL---AKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTG

Query:  ANERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEA
        AN R+MIGE+EN SA+LLAIK DVET+G+FI  LIKEVENA  TDIE+VV F KWLDDELSYLVDERAVLKHF+W EQKAD +REAAFGY D+KKL  EA
Subjt:  ANERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEA

Query:  SSFGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTV-YGCSKEQLIIQGVRF
        SSF  D RQ     LKKMQA LEKLEHGVYNL RMRESA  RY+       WMLD+G +SQIKL S KLA++YM+RVSAEL+TV  G  +E+LI+QGVRF
Subjt:  SSFGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTV-YGCSKEQLIIQGVRF

Query:  GFRVHQ
         FRVHQ
Subjt:  GFRVHQ

XP_038883847.1 protein CHUP1, chloroplastic [Benincasa hispida]6.4e-12456.13Show/hide
Query:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM
        TELLR+VEELRDRE+RLK +LLE+K LKES AIVPVLEN I  K+ EIERA   I  L+AE ER++ E+EEV Q +E ERR S+E +  M+ E    K+M
Subjt:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM

Query:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQ--------------ELKAYAKNGAIIERPNYSECNSEELAESALFNPISDRFTIPESP
          DR   +   ++DE S S++F+ LMEVS  SNLI NLK                 E     K     ERP +S CNSEELAE  L N  S    +P+ P
Subjt:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQ--------------ELKAYAKNGAIIERPNYSECNSEELAESALFNPISDRFTIPESP

Query:  ---------------KTTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSVPAPPPSKEEVDL---AKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTG
                        +T    D  K +   P VP KP+  P PP S S P PPP   +       KVRR+PEVVEFYHSLMR DSR+D  S+V D P+ 
Subjt:  ---------------KTTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSVPAPPPSKEEVDL---AKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTG

Query:  ANERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEA
        AN R+MIGE+EN SA+LLAIK DVET+G+FI  LIKEVENA  TDIE+VV F KWLDDELS+LVDERAVLKHF+W EQKAD +REAAFGY D+KKL  EA
Subjt:  ANERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEA

Query:  SSFGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTV-YGCSKEQLIIQGVRF
        SSF  D RQ     LKKMQA LEKLEHGVYNL RMRESAT RY+       WMLD+G + QIKL S KLA++YM+RVSAEL+TV  G  +E+LI+QGVRF
Subjt:  SSFGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTV-YGCSKEQLIIQGVRF

Query:  GFRVHQ
         FRVHQ
Subjt:  GFRVHQ

TrEMBL top hitse value%identityAlignment
A0A0A0KHU8 Uncharacterized protein1.5e-12355.51Show/hide
Query:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM
        TELLR+VEELRDRE+RLK +LLE+K LKES AIVPVLEN I  K+ EIERA   I  L+AE ER++ ++EE  Q +E ERR S+E +  M+ E    K+M
Subjt:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM

Query:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQ--------------ELKAYAKNGAIIERPNYSECNSEELAESALFNPISDRFTIPESP
          DR   +   ++DE S S++F+ LMEVS  SNLI NLK                 E     K     ERP +S CNSEELAES L N  S    +P+ P
Subjt:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQ--------------ELKAYAKNGAIIERPNYSECNSEELAESALFNPISDRFTIPESP

Query:  -----------------KTTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSV---PAPPPSKEEVDLAKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIP
                          +T    D  K +   P VP K +  P PP S S    P PPP  + +  AKVRR+PEVVEFYHSLMR DSR+D  S V + P
Subjt:  -----------------KTTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSV---PAPPPSKEEVDLAKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIP

Query:  TGANERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSF
        + AN R+MIGE+EN SA+LLAIK DVET+G+FI  LIKEVENA  TDIE+VV F KWLDDELS+LVDERAVLKHF+W EQKAD +REAAFGY D+KKL  
Subjt:  TGANERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSF

Query:  EASSFGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTV-YGCSKEQLIIQGV
        EASSF  D RQ     LKKMQA LEKLEHGVYNL RMRESA  RY+       WMLD G +SQIKL S KLA++YM+RVSAEL+TV  G  +E+LI+QGV
Subjt:  EASSFGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTV-YGCSKEQLIIQGV

Query:  RFGFRVHQ
        RF FRVHQ
Subjt:  RFGFRVHQ

A0A1S3AZH3 protein CHUP1, chloroplastic1.3e-12556.52Show/hide
Query:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM
        TELLR+VEELRDRE+RLK +LLE+K LKES AIVPVLEN I  K+ EIERA   I  L+AE ER++ ++EEV Q +E ERR S+E +  M+ E    K+M
Subjt:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM

Query:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQ--------------ELKAYAKNGAIIERPNYSECNSEELAESALFNPISDRFTIPESP
          DR   +   ++DE S S++F+ LMEVS  SNLI NLK                 E     K     ERP +S CNSEELAES L N  S    +P  P
Subjt:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQ--------------ELKAYAKNGAIIERPNYSECNSEELAESALFNPISDRFTIPESP

Query:  ---------------KTTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSVPAPPPSKEEVDL---AKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTG
                        +T    D  K +   P VP KP+  P PP S S P PPP   +      AKVRR+PEVVEFYHSLMR DSR+D  S V D P+ 
Subjt:  ---------------KTTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSVPAPPPSKEEVDL---AKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTG

Query:  ANERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEA
        AN R+MIGE+EN SA+LLAIK DVET+G+FI  LIKEVENA  TDIE+VV F KWLDDELSYLVDERAVLKHF+W EQKAD +REAAFGY D+KKL  EA
Subjt:  ANERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEA

Query:  SSFGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTV-YGCSKEQLIIQGVRF
        SSF  D RQ     LKKMQA LEKLEHGVYNL RMRESA  RY+       WMLD+G +SQIKL S KLA++YM+RVSAEL+TV  G  +E+LI+QGVRF
Subjt:  SSFGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTV-YGCSKEQLIIQGVRF

Query:  GFRVHQ
         FRVHQ
Subjt:  GFRVHQ

A0A5D3CMM2 Protein CHUP12.5e-12656.61Show/hide
Query:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM
        TELLR+VEELRDRE+RLK +LLE+K LKES AIVPVLEN I  K+ EIERA   I  L+AE ER++ ++EEV Q +E ERR S+E M  M+ E    K+M
Subjt:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM

Query:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQ--------------ELKAYAKNGAIIERPNYSECNSEELAESALFNPISDRFTIPESP
          DR   +   ++DE S S++F+ LMEVS  SNLI NLK                 E     K     ERP +S CNSEELAES L N  S    +P+ P
Subjt:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQ--------------ELKAYAKNGAIIERPNYSECNSEELAESALFNPISDRFTIPESP

Query:  ----------------KTTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSVPAPPPSKEEVDL---AKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPT
                         +T    D  K +   P VP KP+  P PP S S P PPP   +      AKVRR+PEVVEFYHSLMR DSR+D  S V D P+
Subjt:  ----------------KTTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSVPAPPPSKEEVDL---AKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPT

Query:  GANERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFE
         AN R+MIGE+EN SA+LLAIK DVET+G+FI  LIKEVENA  TDIE+VV F KWLDDELSYLVDERAVLKHF+W EQKAD +REAAFGY D+KKL  E
Subjt:  GANERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFE

Query:  ASSFGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTV-YGCSKEQLIIQGVR
        ASSF  D RQ     LKKMQA LEKLEHGVYNL RMRESA  RY+       WMLD+G +SQIKL S KLA++YM+RVSAEL+TV  G  +E+LI+QGVR
Subjt:  ASSFGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTV-YGCSKEQLIIQGVR

Query:  FGFRVHQ
        F FRVHQ
Subjt:  FGFRVHQ

A0A6J1ECF9 protein CHUP1, chloroplastic-like isoform X15.3e-12455.45Show/hide
Query:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM
        TELLR+VEELRDRE+RLK +LLE+K LKES AIVP+LEN I +K+ E+ERA   I  L+AE ER++ E+EEV Q  E +RR  +E +  M+ E    K+M
Subjt:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM

Query:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQELK-------------AYAKNGAII-ERPNYSECNSEELAESALFNPISDRFTIPESP
          DR   +   ++DE S S++F+ LMEVS  SNLI NLK   +                 AKN  ++ E P +S CNSEE AES L N  S    +P+ P
Subjt:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQELK-------------AYAKNGAII-ERPNYSECNSEELAESALFNPISDRFTIPESP

Query:  K---------------TTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSVPAPPPSKEEVDL-AKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTGAN
        K               +TV   D  K +   P VP K +  PS       P PPP K +  + AKVRR+PEVVEFYHSLMR DSR+D  S VMD P+ A 
Subjt:  K---------------TTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSVPAPPPSKEEVDL-AKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTGAN

Query:  ERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEASS
         R+MIGE+EN SA+LLAIK DVET+G+FI  LIKEVENA  TDIE+VV F KWLDDELSYLVDERAVLKHF+W EQKAD +REAAFGY D+KKL  EASS
Subjt:  ERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEASS

Query:  FGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTV--YGCSKEQLIIQGVRFG
        F  D RQ     LKKMQA LEKLEHG+YNL R+RESAT RY+       WMLDTG +SQIKL   KLA++YM+RVSAEL+TV   G  +E+LI+QGVRF 
Subjt:  FGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTV--YGCSKEQLIIQGVRFG

Query:  FRVHQ
        FRVHQ
Subjt:  FRVHQ

M5X6T3 Uncharacterized protein1.3e-12256.29Show/hide
Query:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM
        TELLRLVEELR+RESRLK ELLENK L+ES AIVPVLEN I  K  +IERA   ++ L+AE ER++ ++EEV   +E ERR S + +  M+ E    K+ 
Subjt:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM

Query:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQ--------------ELKAYAKNGAIIERPNYSECNSEELAESALFNPISDRFTIPESP
         SDR  A+   + DE S S++F+ LMEV+  SNLI NLK+G               E     +  A  ERP +S CNSEELAES L    S    +P+ P
Subjt:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQ--------------ELKAYAKNGAIIERPNYSECNSEELAESALFNPISDRFTIPESP

Query:  ----------KTTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSV---PAPPPSKEEVDLAKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTGANERN
                  K T      F    P P    K V  P PP S +    P PPP       AKVRRVPEVVEFYHSLMR DSR+D  S   D P  AN R+
Subjt:  ----------KTTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNSV---PAPPPSKEEVDLAKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTGANERN

Query:  MIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEASSFGD
        MIGE+EN SAYLLAIK DVET+G+FI  LIKEVENA  TDI++VV F KWLDDELSYLVDERAVLKHF+W EQKAD +REAAFGY D+KKL  EASSF D
Subjt:  MIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEASSFGD

Query:  DGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYRY------WMLDTGTISQIKLQSFKLAVEYMERVSAELDTV-YGCSKEQLIIQGVRFGFRVH
        D R     TLKKMQA LEKLEHGVYNL R+RESAT RY+       WMLDT  +SQIKL S KLA++YM+RVSAEL+ V  G  +E+LI+QGVRF FRVH
Subjt:  DGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYRY------WMLDTGTISQIKLQSFKLAVEYMERVSAELDTV-YGCSKEQLIIQGVRFGFRVH

Query:  Q
        Q
Subjt:  Q

SwissProt top hitse value%identityAlignment
Q9LI74 Protein CHUP1, chloroplastic6.7e-6048.06Show/hide
Query:  PAVPVKPVSQPSPPLSNSVPAPPPSKEEVDLA-----KVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTG---ANERNMIGEVENCSAYLLAIKMDVET
        P  P  P   P PP     P PPP    +        KV R PE+VEFY SLM+ +S+K+ + +++   TG   A   NMIGE+EN S +LLA+K DVET
Subjt:  PAVPVKPVSQPSPPLSNSVPAPPPSKEEVDLA-----KVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTG---ANERNMIGEVENCSAYLLAIKMDVET

Query:  KGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEASSFGDDGRQARNITLKKMQAWLEKLE
        +G+F+  L  EV  +  TDIE+++ F  WLD+ELS+LVDERAVLKHF+W E KAD +REAAF Y D+ KL  + +SF DD   +    LKKM   LEK+E
Subjt:  KGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEASSFGDDGRQARNITLKKMQAWLEKLE

Query:  HGVYNLCRMRESATMRYRY------WMLDTGTISQIKLQSFKLAVEYMERVSAELDTVYGCSK----EQLIIQGVRFGFRVHQ
          VY L R R+ A  RY+       W+ DTG + +IKL S +LA +YM+RV+ ELD+V G  K    E L++QGVRF FRVHQ
Subjt:  HGVYNLCRMRESATMRYRY------WMLDTGTISQIKLQSFKLAVEYMERVSAELDTVYGCSK----EQLIIQGVRFGFRVHQ

Q9LI74 Protein CHUP1, chloroplastic1.4e-0134.48Show/hide
Query:  LLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEE---VHQKMEGERRNSREM
        L +LV+EL +RE +L+ ELLE   LKE  + +  L+  + +K  EI+   I I  L+AE +++++EL +   V +++E  R   +E+
Subjt:  LLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEE---VHQKMEGERRNSREM

Arabidopsis top hitse value%identityAlignment
AT1G07120.1 FUNCTIONS IN: molecular_function unknown2.1e-4842.7Show/hide
Query:  PRPAVPVKPVSQPSPPLSNSVPAPPPSKEEVDLAKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTGANERNMIGEVENCSAYLLAIKMDVETKGEFIT
        P+P +  +  +   PP     P P PSK  +    VRR PEVVEFY +L + +S          + + A  RNMIGE+EN S YL  IK D +   + I 
Subjt:  PRPAVPVKPVSQPSPPLSNSVPAPPPSKEEVDLAKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTGANERNMIGEVENCSAYLLAIKMDVETKGEFIT

Query:  HLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHF-EWLEQKADEIREAAFGYDM-KKLSFEASSFGDDGRQARNITLKKMQAWLEKLEHGVYN
         LI +VE A  TDI  V TF KW+D+ELS LVDERAVLKHF +W E+K D +REAA  Y   K L  E  SF D+ + +    L+++Q+  ++LE  V N
Subjt:  HLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHF-EWLEQKADEIREAAFGYDM-KKLSFEASSFGDDGRQARNITLKKMQAWLEKLEHGVYN

Query:  LCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTVYGCSKEQLIIQGVRFGFRVHQ
          +MR+S   RY+       WMLDTG I Q+K  S +LA EYM+R++ EL++     +  L++QGVRF + +HQ
Subjt:  LCRMRESATMRYR------YWMLDTGTISQIKLQSFKLAVEYMERVSAELDTVYGCSKEQLIIQGVRFGFRVHQ

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein4.8e-6148.06Show/hide
Query:  PAVPVKPVSQPSPPLSNSVPAPPPSKEEVDLA-----KVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTG---ANERNMIGEVENCSAYLLAIKMDVET
        P  P  P   P PP     P PPP    +        KV R PE+VEFY SLM+ +S+K+ + +++   TG   A   NMIGE+EN S +LLA+K DVET
Subjt:  PAVPVKPVSQPSPPLSNSVPAPPPSKEEVDLA-----KVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTG---ANERNMIGEVENCSAYLLAIKMDVET

Query:  KGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEASSFGDDGRQARNITLKKMQAWLEKLE
        +G+F+  L  EV  +  TDIE+++ F  WLD+ELS+LVDERAVLKHF+W E KAD +REAAF Y D+ KL  + +SF DD   +    LKKM   LEK+E
Subjt:  KGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEASSFGDDGRQARNITLKKMQAWLEKLE

Query:  HGVYNLCRMRESATMRYRY------WMLDTGTISQIKLQSFKLAVEYMERVSAELDTVYGCSK----EQLIIQGVRFGFRVHQ
          VY L R R+ A  RY+       W+ DTG + +IKL S +LA +YM+RV+ ELD+V G  K    E L++QGVRF FRVHQ
Subjt:  HGVYNLCRMRESATMRYRY------WMLDTGTISQIKLQSFKLAVEYMERVSAELDTVYGCSK----EQLIIQGVRFGFRVHQ

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein1.0e-0234.48Show/hide
Query:  LLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEE---VHQKMEGERRNSREM
        L +LV+EL +RE +L+ ELLE   LKE  + +  L+  + +K  EI+   I I  L+AE +++++EL +   V +++E  R   +E+
Subjt:  LLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEE---VHQKMEGERRNSREM

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein4.8e-6148.06Show/hide
Query:  PAVPVKPVSQPSPPLSNSVPAPPPSKEEVDLA-----KVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTG---ANERNMIGEVENCSAYLLAIKMDVET
        P  P  P   P PP     P PPP    +        KV R PE+VEFY SLM+ +S+K+ + +++   TG   A   NMIGE+EN S +LLA+K DVET
Subjt:  PAVPVKPVSQPSPPLSNSVPAPPPSKEEVDLA-----KVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTG---ANERNMIGEVENCSAYLLAIKMDVET

Query:  KGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEASSFGDDGRQARNITLKKMQAWLEKLE
        +G+F+  L  EV  +  TDIE+++ F  WLD+ELS+LVDERAVLKHF+W E KAD +REAAF Y D+ KL  + +SF DD   +    LKKM   LEK+E
Subjt:  KGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEASSFGDDGRQARNITLKKMQAWLEKLE

Query:  HGVYNLCRMRESATMRYRY------WMLDTGTISQIKLQSFKLAVEYMERVSAELDTVYGCSK----EQLIIQGVRFGFRVHQ
          VY L R R+ A  RY+       W+ DTG + +IKL S +LA +YM+RV+ ELD+V G  K    E L++QGVRF FRVHQ
Subjt:  HGVYNLCRMRESATMRYRY------WMLDTGTISQIKLQSFKLAVEYMERVSAELDTVYGCSK----EQLIIQGVRFGFRVHQ

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein1.0e-0234.48Show/hide
Query:  LLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEE---VHQKMEGERRNSREM
        L +LV+EL +RE +L+ ELLE   LKE  + +  L+  + +K  EI+   I I  L+AE +++++EL +   V +++E  R   +E+
Subjt:  LLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEE---VHQKMEGERRNSREM

AT3G25690.3 Hydroxyproline-rich glycoprotein family protein4.8e-6148.06Show/hide
Query:  PAVPVKPVSQPSPPLSNSVPAPPPSKEEVDLA-----KVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTG---ANERNMIGEVENCSAYLLAIKMDVET
        P  P  P   P PP     P PPP    +        KV R PE+VEFY SLM+ +S+K+ + +++   TG   A   NMIGE+EN S +LLA+K DVET
Subjt:  PAVPVKPVSQPSPPLSNSVPAPPPSKEEVDLA-----KVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTG---ANERNMIGEVENCSAYLLAIKMDVET

Query:  KGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEASSFGDDGRQARNITLKKMQAWLEKLE
        +G+F+  L  EV  +  TDIE+++ F  WLD+ELS+LVDERAVLKHF+W E KAD +REAAF Y D+ KL  + +SF DD   +    LKKM   LEK+E
Subjt:  KGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIREAAFGY-DMKKLSFEASSFGDDGRQARNITLKKMQAWLEKLE

Query:  HGVYNLCRMRESATMRYRY------WMLDTGTISQIKLQSFKLAVEYMERVSAELDTVYGCSK----EQLIIQGVRFGFRVHQ
          VY L R R+ A  RY+       W+ DTG + +IKL S +LA +YM+RV+ ELD+V G  K    E L++QGVRF FRVHQ
Subjt:  HGVYNLCRMRESATMRYRY------WMLDTGTISQIKLQSFKLAVEYMERVSAELDTVYGCSK----EQLIIQGVRFGFRVHQ

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.0e-9546.93Show/hide
Query:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM
        +EL R VEELR+RE+ LK E LE K L+ES +++P+LE+ I  K  EI+       RL  + ER+++E +    + E  RR        M+ E V  +++
Subjt:  TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRM

Query:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQELKAYAKNGAIIERPNYS----------------------ECNSEELAESALFNPISD
         S        SDD   S S++F+ LM+VS  SNLI +LK    L+   +     E  N S                        NSEEL ES+  + +  
Subjt:  DSDRLNAKPASDDDESSGSRKFRRLMEVSVNSNLISNLKEGQELKAYAKNGAIIERPNYS----------------------ECNSEELAESALFNPISD

Query:  RF-TIPE-SPKTTVPIVDYF---------KTVAPRPAVPVKP-VSQPSPPLSNS------VPAPPPSKEEVDLAKVRRVPEVVEFYHSLMRGD---SRKD
        R   +P+  PK ++ + D           K++ P P  P  P + QP PP S S       P PPP    +  AKVRRVPEVVEFYHSLMR D   SR+D
Subjt:  RF-TIPE-SPKTTVPIVDYF---------KTVAPRPAVPVKP-VSQPSPPLSNS------VPAPPPSKEEVDLAKVRRVPEVVEFYHSLMRGD---SRKD

Query:  C----SSAVMDIPTGANERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIRE
             ++A   I   +N R+MIGE+EN S YLLAIK DVET+G+FI  LIKEV NA  +DIE+VV F KWLDDELSYLVDERAVLKHFEW EQKAD +RE
Subjt:  C----SSAVMDIPTGANERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSYLVDERAVLKHFEWLEQKADEIRE

Query:  AAFGY-DMKKLSFEASSFGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYRY------WMLDTGTISQIKLQSFKLAVEYMERVSAELDTVY
        AAF Y D+KKL  EAS F +D RQ+ +  LKKMQA  EKLEHGVY+L RMRESA  +++       WML+TG  SQIKL S KLA++YM+RVSAEL+ + 
Subjt:  AAFGY-DMKKLSFEASSFGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYRY------WMLDTGTISQIKLQSFKLAVEYMERVSAELDTVY

Query:  --GCSKEQLIIQGVRFGFRVHQ
          G  +E+LI+QGVRF FRVHQ
Subjt:  --GCSKEQLIIQGVRFGFRVHQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ACAGAGCTTCTACGCCTGGTTGAGGAGCTTCGCGACCGGGAGTCACGGTTGAAGGCTGAGCTTCTCGAGAACAAACACCTGAAGGAATCGTTTGCCATTGTTCCGGTGCT
TGAGAACGGGATTTATCTCAAGGAAACGGAAATTGAAAGAGCTTTGATTCTAATCAAACGCTTGAAGGCCGAAACTGAGAGGATAAAGAAAGAGTTGGAGGAAGTTCATC
AGAAGATGGAAGGAGAGAGGAGAAATAGTCGGGAAATGATGACAGTGATGGACGATGAGACGGTGCGATCGAAGAGGATGGATTCGGATCGTCTCAATGCGAAGCCGGCG
TCGGATGACGATGAGTCTTCTGGTTCACGAAAGTTTCGGAGACTGATGGAAGTGTCTGTTAATTCGAATCTGATTAGTAATTTGAAGGAAGGTCAAGAACTTAAAGCATA
TGCGAAGAACGGAGCAATTATTGAAAGACCGAATTACTCAGAGTGTAACTCGGAAGAACTCGCCGAGTCCGCACTCTTCAATCCAATATCTGACCGTTTTACGATTCCAG
AATCCCCGAAGACAACAGTTCCTATCGTCGATTACTTCAAAACTGTTGCACCACGGCCAGCAGTTCCGGTGAAGCCAGTATCGCAGCCTTCTCCTCCACTATCTAATTCA
GTTCCGGCACCGCCACCATCCAAAGAAGAGGTTGATCTGGCCAAAGTTAGGCGAGTGCCGGAGGTGGTGGAATTCTACCACTCTCTGATGCGGGGAGACTCCCGAAAAGA
CTGCAGCTCCGCCGTGATGGACATACCGACAGGCGCCAATGAACGTAACATGATCGGGGAGGTTGAAAACTGTTCTGCATATCTACTCGCCATAAAAATGGACGTAGAAA
CCAAAGGAGAGTTCATAACGCATTTGATCAAAGAAGTTGAGAATGCAGATCTTACGGATATTGAGAATGTTGTGACTTTTGCTAAATGGCTGGATGATGAGCTATCTTAC
CTGGTTGATGAACGAGCAGTGCTAAAACATTTTGAGTGGCTGGAGCAAAAGGCTGACGAGATACGTGAGGCTGCATTTGGTTATGATATGAAGAAGCTTTCATTCGAGGC
ATCATCTTTCGGTGACGATGGTCGACAGGCACGCAATATCACTCTCAAGAAGATGCAAGCTTGGCTTGAAAAGTTAGAGCATGGGGTTTACAATCTTTGTCGGATGAGAG
AATCTGCTACTATGAGATACAGGTATTGGATGCTTGATACCGGGACCATTAGCCAAATTAAGCTCCAATCCTTCAAATTGGCAGTGGAGTATATGGAACGAGTATCTGCA
GAACTTGATACGGTGTATGGCTGCTCCAAAGAACAGTTGATCATCCAAGGGGTAAGGTTCGGATTTCGAGTACATCAG
mRNA sequenceShow/hide mRNA sequence
ACAGAGCTTCTACGCCTGGTTGAGGAGCTTCGCGACCGGGAGTCACGGTTGAAGGCTGAGCTTCTCGAGAACAAACACCTGAAGGAATCGTTTGCCATTGTTCCGGTGCT
TGAGAACGGGATTTATCTCAAGGAAACGGAAATTGAAAGAGCTTTGATTCTAATCAAACGCTTGAAGGCCGAAACTGAGAGGATAAAGAAAGAGTTGGAGGAAGTTCATC
AGAAGATGGAAGGAGAGAGGAGAAATAGTCGGGAAATGATGACAGTGATGGACGATGAGACGGTGCGATCGAAGAGGATGGATTCGGATCGTCTCAATGCGAAGCCGGCG
TCGGATGACGATGAGTCTTCTGGTTCACGAAAGTTTCGGAGACTGATGGAAGTGTCTGTTAATTCGAATCTGATTAGTAATTTGAAGGAAGGTCAAGAACTTAAAGCATA
TGCGAAGAACGGAGCAATTATTGAAAGACCGAATTACTCAGAGTGTAACTCGGAAGAACTCGCCGAGTCCGCACTCTTCAATCCAATATCTGACCGTTTTACGATTCCAG
AATCCCCGAAGACAACAGTTCCTATCGTCGATTACTTCAAAACTGTTGCACCACGGCCAGCAGTTCCGGTGAAGCCAGTATCGCAGCCTTCTCCTCCACTATCTAATTCA
GTTCCGGCACCGCCACCATCCAAAGAAGAGGTTGATCTGGCCAAAGTTAGGCGAGTGCCGGAGGTGGTGGAATTCTACCACTCTCTGATGCGGGGAGACTCCCGAAAAGA
CTGCAGCTCCGCCGTGATGGACATACCGACAGGCGCCAATGAACGTAACATGATCGGGGAGGTTGAAAACTGTTCTGCATATCTACTCGCCATAAAAATGGACGTAGAAA
CCAAAGGAGAGTTCATAACGCATTTGATCAAAGAAGTTGAGAATGCAGATCTTACGGATATTGAGAATGTTGTGACTTTTGCTAAATGGCTGGATGATGAGCTATCTTAC
CTGGTTGATGAACGAGCAGTGCTAAAACATTTTGAGTGGCTGGAGCAAAAGGCTGACGAGATACGTGAGGCTGCATTTGGTTATGATATGAAGAAGCTTTCATTCGAGGC
ATCATCTTTCGGTGACGATGGTCGACAGGCACGCAATATCACTCTCAAGAAGATGCAAGCTTGGCTTGAAAAGTTAGAGCATGGGGTTTACAATCTTTGTCGGATGAGAG
AATCTGCTACTATGAGATACAGGTATTGGATGCTTGATACCGGGACCATTAGCCAAATTAAGCTCCAATCCTTCAAATTGGCAGTGGAGTATATGGAACGAGTATCTGCA
GAACTTGATACGGTGTATGGCTGCTCCAAAGAACAGTTGATCATCCAAGGGGTAAGGTTCGGATTTCGAGTACATCAG
Protein sequenceShow/hide protein sequence
TELLRLVEELRDRESRLKAELLENKHLKESFAIVPVLENGIYLKETEIERALILIKRLKAETERIKKELEEVHQKMEGERRNSREMMTVMDDETVRSKRMDSDRLNAKPA
SDDDESSGSRKFRRLMEVSVNSNLISNLKEGQELKAYAKNGAIIERPNYSECNSEELAESALFNPISDRFTIPESPKTTVPIVDYFKTVAPRPAVPVKPVSQPSPPLSNS
VPAPPPSKEEVDLAKVRRVPEVVEFYHSLMRGDSRKDCSSAVMDIPTGANERNMIGEVENCSAYLLAIKMDVETKGEFITHLIKEVENADLTDIENVVTFAKWLDDELSY
LVDERAVLKHFEWLEQKADEIREAAFGYDMKKLSFEASSFGDDGRQARNITLKKMQAWLEKLEHGVYNLCRMRESATMRYRYWMLDTGTISQIKLQSFKLAVEYMERVSA
ELDTVYGCSKEQLIIQGVRFGFRVHQ