; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015918 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015918
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionprotein CHUP1, chloroplastic
Genome locationtig00006297:680800..683309
RNA-Seq ExpressionSgr015918
SyntenySgr015918
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0009707 - chloroplast outer membrane (cellular component)
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060029.1 protein CHUP1 [Cucumis melo var. makuwa]1.4e-12961.4Show/hide
Query:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG
        MP+E+D+++AMEIN L+K LEI++ KS FLE+ENQELR E+ RLKSQIQSLKA +NERKSILWKKFH+SMD++V  ADS P  P   +            
Subjt:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG

Query:  DFPETTDKREPTRSPKQLPPITAWAVVKENQR-----KPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGE
              DKRE T+ PKQ    ++W  VKE+QR       AP PPPPPLP KLLGGSKAVRRVPEVL+LYR+LTKRDAQKENK   GG P VAF+KNMIGE
Subjt:  DFPETTDKREPTRSPKQLPPITAWAVVKENQR-----KPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGE

Query:  IENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK
        IENRSAYLSAIKSEVETHGEFVNWLIKEVE  APR+I++VE+FVKWLD +LA+LVDERAVLKHFPRWPE KADALREAAFSYRDLK LES+V  F+DNPK
Subjt:  IENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK

Query:  EEMGVVVKRAQALQDRRE----SWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPA
        EEM VV+KRAQALQDRRE    S  +++           +  + F+ P    +  A   ++   K+        +VE IEA +GIH +DNKRTT NRN  
Subjt:  EEMGVVVKRAQALQDRRE----SWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPA

Query:  PRKPLSSRCSICVQGSPEPFQYAGGFDSEAIVAFEGLRKVGLS
         RKP   R S+C+QGS      +GGFDSEAI AFEGL+K GLS
Subjt:  PRKPLSSRCSICVQGSPEPFQYAGGFDSEAIVAFEGLRKVGLS

TYJ97286.1 protein CHUP1 [Cucumis melo var. makuwa]3.4e-12862.64Show/hide
Query:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG
        MP+E+D+++AMEI+ L+K LEI++ KS FLE+ENQELR E+ RLKSQIQSLKA +NERKSILWKKFH+SMD++V  ADS P  P   + AA         
Subjt:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG

Query:  DFPETTDKREPTRSPKQLPPITAWAVVKENQR-----KPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGE
              DKRE T+ PKQ    ++W  VKE+QR       AP PPPPPLP KLLGGSKAVRRVPEVL+LYR+LTKRDAQKENK   GG P VAF+KNMIGE
Subjt:  DFPETTDKREPTRSPKQLPPITAWAVVKENQR-----KPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGE

Query:  IENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK
        IENRSAYLSAIKSEVETHGEFVNWLIKEVE  APR+I+E E+FVKWLD +LA+LVDERAVLKHFPRWPE KADALREAAFSYRDLK LES+V  F+DNPK
Subjt:  IENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK

Query:  EEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKP
        EEM VV+KRAQALQDR E       M R R  +  +  + F+ P   C  +  S    Q      +   ++VE IEA +GIH +DNKRTT NRN   RKP
Subjt:  EEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKP

Query:  LSSRCSICVQGSPEPFQYAGGFDSEAIVAFEGLRKVGLS
           R S+C+QGS      +GGFDSEAI AFEGL+K GLS
Subjt:  LSSRCSICVQGSPEPFQYAGGFDSEAIVAFEGLRKVGLS

XP_022150972.1 protein CHUP1, chloroplastic [Momordica charantia]1.3e-12460.59Show/hide
Query:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG
        MPQEED+++AMEI SLRK+L+IAV KS+FLEKENQELRQE+GRLKSQIQSLKAH+N+RKS+LWKKF+NSMD                             
Subjt:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG

Query:  DFPETTDKREPTRSPKQLPPITAWAVVKENQR------KPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIG
        + P  TDKRE T+S  + P    W  VKE+QR       PAPAPPPPPLPTKLL GSKAVRRVPEVLELYRSLTKRDAQKENK   GG+PAVAF+KNMIG
Subjt:  DFPETTDKREPTRSPKQLPPITAWAVVKENQR------KPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIG

Query:  EIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNP
        EIENRSAYL+AIKSEVETHGEFVNWLIKEVE AAPR+ITEVERFV WLD EL +LVDERAVLKHFPRWPEGKADALREAAFSYRDLK LESEV SF+DNP
Subjt:  EIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNP

Query:  KEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASS-VRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPR
        KEEMGVV+KRAQALQDR E     +   R    +  R+   F+ P    W   S  V  M++ +L  +    +  +   ++ + + DN +   N      
Subjt:  KEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASS-VRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPR

Query:  KPLSSRCSICVQG---SPEPFQYAGGFDSEAIVAFEGLRKVGLS
                + +QG   +    QYAGGFDS+AI AFEGL+KVGLS
Subjt:  KPLSSRCSICVQG---SPEPFQYAGGFDSEAIVAFEGLRKVGLS

XP_022998607.1 protein CHUP1, chloroplastic [Cucurbita maxima]3.6e-12259.64Show/hide
Query:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG
        MP EED+++AMEI++L+++LEI++ KSNFLEKENQEL+QE+ R KS +QSLK H+N+RKSILWKKFHNSMDV+V   DSSPQ PP               
Subjt:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG

Query:  DFPETTDKREPTRSPKQLPPITAWAVVKENQR----KPAPA-PPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGE
             TDK E TR+ KQ    + WAVVKENQR     P PA PPPPPLPTKLLGGSKAVRRVPEVLELYR +TKRDAQKENK   GG+PAVAF+KNMIGE
Subjt:  DFPETTDKREPTRSPKQLPPITAWAVVKENQR----KPAPA-PPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGE

Query:  IENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK
        IENRSAYLSAIKSEVETHGEFVN LI+EVEAAAPR+I EVERFVKWLDGELA+LVDERAVLKHFPRWPEGKADALREAAFSY+DLK LE+EV SF++NPK
Subjt:  IENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK

Query:  EEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANG-----CWTLASSVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNP
        EE   ++KRAQALQDR E           +  S V  TR F           C  +  S    Q+K  +  L  + +  I           K    N  P
Subjt:  EEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANG-----CWTLASSVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNP

Query:  APRKPLSSRCSICVQG---SPEPFQYAGGFDSEAIVAFEGLRKVGL
                  ++ +QG   +    QYAGGFDSEAIVAFEG+++VGL
Subjt:  APRKPLSSRCSICVQG---SPEPFQYAGGFDSEAIVAFEGLRKVGL

XP_023523072.1 protein CHUP1, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo]2.1e-12259.87Show/hide
Query:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG
        MP EED+++AMEI++L+++LEI++ KSNFLEKENQEL+QE+ R KS IQSLKAH+N+RKSILWKKFHNSMDV+V   DSSPQ PP               
Subjt:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG

Query:  DFPETTDKREPTRSPKQLPPITAWAVVKENQR----KPAPA-PPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGE
             TDK E TR+ KQ    + WAVVKENQR     P PA PPPPPLPTKLLGGSKAVRRVPEVLELYR +TKRDAQKENK   GG+PAVAF+KNMIGE
Subjt:  DFPETTDKREPTRSPKQLPPITAWAVVKENQR----KPAPA-PPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGE

Query:  IENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK
        IENRSAYLSAIKSEVETHGEFVN LI+EVEAAAPR+I EVERFVKWLDGEL +LVDERAVLKHFPRWPEGKADALREAAFSY+DLK LE+EV SF++NPK
Subjt:  IENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK

Query:  EEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANG-----CWTLASSVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNP
        EE   ++KRAQALQDR E           +  S V  TR F           C  +  S    Q+K  +  L  + +  I           K    N  P
Subjt:  EEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANG-----CWTLASSVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNP

Query:  APRKPLSSRCSICVQG---SPEPFQYAGGFDSEAIVAFEGLRKVGL
                  ++ +QG   +    QYAGGFDSEAIVAFEG+++VGL
Subjt:  APRKPLSSRCSICVQG---SPEPFQYAGGFDSEAIVAFEGLRKVGL

TrEMBL top hitse value%identityAlignment
A0A0A0LVK7 Uncharacterized protein4.8e-12060.18Show/hide
Query:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG
        MP+EED+ +AMEIN L+K+LEI++ KS FLEKENQELRQE+ RL+SQIQS KA +NERKSILWKKFH+S+D+SV  ADS P  P   +            
Subjt:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG

Query:  DFPETTDKREPTRSPKQLPPITAWAVVKENQRK---PA--PAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGE
              DKRE T+SPKQ    ++W  VKE+ R    PA  P PPPPPLPTKLLGGSKAVRRVPEVLELYR+LTKRDAQKENK   GG PAVAF+KNMIGE
Subjt:  DFPETTDKREPTRSPKQLPPITAWAVVKENQRK---PA--PAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGE

Query:  IENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK
        IENRSAYLSAIKSEVETHG+FVNWLIKEVE  APR+I+EVERFVKWLDG+LA+LVDERAVLK+FPRWPE KADALREAAFSYRDLKGLES+V  F+DNPK
Subjt:  IENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK

Query:  EEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKP
        EEM VV+KRAQALQDR E       M R R  +  R  + F+ P   C  +  S    QIK  T  L  +   +I  ++ + + +  +            
Subjt:  EEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKP

Query:  LSSRCSICVQGSPEPF---QYAGGFDSEAIVAFEGLRKVGLS
           R ++ +QG+   +   QYAGGFDSE I AFEGL+K GLS
Subjt:  LSSRCSICVQGSPEPF---QYAGGFDSEAIVAFEGLRKVGLS

A0A5A7V2M1 Protein CHUP16.7e-13061.4Show/hide
Query:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG
        MP+E+D+++AMEIN L+K LEI++ KS FLE+ENQELR E+ RLKSQIQSLKA +NERKSILWKKFH+SMD++V  ADS P  P   +            
Subjt:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG

Query:  DFPETTDKREPTRSPKQLPPITAWAVVKENQR-----KPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGE
              DKRE T+ PKQ    ++W  VKE+QR       AP PPPPPLP KLLGGSKAVRRVPEVL+LYR+LTKRDAQKENK   GG P VAF+KNMIGE
Subjt:  DFPETTDKREPTRSPKQLPPITAWAVVKENQR-----KPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGE

Query:  IENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK
        IENRSAYLSAIKSEVETHGEFVNWLIKEVE  APR+I++VE+FVKWLD +LA+LVDERAVLKHFPRWPE KADALREAAFSYRDLK LES+V  F+DNPK
Subjt:  IENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK

Query:  EEMGVVVKRAQALQDRRE----SWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPA
        EEM VV+KRAQALQDRRE    S  +++           +  + F+ P    +  A   ++   K+        +VE IEA +GIH +DNKRTT NRN  
Subjt:  EEMGVVVKRAQALQDRRE----SWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPA

Query:  PRKPLSSRCSICVQGSPEPFQYAGGFDSEAIVAFEGLRKVGLS
         RKP   R S+C+QGS      +GGFDSEAI AFEGL+K GLS
Subjt:  PRKPLSSRCSICVQGSPEPFQYAGGFDSEAIVAFEGLRKVGLS

A0A5D3BE56 Protein CHUP11.6e-12862.64Show/hide
Query:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG
        MP+E+D+++AMEI+ L+K LEI++ KS FLE+ENQELR E+ RLKSQIQSLKA +NERKSILWKKFH+SMD++V  ADS P  P   + AA         
Subjt:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG

Query:  DFPETTDKREPTRSPKQLPPITAWAVVKENQR-----KPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGE
              DKRE T+ PKQ    ++W  VKE+QR       AP PPPPPLP KLLGGSKAVRRVPEVL+LYR+LTKRDAQKENK   GG P VAF+KNMIGE
Subjt:  DFPETTDKREPTRSPKQLPPITAWAVVKENQR-----KPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGE

Query:  IENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK
        IENRSAYLSAIKSEVETHGEFVNWLIKEVE  APR+I+E E+FVKWLD +LA+LVDERAVLKHFPRWPE KADALREAAFSYRDLK LES+V  F+DNPK
Subjt:  IENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK

Query:  EEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKP
        EEM VV+KRAQALQDR E       M R R  +  +  + F+ P   C  +  S    Q      +   ++VE IEA +GIH +DNKRTT NRN   RKP
Subjt:  EEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKP

Query:  LSSRCSICVQGSPEPFQYAGGFDSEAIVAFEGLRKVGLS
           R S+C+QGS      +GGFDSEAI AFEGL+K GLS
Subjt:  LSSRCSICVQGSPEPFQYAGGFDSEAIVAFEGLRKVGLS

A0A6J1DC83 protein CHUP1, chloroplastic6.5e-12560.59Show/hide
Query:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG
        MPQEED+++AMEI SLRK+L+IAV KS+FLEKENQELRQE+GRLKSQIQSLKAH+N+RKS+LWKKF+NSMD                             
Subjt:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG

Query:  DFPETTDKREPTRSPKQLPPITAWAVVKENQR------KPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIG
        + P  TDKRE T+S  + P    W  VKE+QR       PAPAPPPPPLPTKLL GSKAVRRVPEVLELYRSLTKRDAQKENK   GG+PAVAF+KNMIG
Subjt:  DFPETTDKREPTRSPKQLPPITAWAVVKENQR------KPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIG

Query:  EIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNP
        EIENRSAYL+AIKSEVETHGEFVNWLIKEVE AAPR+ITEVERFV WLD EL +LVDERAVLKHFPRWPEGKADALREAAFSYRDLK LESEV SF+DNP
Subjt:  EIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNP

Query:  KEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASS-VRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPR
        KEEMGVV+KRAQALQDR E     +   R    +  R+   F+ P    W   S  V  M++ +L  +    +  +   ++ + + DN +   N      
Subjt:  KEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASS-VRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPR

Query:  KPLSSRCSICVQG---SPEPFQYAGGFDSEAIVAFEGLRKVGLS
                + +QG   +    QYAGGFDS+AI AFEGL+KVGLS
Subjt:  KPLSSRCSICVQG---SPEPFQYAGGFDSEAIVAFEGLRKVGLS

A0A6J1K8G4 protein CHUP1, chloroplastic1.8e-12259.64Show/hide
Query:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG
        MP EED+++AMEI++L+++LEI++ KSNFLEKENQEL+QE+ R KS +QSLK H+N+RKSILWKKFHNSMDV+V   DSSPQ PP               
Subjt:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG

Query:  DFPETTDKREPTRSPKQLPPITAWAVVKENQR----KPAPA-PPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGE
             TDK E TR+ KQ    + WAVVKENQR     P PA PPPPPLPTKLLGGSKAVRRVPEVLELYR +TKRDAQKENK   GG+PAVAF+KNMIGE
Subjt:  DFPETTDKREPTRSPKQLPPITAWAVVKENQR----KPAPA-PPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGE

Query:  IENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK
        IENRSAYLSAIKSEVETHGEFVN LI+EVEAAAPR+I EVERFVKWLDGELA+LVDERAVLKHFPRWPEGKADALREAAFSY+DLK LE+EV SF++NPK
Subjt:  IENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK

Query:  EEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANG-----CWTLASSVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNP
        EE   ++KRAQALQDR E           +  S V  TR F           C  +  S    Q+K  +  L  + +  I           K    N  P
Subjt:  EEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANG-----CWTLASSVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNP

Query:  APRKPLSSRCSICVQG---SPEPFQYAGGFDSEAIVAFEGLRKVGL
                  ++ +QG   +    QYAGGFDSEAIVAFEG+++VGL
Subjt:  APRKPLSSRCSICVQG---SPEPFQYAGGFDSEAIVAFEGLRKVGL

SwissProt top hitse value%identityAlignment
Q9LI74 Protein CHUP1, chloroplastic1.1e-3939.29Show/hide
Query:  PAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKG---NAGGYPAVAFSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPR
        P P PPPP    +  GG   V R PE++E Y+SL KR+++KE      ++G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A++  
Subjt:  PAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKG---NAGGYPAVAFSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPR

Query:  EITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPKEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVV
        +I ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P       +K+   L ++ E    AL   R R  ++ 
Subjt:  EITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPKEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVV

Query:  RSTRVFKSPANGCWTLASSV------RHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQGSPEPF---QYAGGFDSEA
        R  + F  P +  W   + V        +Q+        A E++ +  S             +++P       +R  + +QG    F   Q+AGGFD+E+
Subjt:  RSTRVFKSPANGCWTLASSV------RHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQGSPEPF---QYAGGFDSEA

Query:  IVAFEGLR
        + AFE LR
Subjt:  IVAFEGLR

Arabidopsis top hitse value%identityAlignment
AT1G07120.1 FUNCTIONS IN: molecular_function unknown9.4e-6847.92Show/hide
Query:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG
        +P  EDD    ++  L K+L+  + +++ LEKEN ELRQEV RL++Q+ +LK+H NERKS+LWKK  +S D S T  D S  K PE              
Subjt:  MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTG

Query:  DFPETTDKREPTRSPKQLPPITAWAVVKENQRKPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGEIENRS
           ++  K +  R+P   P I       + Q      PPPPPLP+K   G ++VRR PEV+E YR+LTKR++   NK N  G  + AF++NMIGEIENRS
Subjt:  DFPETTDKREPTRSPKQLPPITAWAVVKENQRKPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGEIENRS

Query:  AYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPKEEMGV
         YLS IKS+ + H + ++ LI +VEAA   +I+EVE FVKW+D EL++LVDERAVLKHFP+WPE K D+LREAA +Y+  K L +E+ SFKDNPK+ +  
Subjt:  AYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPKEEMGV

Query:  VVKRAQALQDRRE
         ++R Q+LQDR E
Subjt:  VVKRAQALQDRRE

AT1G48280.1 hydroxyproline-rich glycoprotein family protein1.4e-4230.93Show/hide
Query:  DMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTGDFPETTD
        D+ +++ +L+ +LE A   +  LE  N++L Q++   +++I SL ++    K     +F    D+    A    Q   ++  A             E++ 
Subjt:  DMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTGDFPETTD

Query:  KREPTRSPKQLPPI---------TAWAVVKENQRK-----PAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFS--KN
           P+ SP +LPP           A ++ K ++       P P PPPPP P + L  +   ++ P V +L++ L K+D  +    +  G  +   S   +
Subjt:  KREPTRSPKQLPPI---------TAWAVVKENQRK-----PAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFS--KN

Query:  MIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFK
        ++GEI+NRSA+L AIK+++ET GEF+N LI++V      ++ +V +FV WLD ELA L DERAVLKHF +WPE KAD L+EAA  YR+LK LE E+ S+ 
Subjt:  MIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFK

Query:  DNPKEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLAS----SVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFN
        D+P    GV +K+   L D+ E   R L   RG   S +RS + FK P    W L S     ++   IK L  +        +++++ +  E  K     
Subjt:  DNPKEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLAS----SVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFN

Query:  RNPAPRKPLSSRCSICVQGSPEPFQYAGGFDSEAIVAFEGLRK
        +               V+ +    Q+AGG D E + A E +++
Subjt:  RNPAPRKPLSSRCSICVQGSPEPFQYAGGFDSEAIVAFEGLRK

AT3G25690.1 Hydroxyproline-rich glycoprotein family protein7.5e-4139.29Show/hide
Query:  PAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKG---NAGGYPAVAFSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPR
        P P PPPP    +  GG   V R PE++E Y+SL KR+++KE      ++G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A++  
Subjt:  PAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKG---NAGGYPAVAFSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPR

Query:  EITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPKEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVV
        +I ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P       +K+   L ++ E    AL   R R  ++ 
Subjt:  EITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPKEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVV

Query:  RSTRVFKSPANGCWTLASSV------RHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQGSPEPF---QYAGGFDSEA
        R  + F  P +  W   + V        +Q+        A E++ +  S             +++P       +R  + +QG    F   Q+AGGFD+E+
Subjt:  RSTRVFKSPANGCWTLASSV------RHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQGSPEPF---QYAGGFDSEA

Query:  IVAFEGLR
        + AFE LR
Subjt:  IVAFEGLR

AT3G25690.2 Hydroxyproline-rich glycoprotein family protein7.5e-4139.29Show/hide
Query:  PAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKG---NAGGYPAVAFSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPR
        P P PPPP    +  GG   V R PE++E Y+SL KR+++KE      ++G   + A   NMIGEIENRS +L A+K++VET G+FV  L  EV A++  
Subjt:  PAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKG---NAGGYPAVAFSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPR

Query:  EITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPKEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVV
        +I ++  FV WLD EL+ LVDERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P       +K+   L ++ E    AL   R R  ++ 
Subjt:  EITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPKEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVV

Query:  RSTRVFKSPANGCWTLASSV------RHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQGSPEPF---QYAGGFDSEA
        R  + F  P +  W   + V        +Q+        A E++ +  S             +++P       +R  + +QG    F   Q+AGGFD+E+
Subjt:  RSTRVFKSPANGCWTLASSV------RHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQGSPEPF---QYAGGFDSEA

Query:  IVAFEGLR
        + AFE LR
Subjt:  IVAFEGLR

AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.2e-5034.91Show/hide
Query:  EDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSI-----------------LWKKFHNSMDVSVTFADSS-------
        E DD A+ ++   + L     KSN +        + VG L++  + +    N  KSI                  + +  NS +++ + + S+       
Subjt:  EDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSI-----------------LWKKFHNSMDVSVTFADSS-------

Query:  -PQKPPEQSPAANDKPIRRTGDFPETTDKREPTRSPKQLPPITAWAVVKENQRKPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGN
         P+ PP++S +  D    R    P+   K  P   P   PP+        +  K  P PPPPP P  L   S  VRRVPEV+E Y SL +RD+    + +
Subjt:  -PQKPPEQSPAANDKPIRRTGDFPETTDKREPTRSPKQLPPITAWAVVKENQRKPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGN

Query:  AGGYPAVA-------FSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALRE
         GG  A A        +++MIGEIENRS YL AIK++VET G+F+ +LIKEV  AA  +I +V  FVKWLD EL+ LVDERAVLKHF  WPE KADALRE
Subjt:  AGGYPAVA-------FSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALRE

Query:  AAFSYRDLKGLESEVRSFKDNPKEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSVRHMQIKNLTWSLFADEVELI--
        AAF Y DLK L SE   F+++P++     +K+ QAL ++ E    +L   R   ++  +S   F+ P +  W L + +   QIK  +  L    ++ +  
Subjt:  AAFSYRDLKGLESEVRSFKDNPKEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSVRHMQIKNLTWSLFADEVELI--

Query:  --EASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQGSPEPF---QYAGGFDSEAIVAFEGLR
          EA +G   E+ +                   + VQG    F   Q+AGGFD+E + AFE LR
Subjt:  --EASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQGSPEPF---QYAGGFDSEAIVAFEGLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGCAGGAAGAAGATGATGATATGGCCATGGAGATCAACAGCTTGAGGAAACAACTGGAAATTGCTGTGGGGAAATCCAACTTTCTCGAGAAAGAGAATCAAGAACT
GAGACAAGAAGTTGGTCGTCTGAAATCTCAGATTCAGTCTCTGAAAGCCCACAGCAATGAGAGAAAATCCATTCTCTGGAAGAAATTCCACAACTCCATGGATGTCTCCG
TCACGTTCGCCGACTCGTCGCCGCAGAAGCCACCGGAGCAGAGTCCCGCGGCGAATGATAAACCGATAAGAAGAACCGGAGACTTCCCGGAAACAACCGATAAACGGGAG
CCAACCAGATCGCCGAAACAGCTTCCTCCGATAACTGCTTGGGCCGTCGTGAAAGAGAACCAGAGAAAGCCGGCTCCGGCTCCGCCCCCACCTCCGCTTCCGACGAAGCT
CCTCGGCGGATCAAAGGCAGTGCGTCGAGTCCCGGAAGTGCTGGAGCTGTACCGTTCACTGACGAAACGAGACGCCCAGAAGGAAAACAAGGGCAACGCGGGAGGATATC
CGGCGGTGGCATTCAGCAAAAACATGATCGGAGAGATCGAGAACCGGTCAGCGTATCTGTCAGCGATAAAATCGGAGGTGGAGACGCATGGGGAGTTCGTGAACTGGCTG
ATAAAGGAAGTGGAAGCGGCGGCGCCGAGGGAGATAACGGAGGTGGAGAGGTTCGTGAAGTGGCTGGACGGGGAGCTGGCGGCGCTGGTGGACGAGAGGGCGGTGCTGAA
GCACTTCCCGCGGTGGCCGGAGGGGAAGGCGGATGCGCTGCGGGAGGCGGCGTTCAGTTACAGGGATCTGAAGGGGCTGGAGAGTGAAGTGCGTTCGTTCAAAGACAATC
CGAAGGAGGAGATGGGTGTGGTTGTGAAGAGGGCTCAGGCGCTGCAAGACAGGCGAGAAAGTTGGAGCAGAGCGTTGGGAATGTGGAGAGGACGAGGGAGTTCAGTTGTA
AGAAGTACGAGAGTTTTCAAATCCCCTGCGAATGGATGTTGGACTCTGGCCTCCTCGGTCAGGCACATGCAAATAAAAAACCTAACTTGGTCATTGTTTGCAGATGAAGT
TGAGCTCATTGAGGCTAGCCAAGGAATACATGCGGAGGATAACAAGAGAACTACATTCAACCGAAACCCCGCACCCAGAAAACCTCTTTCTTCAAGGTGTTCGATTTGCG
TACAGGGTTCACCAGAACCTTTTCAGTACGCAGGTGGTTTTGATTCGGAGGCTATAGTGGCATTCGAAGGGCTGAGGAAAGTCGGGCTGAGCTGGGGAGGAGCTATTGTA
AGAATTAGCATTGCAGAGCACATTCAACAAAAAGATGTGATACAAATGCTTGATCAAAAAAATTGGGAATTGTATACCTTAATCAAATCTATCGCAACTTGTTCAAACCA
TCGCAAGGCAAAGGCAAATGTCATAACAGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGCCGCAGGAAGAAGATGATGATATGGCCATGGAGATCAACAGCTTGAGGAAACAACTGGAAATTGCTGTGGGGAAATCCAACTTTCTCGAGAAAGAGAATCAAGAACT
GAGACAAGAAGTTGGTCGTCTGAAATCTCAGATTCAGTCTCTGAAAGCCCACAGCAATGAGAGAAAATCCATTCTCTGGAAGAAATTCCACAACTCCATGGATGTCTCCG
TCACGTTCGCCGACTCGTCGCCGCAGAAGCCACCGGAGCAGAGTCCCGCGGCGAATGATAAACCGATAAGAAGAACCGGAGACTTCCCGGAAACAACCGATAAACGGGAG
CCAACCAGATCGCCGAAACAGCTTCCTCCGATAACTGCTTGGGCCGTCGTGAAAGAGAACCAGAGAAAGCCGGCTCCGGCTCCGCCCCCACCTCCGCTTCCGACGAAGCT
CCTCGGCGGATCAAAGGCAGTGCGTCGAGTCCCGGAAGTGCTGGAGCTGTACCGTTCACTGACGAAACGAGACGCCCAGAAGGAAAACAAGGGCAACGCGGGAGGATATC
CGGCGGTGGCATTCAGCAAAAACATGATCGGAGAGATCGAGAACCGGTCAGCGTATCTGTCAGCGATAAAATCGGAGGTGGAGACGCATGGGGAGTTCGTGAACTGGCTG
ATAAAGGAAGTGGAAGCGGCGGCGCCGAGGGAGATAACGGAGGTGGAGAGGTTCGTGAAGTGGCTGGACGGGGAGCTGGCGGCGCTGGTGGACGAGAGGGCGGTGCTGAA
GCACTTCCCGCGGTGGCCGGAGGGGAAGGCGGATGCGCTGCGGGAGGCGGCGTTCAGTTACAGGGATCTGAAGGGGCTGGAGAGTGAAGTGCGTTCGTTCAAAGACAATC
CGAAGGAGGAGATGGGTGTGGTTGTGAAGAGGGCTCAGGCGCTGCAAGACAGGCGAGAAAGTTGGAGCAGAGCGTTGGGAATGTGGAGAGGACGAGGGAGTTCAGTTGTA
AGAAGTACGAGAGTTTTCAAATCCCCTGCGAATGGATGTTGGACTCTGGCCTCCTCGGTCAGGCACATGCAAATAAAAAACCTAACTTGGTCATTGTTTGCAGATGAAGT
TGAGCTCATTGAGGCTAGCCAAGGAATACATGCGGAGGATAACAAGAGAACTACATTCAACCGAAACCCCGCACCCAGAAAACCTCTTTCTTCAAGGTGTTCGATTTGCG
TACAGGGTTCACCAGAACCTTTTCAGTACGCAGGTGGTTTTGATTCGGAGGCTATAGTGGCATTCGAAGGGCTGAGGAAAGTCGGGCTGAGCTGGGGAGGAGCTATTGTA
AGAATTAGCATTGCAGAGCACATTCAACAAAAAGATGTGATACAAATGCTTGATCAAAAAAATTGGGAATTGTATACCTTAATCAAATCTATCGCAACTTGTTCAAACCA
TCGCAAGGCAAAGGCAAATGTCATAACAGCATAA
Protein sequenceShow/hide protein sequence
MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTGDFPETTDKRE
PTRSPKQLPPITAWAVVKENQRKPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGEIENRSAYLSAIKSEVETHGEFVNWL
IKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPKEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVV
RSTRVFKSPANGCWTLASSVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQGSPEPFQYAGGFDSEAIVAFEGLRKVGLSWGGAIV
RISIAEHIQQKDVIQMLDQKNWELYTLIKSIATCSNHRKAKANVITA