; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g33080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g33080
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionChlorophyll a-b binding protein, chloroplastic
Genome locationchr5:24667152..24675424
RNA-Seq ExpressionMoc05g33080
SyntenyMoc05g33080
Gene Ontology termsGO:0006465 - signal peptide processing (biological process)
GO:0006511 - ubiquitin-dependent protein catabolic process (biological process)
GO:0009651 - response to salt stress (biological process)
GO:0009789 - positive regulation of abscisic acid-activated signaling pathway (biological process)
GO:0016567 - protein ubiquitination (biological process)
GO:0005829 - cytosol (cellular component)
GO:0009536 - plastid (cellular component)
GO:0034357 - photosynthetic membrane (cellular component)
GO:0061630 - ubiquitin protein ligase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0004252 - serine-type endopeptidase activity (molecular function)
InterPro domainsIPR036286 - LexA/Signal peptidase-like superfamily
IPR023329 - Chlorophyll a/b binding domain superfamily
IPR022796 - Chlorophyll A-B binding protein
IPR019533 - Peptidase S26
IPR017907 - Zinc finger, RING-type, conserved site
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR001841 - Zinc finger, RING-type
IPR000223 - Peptidase S26A, signal peptidase I


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4401483.1 hypothetical protein G4B88_001677 [Cannabis sativa]3.6e-13957.85Show/hide
Query:  SNRTGTSSIEYHGRRSGQVQARFAFNKKNTPQKRASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTR
        S+  GT   + H   SG+VQARF F  K +P K++  PS+DR LW+PGA APE+L G+LVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKN AG +I TR
Subjt:  SNRTGTSSIEYHGRRSGQVQARFAFNKKNTPQKRASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTR

Query:  FENADVKSTRSSYSVRHLGCRGSVSASSSMEG--GLCWPRSVLSVEWLTGVTWQDAGK----------RNAKLDPEKRLCPRGKYFDPLSLATDPEKAAA
         E ADVKST         G +        + G   +      L+VEWLTGVTWQDAGK          RNA+LDPEKRL P GK+FDPL LA DPEK A 
Subjt:  FENADVKSTRSSYSVRHLGCRGSVSASSSMEG--GLCWPRSVLSVEWLTGVTWQDAGK----------RNAKLDPEKRLCPRGKYFDPLSLATDPEKAAA

Query:  LQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWASGDSLDQNRRLPSGKFSLKFLRSEFDIYALYFYRVPNRQKTEMANRSLVWGVAKKCFTFGIIGVT
        LQLAEIKHARLAMVAFL F VQAAATGKGPL+NWA+                                   + +   M  ++++W  +K+ FTFG+IG+T
Subjt:  LQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWASGDSLDQNRRLPSGKFSLKFLRSEFDIYALYFYRVPNRQKTEMANRSLVWGVAKKCFTFGIIGVT

Query:  VSDRYASIVPLRGASMSPTFNPRATSQAGAVTGDYVLVEKFCLEKYKFSPGDVIVYCSPSNYKEKHVKRIIALPGDWVGTRQTYDVVKVPEGHCWVEGDN
        +SDRYAS+  +RG+SMSPTFNP     +G +  DYVLVEKFCLEKYKFS GDV+V+ SP N+KE+H+KRII LPGDWVG R + DV+K+P+GHCWVEGDN
Subjt:  VSDRYASIVPLRGASMSPTFNPRATSQAGAVTGDYVLVEKFCLEKYKFSPGDVIVYCSPSNYKEKHVKRIIALPGDWVGTRQTYDVVKVPEGHCWVEGDN

Query:  AECSMDSRSFGPIPMGLIQGRVSHIVWPPQRIGAVERKYPQGRIKS
           SMDS SFGP+P+GL+QGR SHIVWPPQR+GAVERK P+GR+ S
Subjt:  AECSMDSRSFGPIPMGLIQGRVSHIVWPPQRIGAVERKYPQGRIKS

KZV31410.1 hypothetical protein F511_05514 [Dorcoceras hygrometricum]1.3e-14955.97Show/hide
Query:  MRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPSLLQLQGGINDIEDRKQREVYAAKYKRKDQLDRGKIT
        MRLSYSP A  FLF VQWTDCHLAGALGL+RILIYKA+EDGKTTMSI ERKASL+EFYGVIFPSLLQLQ GI D+E+RKQRE+   +YKR+D++ +GK+ 
Subjt:  MRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPSLLQLQGGINDIEDRKQREVYAAKYKRKDQLDRGKIT

Query:  EIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTGSSEIVDLSFISRENLKRLFMFIDKM-MASGYGLSHQ
        EI++EREEECGICME+N  VVLPSC+HS+C+KCY +WR+RSQSCPFCRDSLKRVNSG+LWIYT  S+IV+LS I+RENLKRLFM++DK+ +     +   
Subjt:  EIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTGSSEIVDLSFISRENLKRLFMFIDKM-MASGYGLSHQ

Query:  VLNPQLLPDMEFLEQPRVGA-------------MANALKGLYYF----SNRTGTSSIEYH------------------------GRRSG--QVQARFAFN
        +L  ++ PD + + +PR  +               ++ K L Y     ++ T  S +  H                          RSG  +VQARF F 
Subjt:  VLNPQLLPDMEFLEQPRVGA-------------MANALKGLYYF----SNRTGTSSIEYH------------------------GRRSG--QVQARFAFN

Query:  KKNTPQKR-ASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTRFENADVKSTRSSYSVRHLGCRGSVS
        KK  P K+ A + S+DR LWYPGA AP++L GSL+GDYGFDPFGLGKPAEYLQF+LDSLDQNLAKNVAGD+I TR E +DVKST         G +    
Subjt:  KKNTPQKR-ASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTRFENADVKSTRSSYSVRHLGCRGSVS

Query:  ASSSMEG--GLCWPRSVLSVEWLTGVTWQDAGK-----------------------------------RNAKLDPEKRLCPRGKYFDPLSLATDPEKAAA
            + G   +      ++VEWLTGVTWQDAGK                                   RNA+LDPEKRL P G YFDPL LA+DPEK AA
Subjt:  ASSSMEG--GLCWPRSVLSVEWLTGVTWQDAGK-----------------------------------RNAKLDPEKRLCPRGKYFDPLSLATDPEKAAA

Query:  LQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWAS
        LQLAEIKHARLAMVAFL F VQAAATGKGPL+NWA+
Subjt:  LQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWAS

PPE02406.1 hypothetical protein GOBAR_DD00594 [Gossypium barbadense]5.1e-13355.77Show/hide
Query:  SGQVQARFAFNKKNTPQKRASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTRFENADVKSTRSSYSV
        SG+  ARF F  K    K+ S+ + DR LWYPGA+AP+WL GSLVGDYGFDPFGLGKPAEYLQ+DLDSLDQNLAKN AG++I TR E ADVK+T      
Subjt:  SGQVQARFAFNKKNTPQKRASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTRFENADVKSTRSSYSV

Query:  RHLGCRGSVSASSSMEG--GLCWPRSVLSVEWLTGVTWQDAGK-----------------------------------RNAKLDPEKRLCPRGKYFDPLS
           G +        + G   +      L+VEWLTGVTWQDAGK                                   RNA+LDPEKRL P GKYFDPL 
Subjt:  RHLGCRGSVSASSSMEG--GLCWPRSVLSVEWLTGVTWQDAGK-----------------------------------RNAKLDPEKRLCPRGKYFDPLS

Query:  LATDPEKAAALQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWASGDSLDQNRRLPSGKFSLKFLRSEFD-----IYALYFYRVPNRQKTEMANRSLVW
        LA DPEK A LQLAEIKHARLAMVAFL F VQAAATGKGPL+NWA+  ++   +   +  + L   R E +     I A     V N    +M   S++W
Subjt:  LATDPEKAAALQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWASGDSLDQNRRLPSGKFSLKFLRSEFD-----IYALYFYRVPNRQKTEMANRSLVW

Query:  GVAKKCFTFGIIGVTVSDRYASIVPLRGASMSPTFNPRATSQAGAVTGDYVLVEKFCLEKYKFSPGDVIVYCSPSNYKEKHVKRIIALPGDWVGTRQTYD
         VAKKCFT G+I +TVSD +ASIVP+RGASMSPTFNP+  S   +++ D VLVEK CL KYKFS GDVIV+CSP N+KEKH+KRI+ LPGDWVGT   YD
Subjt:  GVAKKCFTFGIIGVTVSDRYASIVPLRGASMSPTFNPRATSQAGAVTGDYVLVEKFCLEKYKFSPGDVIVYCSPSNYKEKHVKRIIALPGDWVGTRQTYD

Query:  VVKVPEGHCWVEGDNAECSMDSRSFGPIPMGLIQGRVSHIVWPPQRIGAVERKYPQGRI
        VV +PEGHCWVEGDN+  S+DSRSFGP+P+GL++GRV+HI+WPP R+ ++ERK  + R+
Subjt:  VVKVPEGHCWVEGDNAECSMDSRSFGPIPMGLIQGRVSHIVWPPQRIGAVERKYPQGRI

PPR84801.1 hypothetical protein GOBAR_AA35911 [Gossypium barbadense]2.4e-12754.19Show/hide
Query:  SGQVQARFAFNKKNTPQKRASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTRFENADVKSTRSSYSV
        SG+  ARF F  K    K+AS+ +SDR LWYPGA+AP+WL GSLVGDYGFDPFGLGKPAEYLQ+DLDSLDQNLAKN+AG++I TR E ADVK+T      
Subjt:  SGQVQARFAFNKKNTPQKRASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTRFENADVKSTRSSYSV

Query:  RHLGCRGSVSASSSMEG--GLCWPRSVLSVEWLTGVTWQDAGK-----------------------------------RNAKLDPEKRLCPRGKYFDPLS
           G +        + G   +      L+VEWLTGVTWQDAGK                                   RNA+LDPEKRL P GKYFDPL 
Subjt:  RHLGCRGSVSASSSMEG--GLCWPRSVLSVEWLTGVTWQDAGK-----------------------------------RNAKLDPEKRLCPRGKYFDPLS

Query:  LATDPEKAAALQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWASGDSLDQNRRLPSGKFSLKFLRSEFDIYALYFYRVPNRQKTEMANRSLVWGVAKK
        LA DPEK A LQLAEIKHARLAMVAFL F VQAAATGK        G  L ++  +                              +M   S++W VAKK
Subjt:  LATDPEKAAALQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWASGDSLDQNRRLPSGKFSLKFLRSEFDIYALYFYRVPNRQKTEMANRSLVWGVAKK

Query:  CFTFGIIGVTVSDRYASIVPLRGASMSPTFNPRATSQAGAVTGDYVLVEKFCLEKYKFSPGDVIVYCSPSNYKEKHVKRIIALPGDWVGTRQTYDVVKVP
        CFT G+I +TVSD ++SIVP+RGASMSPTFNP+  S   +++ D VLVEK CL  YKFS GDVIV+CSP N+KEKH+KRI+ LPGDWVGT   YDVV +P
Subjt:  CFTFGIIGVTVSDRYASIVPLRGASMSPTFNPRATSQAGAVTGDYVLVEKFCLEKYKFSPGDVIVYCSPSNYKEKHVKRIIALPGDWVGTRQTYDVVKVP

Query:  EGHCWVEGDNAECSMDSRSFGPIPMGLIQGRVSHIVWPPQRIGAVERKYPQGRI
        EGHCWVEGDN   S+DSRSFGP+P+GL++GRV+HI+WPP R+ +VERK  + R+
Subjt:  EGHCWVEGDNAECSMDSRSFGPIPMGLIQGRVSHIVWPPQRIGAVERKYPQGRI

XP_022145744.1 uncharacterized protein LOC111015127 isoform X1 [Momordica charantia]5.1e-12599.56Show/hide
Query:  MRKSFKDSLKALEADIQFANTLASDYPREYDGACLQMRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPS
        MRKSFKDSLKALEADIQFANTLASDYPREYDGACLQMRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPS
Subjt:  MRKSFKDSLKALEADIQFANTLASDYPREYDGACLQMRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPS

Query:  LLQLQGGINDIEDRKQREVYAAKYKRKDQLDRGKITEIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTG
        LLQLQGGINDIEDRKQREVYAAKYKRKDQLDRGKITEIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTG
Subjt:  LLQLQGGINDIEDRKQREVYAAKYKRKDQLDRGKITEIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTG

Query:  SSEIVDLSFISRENLKRLFMFIDKM
        SSEIVDLSFISRENLKRLFMFIDK+
Subjt:  SSEIVDLSFISRENLKRLFMFIDKM

TrEMBL top hitse value%identityAlignment
A0A2P5W153 Chlorophyll a-b binding protein, chloroplastic1.2e-12754.19Show/hide
Query:  SGQVQARFAFNKKNTPQKRASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTRFENADVKSTRSSYSV
        SG+  ARF F  K    K+AS+ +SDR LWYPGA+AP+WL GSLVGDYGFDPFGLGKPAEYLQ+DLDSLDQNLAKN+AG++I TR E ADVK+T      
Subjt:  SGQVQARFAFNKKNTPQKRASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTRFENADVKSTRSSYSV

Query:  RHLGCRGSVSASSSMEG--GLCWPRSVLSVEWLTGVTWQDAGK-----------------------------------RNAKLDPEKRLCPRGKYFDPLS
           G +        + G   +      L+VEWLTGVTWQDAGK                                   RNA+LDPEKRL P GKYFDPL 
Subjt:  RHLGCRGSVSASSSMEG--GLCWPRSVLSVEWLTGVTWQDAGK-----------------------------------RNAKLDPEKRLCPRGKYFDPLS

Query:  LATDPEKAAALQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWASGDSLDQNRRLPSGKFSLKFLRSEFDIYALYFYRVPNRQKTEMANRSLVWGVAKK
        LA DPEK A LQLAEIKHARLAMVAFL F VQAAATGK        G  L ++  +                              +M   S++W VAKK
Subjt:  LATDPEKAAALQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWASGDSLDQNRRLPSGKFSLKFLRSEFDIYALYFYRVPNRQKTEMANRSLVWGVAKK

Query:  CFTFGIIGVTVSDRYASIVPLRGASMSPTFNPRATSQAGAVTGDYVLVEKFCLEKYKFSPGDVIVYCSPSNYKEKHVKRIIALPGDWVGTRQTYDVVKVP
        CFT G+I +TVSD ++SIVP+RGASMSPTFNP+  S   +++ D VLVEK CL  YKFS GDVIV+CSP N+KEKH+KRI+ LPGDWVGT   YDVV +P
Subjt:  CFTFGIIGVTVSDRYASIVPLRGASMSPTFNPRATSQAGAVTGDYVLVEKFCLEKYKFSPGDVIVYCSPSNYKEKHVKRIIALPGDWVGTRQTYDVVKVP

Query:  EGHCWVEGDNAECSMDSRSFGPIPMGLIQGRVSHIVWPPQRIGAVERKYPQGRI
        EGHCWVEGDN   S+DSRSFGP+P+GL++GRV+HI+WPP R+ +VERK  + R+
Subjt:  EGHCWVEGDNAECSMDSRSFGPIPMGLIQGRVSHIVWPPQRIGAVERKYPQGRI

A0A2Z7BAY2 Chlorophyll a-b binding protein, chloroplastic6.4e-15055.97Show/hide
Query:  MRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPSLLQLQGGINDIEDRKQREVYAAKYKRKDQLDRGKIT
        MRLSYSP A  FLF VQWTDCHLAGALGL+RILIYKA+EDGKTTMSI ERKASL+EFYGVIFPSLLQLQ GI D+E+RKQRE+   +YKR+D++ +GK+ 
Subjt:  MRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPSLLQLQGGINDIEDRKQREVYAAKYKRKDQLDRGKIT

Query:  EIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTGSSEIVDLSFISRENLKRLFMFIDKM-MASGYGLSHQ
        EI++EREEECGICME+N  VVLPSC+HS+C+KCY +WR+RSQSCPFCRDSLKRVNSG+LWIYT  S+IV+LS I+RENLKRLFM++DK+ +     +   
Subjt:  EIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTGSSEIVDLSFISRENLKRLFMFIDKM-MASGYGLSHQ

Query:  VLNPQLLPDMEFLEQPRVGA-------------MANALKGLYYF----SNRTGTSSIEYH------------------------GRRSG--QVQARFAFN
        +L  ++ PD + + +PR  +               ++ K L Y     ++ T  S +  H                          RSG  +VQARF F 
Subjt:  VLNPQLLPDMEFLEQPRVGA-------------MANALKGLYYF----SNRTGTSSIEYH------------------------GRRSG--QVQARFAFN

Query:  KKNTPQKR-ASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTRFENADVKSTRSSYSVRHLGCRGSVS
        KK  P K+ A + S+DR LWYPGA AP++L GSL+GDYGFDPFGLGKPAEYLQF+LDSLDQNLAKNVAGD+I TR E +DVKST         G +    
Subjt:  KKNTPQKR-ASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTRFENADVKSTRSSYSVRHLGCRGSVS

Query:  ASSSMEG--GLCWPRSVLSVEWLTGVTWQDAGK-----------------------------------RNAKLDPEKRLCPRGKYFDPLSLATDPEKAAA
            + G   +      ++VEWLTGVTWQDAGK                                   RNA+LDPEKRL P G YFDPL LA+DPEK AA
Subjt:  ASSSMEG--GLCWPRSVLSVEWLTGVTWQDAGK-----------------------------------RNAKLDPEKRLCPRGKYFDPLSLATDPEKAAA

Query:  LQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWAS
        LQLAEIKHARLAMVAFL F VQAAATGKGPL+NWA+
Subjt:  LQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWAS

A0A5N5H761 Chlorophyll a-b binding protein, chloroplastic8.2e-12153.29Show/hide
Query:  SGQVQARFAFNKKNTPQKRASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTRFENADVKSTRSSYSV
        SG+VQARF F  K  P+K     S+DR LWYPGA APEWL GSLVGDYGFDPFGLGKPAEYLQ+D D LDQNLAKN+AGDVI TR E ADV+ST      
Subjt:  SGQVQARFAFNKKNTPQKRASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTRFENADVKSTRSSYSV

Query:  RHLGCRGSVSASSSMEG--GLCWPRSVLSVEWLTGVTWQDAGK-----------------------------------RNAKLDPEKRLCPRGKYFDPLS
           G +        + G   +      L+VE +TG+TWQDAGK                                   RNA+LDPEKRL P GK+FDPL 
Subjt:  RHLGCRGSVSASSSMEG--GLCWPRSVLSVEWLTGVTWQDAGK-----------------------------------RNAKLDPEKRLCPRGKYFDPLS

Query:  LATDPEKAAALQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWASGDSLDQNRRLPSGKFSLKFLRSEFDIYALYFYRVPNRQKTEMANRSLVWGVAKK
        LA DPEK A LQLAEIKHARLAMVAFL F VQAA TGKGPL+NWA+  SL                                                  
Subjt:  LATDPEKAAALQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWASGDSLDQNRRLPSGKFSLKFLRSEFDIYALYFYRVPNRQKTEMANRSLVWGVAKK

Query:  CFTFGIIGVTVSDRYASIVPLRGASMSPTFNPRATSQAGAVTGDYVLVEKFCLEKYKFSPGDVIVYCSPSNYKEKHVKRIIALPGDWVGTRQTYDVVKVP
                  VSDR+AS+ P+RG+SMSPT NP  TS  G    DYVLVEK CL+ YKFS GDV+V+ SPSN+KE H+KRI ALPG+W+GTR++YDVVK+P
Subjt:  CFTFGIIGVTVSDRYASIVPLRGASMSPTFNPRATSQAGAVTGDYVLVEKFCLEKYKFSPGDVIVYCSPSNYKEKHVKRIIALPGDWVGTRQTYDVVKVP

Query:  EGHCWVEGDNAECSMDSRSFGPIPMGLIQGRVSHIVWPPQRIGAVERKYPQGRIKS
        EGHCWVEGDN+  S+DS+SFGPIP+GL+QGRV+HIVWPPQR+GAVER  PQ  I S
Subjt:  EGHCWVEGDNAECSMDSRSFGPIPMGLIQGRVSHIVWPPQRIGAVERKYPQGRIKS

A0A6J1CVC0 uncharacterized protein LOC111015127 isoform X12.5e-12599.56Show/hide
Query:  MRKSFKDSLKALEADIQFANTLASDYPREYDGACLQMRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPS
        MRKSFKDSLKALEADIQFANTLASDYPREYDGACLQMRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPS
Subjt:  MRKSFKDSLKALEADIQFANTLASDYPREYDGACLQMRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPS

Query:  LLQLQGGINDIEDRKQREVYAAKYKRKDQLDRGKITEIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTG
        LLQLQGGINDIEDRKQREVYAAKYKRKDQLDRGKITEIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTG
Subjt:  LLQLQGGINDIEDRKQREVYAAKYKRKDQLDRGKITEIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTG

Query:  SSEIVDLSFISRENLKRLFMFIDKM
        SSEIVDLSFISRENLKRLFMFIDK+
Subjt:  SSEIVDLSFISRENLKRLFMFIDKM

A0A7J6I1T4 Chlorophyll a-b binding protein, chloroplastic1.7e-13957.85Show/hide
Query:  SNRTGTSSIEYHGRRSGQVQARFAFNKKNTPQKRASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTR
        S+  GT   + H   SG+VQARF F  K +P K++  PS+DR LW+PGA APE+L G+LVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKN AG +I TR
Subjt:  SNRTGTSSIEYHGRRSGQVQARFAFNKKNTPQKRASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTR

Query:  FENADVKSTRSSYSVRHLGCRGSVSASSSMEG--GLCWPRSVLSVEWLTGVTWQDAGK----------RNAKLDPEKRLCPRGKYFDPLSLATDPEKAAA
         E ADVKST         G +        + G   +      L+VEWLTGVTWQDAGK          RNA+LDPEKRL P GK+FDPL LA DPEK A 
Subjt:  FENADVKSTRSSYSVRHLGCRGSVSASSSMEG--GLCWPRSVLSVEWLTGVTWQDAGK----------RNAKLDPEKRLCPRGKYFDPLSLATDPEKAAA

Query:  LQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWASGDSLDQNRRLPSGKFSLKFLRSEFDIYALYFYRVPNRQKTEMANRSLVWGVAKKCFTFGIIGVT
        LQLAEIKHARLAMVAFL F VQAAATGKGPL+NWA+                                   + +   M  ++++W  +K+ FTFG+IG+T
Subjt:  LQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWASGDSLDQNRRLPSGKFSLKFLRSEFDIYALYFYRVPNRQKTEMANRSLVWGVAKKCFTFGIIGVT

Query:  VSDRYASIVPLRGASMSPTFNPRATSQAGAVTGDYVLVEKFCLEKYKFSPGDVIVYCSPSNYKEKHVKRIIALPGDWVGTRQTYDVVKVPEGHCWVEGDN
        +SDRYAS+  +RG+SMSPTFNP     +G +  DYVLVEKFCLEKYKFS GDV+V+ SP N+KE+H+KRII LPGDWVG R + DV+K+P+GHCWVEGDN
Subjt:  VSDRYASIVPLRGASMSPTFNPRATSQAGAVTGDYVLVEKFCLEKYKFSPGDVIVYCSPSNYKEKHVKRIIALPGDWVGTRQTYDVVKVPEGHCWVEGDN

Query:  AECSMDSRSFGPIPMGLIQGRVSHIVWPPQRIGAVERKYPQGRIKS
           SMDS SFGP+P+GL+QGR SHIVWPPQR+GAVERK P+GR+ S
Subjt:  AECSMDSRSFGPIPMGLIQGRVSHIVWPPQRIGAVERKYPQGRIKS

SwissProt top hitse value%identityAlignment
Q07473 Chlorophyll a-b binding protein CP29.1, chloroplastic5.2e-6455.78Show/hide
Query:  SGQVQARFAFNKKNTPQKRASEP--SSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTRFENADVKST----
        SG+  A F F KK    K++++   ++DR LWYPGAI+P+WL GSLVGDYGFDPFGLGKPAEYLQFD+DSLDQNLAKN+AGDVI TR E AD KST    
Subjt:  SGQVQARFAFNKKNTPQKRASEP--SSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTRFENADVKST----

Query:  -RSSYSVRHLGCRGSVSASSSMEGGLCWPRSVLSVEWLTGVTWQDAGK-----------------------------------RNAKLDPEKRLCPRGKY
            + ++       +    +M   L      LSVEWLTGVTWQDAGK                                   RNA+LD EKRL P GK+
Subjt:  -RSSYSVRHLGCRGSVSASSSMEGGLCWPRSVLSVEWLTGVTWQDAGK-----------------------------------RNAKLDPEKRLCPRGKY

Query:  FDPLSLATDPEKAAALQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWAS
        FDPL LA DPEK A LQLAEIKHARLAMVAFL F VQAAATGKGPL+NWA+
Subjt:  FDPLSLATDPEKAAALQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWAS

Q6AZD4 Mitochondrial inner membrane protease subunit 21.3e-2740Show/hide
Query:  IGVTVSDRYASIVPLRGASMSPTFNPRATSQAGAVTGDYVLVEKFCLEKYKFSPGDVIVYCSPSNYKEKHVKRIIALPGDWV---GTRQTYDVVKVPEGH
        + VTV DR A +  + GASM P+ NP      G  + D VL+ ++ +  Y    GD++   SP N ++K +KR+I + GD++   G +  Y  V+VP+GH
Subjt:  IGVTVSDRYASIVPLRGASMSPTFNPRATSQAGAVTGDYVLVEKFCLEKYKFSPGDVIVYCSPSNYKEKHVKRIIALPGDWV---GTRQTYDVVKVPEGH

Query:  CWVEGDNAECSMDSRSFGPIPMGLIQGRVSHIVWPPQRIGAVERKYPQGR
         W+EGD+   S DS +FGP+ +GL+ GR SHI+WPP R   +E   P  R
Subjt:  CWVEGDNAECSMDSRSFGPIPMGLIQGRVSHIVWPPQRIGAVERKYPQGR

Q9M022 E3 ubiquitin-protein ligase AIRP25.5e-9874.67Show/hide
Query:  MRKSFKDSLKALEADIQFANTLASDYPREYDGACLQMRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPS
        MRKSFKDSLKALEADIQFANTLAS+YP EYDG  +QMRLSYSPAA  FLF +QWTDCH AGALGLLRILIYKAY DGKTTMS+ ERK S++EFY V+FPS
Subjt:  MRKSFKDSLKALEADIQFANTLASDYPREYDGACLQMRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPS

Query:  LLQLQGGINDIEDRKQREVYAAKYKRKDQLDRGKITEIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTG
        LLQL GGI D+E+RKQ+E+   +Y++KD+ D+GK++EIDLEREEECGIC+E+   VVLP+CNHSMC+ CYR+WR RSQSCPFCR SLKRVNSGDLWIYT 
Subjt:  LLQLQGGINDIEDRKQREVYAAKYKRKDQLDRGKITEIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTG

Query:  SSEIVDLSFISRENLKRLFMFIDKM
        S+EI DL  I +ENLKRL ++IDK+
Subjt:  SSEIVDLSFISRENLKRLFMFIDKM

Q9S7W1 Chlorophyll a-b binding protein CP29.3, chloroplastic6.2e-4946.8Show/hide
Query:  SGQVQARFAFN---KKNTP---QKRASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTRFENADVKST
        +G+VQARF F+   KK  P   + R  +   DR +W+PGA  PEWL GS++GD GFDPFGLGKPAEYLQ+D D LDQNLAKNVAGD+I    E++++K T
Subjt:  SGQVQARFAFN---KKNTP---QKRASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTRFENADVKST

Query:  -----RSSYSVRHLGCRGSVSASSSMEGGLCWPRSVLSVEWLTGVTWQDAGK-----------------------------------RNAKLDPEKRLCP
                + ++       +    +M G L      ++VE LTG+ WQDAGK                                   RN++LDPEKR+ P
Subjt:  -----RSSYSVRHLGCRGSVSASSSMEGGLCWPRSVLSVEWLTGVTWQDAGK-----------------------------------RNAKLDPEKRLCP

Query:  RGKYFDPLSLATDPEKAAALQLAEIKHARLAMVAFLSFVVQAAATGKGPL
         G YFDPL LA DPEK   L+LAEIKH+RLAMVAFL F +QAA TGKGP+
Subjt:  RGKYFDPLSLATDPEKAAALQLAEIKHARLAMVAFLSFVVQAAATGKGPL

Q9XF88 Chlorophyll a-b binding protein CP29.2, chloroplastic1.0e-5952.11Show/hide
Query:  SNRTGTSSIEYHGRRSGQVQARFAFNKKNTPQKRASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTR
        S+  GT  +      S +  ARF F  K    K+A    SDR LW+PGA +PE+L GSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKN+ G+VI TR
Subjt:  SNRTGTSSIEYHGRRSGQVQARFAFNKKNTPQKRASEPSSDRRLWYPGAIAPEWLGGSLVGDYGFDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTR

Query:  FENADVKSTRSSYSVRHLGCRGSVSASSSMEG--GLCWPRSVLSVEWLTGVTWQDAGK-----------------------------------RNAKLDP
         E  D KST         G +        + G   +      ++VEWLTGVTWQDAGK                                   RNA+LD 
Subjt:  FENADVKSTRSSYSVRHLGCRGSVSASSSMEG--GLCWPRSVLSVEWLTGVTWQDAGK-----------------------------------RNAKLDP

Query:  EKRLCPRGKYFDPLSLATDPEKAAALQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWAS
        EKRL P GK+FDPL LA+DP K A LQLAEIKHARLAMV FL F VQAAATGKGPL+NWA+
Subjt:  EKRLCPRGKYFDPLSLATDPEKAAALQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWAS

Arabidopsis top hitse value%identityAlignment
AT3G47160.1 RING/U-box superfamily protein1.0e-9164.52Show/hide
Query:  SFKDSLKALEADIQFANTLASDYPREYDGACLQMRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPSLLQ
        SFKDSLKALEADIQ ANT+A DYPRE DGA +QMRLSY+PAAQF LF VQWTDCHLAG LGLLR+LIY  Y DGKTTMS+ ERK S+K+FY VIFPSLLQ
Subjt:  SFKDSLKALEADIQFANTLASDYPREYDGACLQMRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPSLLQ

Query:  LQGGINDIEDRKQREVYAAKYKRKDQLDRGKITEIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTGSSE
        L+ GI D++DRKQ+EV   +Y+ KD+ ++ K++EID+EREEECGICME+N MVVLP+C HS+C+KCYR W  RS+SCPFCRDSLKRVNSGDLW+    S+
Subjt:  LQGGINDIEDRKQREVYAAKYKRKDQLDRGKITEIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTGSSE

Query:  IVDLSFISRENLKRLFMFIDKMMASGYGLSHQVLNPQLLPDMEFLEQP
         V++  I REN KRLF++I+K+             P ++PD  F   P
Subjt:  IVDLSFISRENLKRLFMFIDKMMASGYGLSHQVLNPQLLPDMEFLEQP

AT3G47160.2 RING/U-box superfamily protein3.7e-8961.54Show/hide
Query:  SFKDSLKALEADIQFANTLAS------------DYPREYDGACLQMRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLK
        SFKDSLKALEADIQ ANT+ S            DYPRE DGA +QMRLSY+PAAQF LF VQWTDCHLAG LGLLR+LIY  Y DGKTTMS+ ERK S+K
Subjt:  SFKDSLKALEADIQFANTLAS------------DYPREYDGACLQMRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLK

Query:  EFYGVIFPSLLQLQGGINDIEDRKQREVYAAKYKRKDQLDRGKITEIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVN
        +FY VIFPSLLQL+ GI D++DRKQ+EV   +Y+ KD+ ++ K++EID+EREEECGICME+N MVVLP+C HS+C+KCYR W  RS+SCPFCRDSLKRVN
Subjt:  EFYGVIFPSLLQLQGGINDIEDRKQREVYAAKYKRKDQLDRGKITEIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVN

Query:  SGDLWIYTGSSEIVDLSFISRENLKRLFMFIDKMMASGYGLSHQVLNPQLLPDMEFLEQP
        SGDLW+    S+ V++  I REN KRLF++I+K+             P ++PD  F   P
Subjt:  SGDLWIYTGSSEIVDLSFISRENLKRLFMFIDKMMASGYGLSHQVLNPQLLPDMEFLEQP

AT5G01520.1 RING/U-box superfamily protein3.9e-9974.67Show/hide
Query:  MRKSFKDSLKALEADIQFANTLASDYPREYDGACLQMRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPS
        MRKSFKDSLKALEADIQFANTLAS+YP EYDG  +QMRLSYSPAA  FLF +QWTDCH AGALGLLRILIYKAY DGKTTMS+ ERK S++EFY V+FPS
Subjt:  MRKSFKDSLKALEADIQFANTLASDYPREYDGACLQMRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPS

Query:  LLQLQGGINDIEDRKQREVYAAKYKRKDQLDRGKITEIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTG
        LLQL GGI D+E+RKQ+E+   +Y++KD+ D+GK++EIDLEREEECGIC+E+   VVLP+CNHSMC+ CYR+WR RSQSCPFCR SLKRVNSGDLWIYT 
Subjt:  LLQLQGGINDIEDRKQREVYAAKYKRKDQLDRGKITEIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTG

Query:  SSEIVDLSFISRENLKRLFMFIDKM
        S+EI DL  I +ENLKRL ++IDK+
Subjt:  SSEIVDLSFISRENLKRLFMFIDKM

AT5G58787.1 RING/U-box superfamily protein6.9e-8863.31Show/hide
Query:  SFKDSLKALEADIQFANTLASDYPREYDGACLQMRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPSLLQ
        SFKDSLKALEADIQ ANTLA DYPRE DGA +QMRLSYSP AQFFLF VQWTDC LAG LGLLR+LIY  Y DGKTTMS+ ERKAS++EF  VI PSL Q
Subjt:  SFKDSLKALEADIQFANTLASDYPREYDGACLQMRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPSLLQ

Query:  LQGGINDIEDRKQREVYAAKYKRKDQLDRGKITEIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTGSSE
        LQ G+ DI+D KQ+EV   +Y++KD+    +++EI++EREEECGICME+N  VVLP+C HS+C+KCYR WR RSQSCPFCRDSLKRV+SGDLW++   ++
Subjt:  LQGGINDIEDRKQREVYAAKYKRKDQLDRGKITEIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTGSSE

Query:  IVDLSFISRENLKRLFMFIDKMMASGYGLSHQVLNPQLLPDMEFLEQP
         V+L+ I+REN KRLFM+I+K+             P ++PD  +   P
Subjt:  IVDLSFISRENLKRLFMFIDKMMASGYGLSHQVLNPQLLPDMEFLEQP

AT5G58787.2 RING/U-box superfamily protein6.9e-8059.68Show/hide
Query:  SFKDSLKALEADIQFANTLASDYPREYDGACLQMRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPSLLQ
        SFKDSLKALEADIQ ANTLA DYPRE DGA +QMRLSYSP AQFFLF VQWTDC LAG LGLLR+LIY  Y DGKTTMS+ ERKAS++EF          
Subjt:  SFKDSLKALEADIQFANTLASDYPREYDGACLQMRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPSLLQ

Query:  LQGGINDIEDRKQREVYAAKYKRKDQLDRGKITEIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTGSSE
              DI+D KQ+EV   +Y++KD+    +++EI++EREEECGICME+N  VVLP+C HS+C+KCYR WR RSQSCPFCRDSLKRV+SGDLW++   ++
Subjt:  LQGGINDIEDRKQREVYAAKYKRKDQLDRGKITEIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTGSSE

Query:  IVDLSFISRENLKRLFMFIDKMMASGYGLSHQVLNPQLLPDMEFLEQP
         V+L+ I+REN KRLFM+I+K+             P ++PD  +   P
Subjt:  IVDLSFISRENLKRLFMFIDKMMASGYGLSHQVLNPQLLPDMEFLEQP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGAAATCTTTCAAGGATTCCCTTAAGGCACTGGAGGCTGATATTCAGTTCGCCAATACCTTGGCTTCTGATTATCCCAGGGAATATGATGGTGCATGCCTTCAGAT
GAGATTATCCTACAGCCCAGCTGCACAGTTCTTTCTCTTTTTCGTGCAATGGACTGATTGTCATCTTGCTGGTGCATTGGGTTTGCTCAGAATTCTAATATATAAGGCAT
ATGAAGACGGGAAGACTACCATGTCTATTCAAGAAAGAAAGGCTAGTCTCAAAGAGTTCTATGGGGTGATCTTTCCATCTTTATTGCAACTTCAAGGAGGAATTAATGAC
ATAGAAGACAGAAAACAGAGAGAAGTTTATGCTGCCAAATACAAAAGAAAAGATCAATTGGATAGAGGAAAGATCACTGAAATCGACTTGGAAAGAGAGGAGGAATGTGG
AATTTGCATGGAGCTAAACTGCATGGTTGTATTGCCTAGTTGCAATCATTCAATGTGCATGAAGTGTTATAGAAGTTGGCGCACTCGGTCTCAATCATGTCCCTTCTGCC
GTGACAGTCTCAAGAGAGTCAACTCTGGAGATCTTTGGATCTATACCGGTAGCAGTGAAATCGTCGACTTGTCCTTCATCTCTAGGGAAAATTTAAAGAGGCTTTTCATG
TTCATTGATAAGATGATGGCATCGGGTTACGGCCTCAGTCACCAGGTTCTCAATCCTCAACTTCTTCCAGACATGGAATTTTTGGAACAACCGCGTGTAGGAGCAATGGC
AAATGCCTTGAAAGGATTATACTATTTTAGCAATAGGACAGGCACTTCGAGCATTGAATACCATGGACGCCGGTCGGGTCAGGTGCAAGCCCGATTCGCATTCAATAAAA
AAAATACCCCACAAAAGAGAGCATCAGAACCCAGTTCCGATCGTCGACTGTGGTACCCTGGAGCCATAGCACCCGAGTGGCTCGGCGGTAGCTTGGTGGGTGATTATGGG
TTCGACCCATTTGGTTTGGGCAAACCAGCAGAATACCTTCAGTTCGACTTGGACTCTTTGGACCAGAACTTGGCTAAGAATGTTGCCGGTGATGTGATCAGTACCCGGTT
TGAGAATGCAGATGTGAAATCGACCCGTTCCAGCTATTCAGTGAGGCATTTGGGTTGCAGAGGTTCTGTGAGTGCGAGCTCATCCATGGAAGGTGGGCTATGTTGGCCGC
GCTCGGTGCTCTCAGTCGAGTGGCTCACCGGTGTTACTTGGCAAGACGCCGGAAAGAGAAACGCCAAGCTTGACCCAGAGAAGAGACTTTGCCCCCGGGGCAAGTACTTC
GATCCACTCAGCCTGGCAACCGATCCCGAGAAGGCAGCGGCACTTCAATTGGCAGAGATCAAGCATGCTCGCCTCGCCATGGTTGCATTCCTCAGCTTTGTTGTTCAGGC
TGCTGCCACTGGCAAAGGCCCGCTCGATAACTGGGCGTCGGGCGACTCACTTGACCAAAATCGGCGACTTCCATCAGGTAAATTTTCGTTGAAATTCCTCCGATCAGAGT
TCGACATTTATGCGCTTTACTTTTATCGTGTTCCGAATCGACAAAAGACTGAAATGGCTAATCGGAGTTTAGTGTGGGGCGTTGCGAAGAAATGTTTCACATTTGGGATT
ATAGGCGTTACTGTTTCAGATCGTTATGCAAGTATTGTCCCTCTCCGGGGTGCTTCAATGTCTCCCACTTTCAACCCTAGAGCGACTTCTCAGGCGGGTGCAGTTACTGG
TGACTACGTACTGGTTGAGAAATTTTGCCTTGAGAAGTACAAGTTTTCTCCTGGTGACGTGATCGTTTACTGCTCCCCTAGTAATTACAAGGAGAAACATGTGAAAAGAA
TTATTGCCTTACCAGGAGATTGGGTTGGAACTCGTCAAACATATGATGTTGTAAAGGTTCCAGAAGGACATTGTTGGGTTGAGGGCGATAACGCAGAATGCAGCATGGAC
TCGAGATCTTTTGGCCCAATACCAATGGGTTTGATTCAAGGGAGGGTGTCACATATCGTATGGCCACCTCAAAGAATTGGTGCTGTCGAGAGAAAATATCCTCAGGGGAG
AATTAAGTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCGGAAATCTTTCAAGGATTCCCTTAAGGCACTGGAGGCTGATATTCAGTTCGCCAATACCTTGGCTTCTGATTATCCCAGGGAATATGATGGTGCATGCCTTCAGAT
GAGATTATCCTACAGCCCAGCTGCACAGTTCTTTCTCTTTTTCGTGCAATGGACTGATTGTCATCTTGCTGGTGCATTGGGTTTGCTCAGAATTCTAATATATAAGGCAT
ATGAAGACGGGAAGACTACCATGTCTATTCAAGAAAGAAAGGCTAGTCTCAAAGAGTTCTATGGGGTGATCTTTCCATCTTTATTGCAACTTCAAGGAGGAATTAATGAC
ATAGAAGACAGAAAACAGAGAGAAGTTTATGCTGCCAAATACAAAAGAAAAGATCAATTGGATAGAGGAAAGATCACTGAAATCGACTTGGAAAGAGAGGAGGAATGTGG
AATTTGCATGGAGCTAAACTGCATGGTTGTATTGCCTAGTTGCAATCATTCAATGTGCATGAAGTGTTATAGAAGTTGGCGCACTCGGTCTCAATCATGTCCCTTCTGCC
GTGACAGTCTCAAGAGAGTCAACTCTGGAGATCTTTGGATCTATACCGGTAGCAGTGAAATCGTCGACTTGTCCTTCATCTCTAGGGAAAATTTAAAGAGGCTTTTCATG
TTCATTGATAAGATGATGGCATCGGGTTACGGCCTCAGTCACCAGGTTCTCAATCCTCAACTTCTTCCAGACATGGAATTTTTGGAACAACCGCGTGTAGGAGCAATGGC
AAATGCCTTGAAAGGATTATACTATTTTAGCAATAGGACAGGCACTTCGAGCATTGAATACCATGGACGCCGGTCGGGTCAGGTGCAAGCCCGATTCGCATTCAATAAAA
AAAATACCCCACAAAAGAGAGCATCAGAACCCAGTTCCGATCGTCGACTGTGGTACCCTGGAGCCATAGCACCCGAGTGGCTCGGCGGTAGCTTGGTGGGTGATTATGGG
TTCGACCCATTTGGTTTGGGCAAACCAGCAGAATACCTTCAGTTCGACTTGGACTCTTTGGACCAGAACTTGGCTAAGAATGTTGCCGGTGATGTGATCAGTACCCGGTT
TGAGAATGCAGATGTGAAATCGACCCGTTCCAGCTATTCAGTGAGGCATTTGGGTTGCAGAGGTTCTGTGAGTGCGAGCTCATCCATGGAAGGTGGGCTATGTTGGCCGC
GCTCGGTGCTCTCAGTCGAGTGGCTCACCGGTGTTACTTGGCAAGACGCCGGAAAGAGAAACGCCAAGCTTGACCCAGAGAAGAGACTTTGCCCCCGGGGCAAGTACTTC
GATCCACTCAGCCTGGCAACCGATCCCGAGAAGGCAGCGGCACTTCAATTGGCAGAGATCAAGCATGCTCGCCTCGCCATGGTTGCATTCCTCAGCTTTGTTGTTCAGGC
TGCTGCCACTGGCAAAGGCCCGCTCGATAACTGGGCGTCGGGCGACTCACTTGACCAAAATCGGCGACTTCCATCAGGTAAATTTTCGTTGAAATTCCTCCGATCAGAGT
TCGACATTTATGCGCTTTACTTTTATCGTGTTCCGAATCGACAAAAGACTGAAATGGCTAATCGGAGTTTAGTGTGGGGCGTTGCGAAGAAATGTTTCACATTTGGGATT
ATAGGCGTTACTGTTTCAGATCGTTATGCAAGTATTGTCCCTCTCCGGGGTGCTTCAATGTCTCCCACTTTCAACCCTAGAGCGACTTCTCAGGCGGGTGCAGTTACTGG
TGACTACGTACTGGTTGAGAAATTTTGCCTTGAGAAGTACAAGTTTTCTCCTGGTGACGTGATCGTTTACTGCTCCCCTAGTAATTACAAGGAGAAACATGTGAAAAGAA
TTATTGCCTTACCAGGAGATTGGGTTGGAACTCGTCAAACATATGATGTTGTAAAGGTTCCAGAAGGACATTGTTGGGTTGAGGGCGATAACGCAGAATGCAGCATGGAC
TCGAGATCTTTTGGCCCAATACCAATGGGTTTGATTCAAGGGAGGGTGTCACATATCGTATGGCCACCTCAAAGAATTGGTGCTGTCGAGAGAAAATATCCTCAGGGGAG
AATTAAGTCCTAA
Protein sequenceShow/hide protein sequence
MRKSFKDSLKALEADIQFANTLASDYPREYDGACLQMRLSYSPAAQFFLFFVQWTDCHLAGALGLLRILIYKAYEDGKTTMSIQERKASLKEFYGVIFPSLLQLQGGIND
IEDRKQREVYAAKYKRKDQLDRGKITEIDLEREEECGICMELNCMVVLPSCNHSMCMKCYRSWRTRSQSCPFCRDSLKRVNSGDLWIYTGSSEIVDLSFISRENLKRLFM
FIDKMMASGYGLSHQVLNPQLLPDMEFLEQPRVGAMANALKGLYYFSNRTGTSSIEYHGRRSGQVQARFAFNKKNTPQKRASEPSSDRRLWYPGAIAPEWLGGSLVGDYG
FDPFGLGKPAEYLQFDLDSLDQNLAKNVAGDVISTRFENADVKSTRSSYSVRHLGCRGSVSASSSMEGGLCWPRSVLSVEWLTGVTWQDAGKRNAKLDPEKRLCPRGKYF
DPLSLATDPEKAAALQLAEIKHARLAMVAFLSFVVQAAATGKGPLDNWASGDSLDQNRRLPSGKFSLKFLRSEFDIYALYFYRVPNRQKTEMANRSLVWGVAKKCFTFGI
IGVTVSDRYASIVPLRGASMSPTFNPRATSQAGAVTGDYVLVEKFCLEKYKFSPGDVIVYCSPSNYKEKHVKRIIALPGDWVGTRQTYDVVKVPEGHCWVEGDNAECSMD
SRSFGPIPMGLIQGRVSHIVWPPQRIGAVERKYPQGRIKS