; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr024324 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr024324
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00001291:1785045..1788939
RNA-Seq ExpressionSgr024324
SyntenySgr024324
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579446.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]2.3e-13967.16Show/hide
Query:  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFA
        KI LN  VNVKQATQIHAHILVNG R+LES LVRQITRSEFTCARIVSRYL++IL +S+NP  FSWGCAVRFFSQNGQFMEA SHYVQMQRLGL PSTFA
Subjt:  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFA

Query:  VSSTLRACGRIMCKFGGSYVHSQ------------------------------KVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNS
        VSSTLRACGRI+CKF GS VH+Q                              KVFDD+ EKNVVSWNSILSGYVKIGNL DAQKVFDEMP KDVISWNS
Subjt:  VSSTLRACGRIMCKFGGSYVHSQ------------------------------KVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNS

Query:  MLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL---------------------------
        ML+GFANSGN+D+ASCLFQQ+ E+SSASWNAMISGYVNCGD+KSARN+FD MP RNNVSWITLIAGYSKLGE+                           
Subjt:  MLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL---------------------------

Query:  -------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFG
                                 +  AS  S    LGNLNYG WIESYMEKLGIELD++LATALVD YAKSGNIERAFELFN LKK+DLV+YSAMIFG
Subjt:  -------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFG

Query:  CG
        CG
Subjt:  CG

XP_022155871.1 pentatricopeptide repeat-containing protein At4g22760 [Momordica charantia]2.1e-14067.33Show/hide
Query:  KILLNN--SVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPST
        +ILLN   SVNVKQA+QIH+ I+VNG RHLE+LLVRQITRSEFTCARIVS YL++IL +SQNP AFSWGCAVRFFSQNGQF+EAFSHYVQMQ LGL PST
Subjt:  KILLNN--SVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPST

Query:  FAVSSTLRACGRIMCKFGGSYVH------------------------------SQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISW
        F VSSTLRAC RIMCKFGGS+VH                              +QKVFDD+ EKNVVSWNSILSG+VKIGNLVDAQKVFDEMPEKDVISW
Subjt:  FAVSSTLRACGRIMCKFGGSYVH------------------------------SQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISW

Query:  NSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL-------------------------
        NSMLSGFANSGN+D+ASCLFQQMREKSSASWNAMISGY+N GDIKSARNLFD MPQRNNVSWITLIAGYSKLG++                         
Subjt:  NSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL-------------------------

Query:  ---------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMI
                                   +  AS  S    LGNLNYG+W+ESYMEKLGI+LD++LATALVDLYAKSGNI+RAFELF+GLKK+DLVAYSAMI
Subjt:  ---------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMI

Query:  FGCG
        FGCG
Subjt:  FGCG

XP_022922292.1 pentatricopeptide repeat-containing protein At4g22760 [Cucurbita moschata]1.2e-13866.67Show/hide
Query:  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFA
        KI LN  VNVKQA QIHAHILVNG R+LES LVRQITRSEFTCARIVSRYL++IL +SQNP +FSWGCAVRFFS+NGQFMEA SHYVQMQRLGL PSTFA
Subjt:  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFA

Query:  VSSTLRACGRIMCKFGGSYVHSQ------------------------------KVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNS
        VSSTLRACGRI+CKF GS VH+Q                              KVFDD+ EKNVVSWNSILSGYVKIGNL DAQKVFDEMP KDVISWNS
Subjt:  VSSTLRACGRIMCKFGGSYVHSQ------------------------------KVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNS

Query:  MLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL---------------------------
        ML+GFANSGN+D+ASCLFQQ+ E+SSASWNAMISGYVNCGD+KSARN+FD MP RNNVSWITLIAGYSKLGE+                           
Subjt:  MLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL---------------------------

Query:  -------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFG
                                 +  AS  S    LGNLNYG WIESYMEKLGIELD++LATALVD YAKSGNIERAFELFN LKK+D V+YSAMIFG
Subjt:  -------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFG

Query:  CG
        CG
Subjt:  CG

XP_038874892.1 pentatricopeptide repeat-containing protein At4g22760-like [Benincasa hispida]1.6e-14368.91Show/hide
Query:  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFA
        K+ LN+S+N KQATQIHAHILVNG  +LES LVRQITRSEFTCARIVS YL+QILH+SQNP AF+W CAVRFFSQNGQFMEA SHYVQMQRLGL P TFA
Subjt:  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFA

Query:  VSSTLRACGRIMCKFGGSYVH------------------------------SQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNS
        VSSTLRACGRIMCKFGGSYVH                              +QKVFDD+ EKNVVSWNSILSGYVKIGNLVDAQKVFDEMP KDVISWNS
Subjt:  VSSTLRACGRIMCKFGGSYVH------------------------------SQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNS

Query:  MLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL---------------------------
        ML+GFANSGN+D+A CLFQQM EKSSASWNAMISGYVNCGDIKSARNLFD MP RNNVSWITLIAGYSKLGE+                           
Subjt:  MLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL---------------------------

Query:  -------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFG
                                 +  AS  S    LGNLN GTWIESYMEKLGIELD++LATALVDLYAKSGNIERAFELFNGLKKRDL+AYSAMIFG
Subjt:  -------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFG

Query:  CG
        CG
Subjt:  CG

XP_038906935.1 pentatricopeptide repeat-containing protein At4g22760 [Benincasa hispida]1.6e-14368.91Show/hide
Query:  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFA
        K+ LN+S+N KQATQIHAHILVNG  +LES LVRQITRSEFTCARIVS YL+QILH+SQNP AF+W CAVRFFSQNGQFMEA SHYVQMQRLGL P TFA
Subjt:  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFA

Query:  VSSTLRACGRIMCKFGGSYVH------------------------------SQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNS
        VSSTLRACGRIMCKFGGSYVH                              +QKVFDD+ EKNVVSWNSILSGYVKIGNLVDAQKVFDEMP KDVISWNS
Subjt:  VSSTLRACGRIMCKFGGSYVH------------------------------SQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNS

Query:  MLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL---------------------------
        ML+GFANSGN+D+A CLFQQM EKSSASWNAMISGYVNCGDIKSARNLFD MP RNNVSWITLIAGYSKLGE+                           
Subjt:  MLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL---------------------------

Query:  -------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFG
                                 +  AS  S    LGNLN GTWIESYMEKLGIELD++LATALVDLYAKSGNIERAFELFNGLKKRDL+AYSAMIFG
Subjt:  -------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFG

Query:  CG
        CG
Subjt:  CG

TrEMBL top hitse value%identityAlignment
A0A1S3ATB6 pentatricopeptide repeat-containing protein At4g227609.6e-13966.34Show/hide
Query:  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFA
        K  LN+SV+VKQATQIHAHILVNG  +LES LVRQITRS+FTCARIVSRYL++ILH+SQNP AF+W CAVRFFS+NGQFMEA +HYVQMQRLGL PSTFA
Subjt:  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFA

Query:  VSSTLRACGRIMCKFGGSYVH------------------------------SQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNS
        VSSTLRACGRIMCKFGG  +H                              +QKVFDD+ EKNVVSWNSILSGYVKIGNLVDAQKVFDEMP KDVISWNS
Subjt:  VSSTLRACGRIMCKFGGSYVH------------------------------SQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNS

Query:  MLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL---------------------------
        ML+GF+NSGN+D+A CLFQQMREKSSASWNAMISGYVNCGD+K+ARNLFD MP RNNV+ ITLIAGYSKLGE+                           
Subjt:  MLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL---------------------------

Query:  ---------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMI
                                   +  AS  S    LGNL+YGTWIESYMEKLGIELD++LATALVDLYAKSGNI+RAFELFN LKKRDLVAYSAMI
Subjt:  ---------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMI

Query:  FGCG
        FGCG
Subjt:  FGCG

A0A5A7THR1 Pentatricopeptide repeat-containing protein9.6e-13966.34Show/hide
Query:  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFA
        K  LN+SV+VKQATQIHAHILVNG  +LES LVRQITRS+FTCARIVSRYL++ILH+SQNP AF+W CAVRFFS+NGQFMEA +HYVQMQRLGL PSTFA
Subjt:  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFA

Query:  VSSTLRACGRIMCKFGGSYVH------------------------------SQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNS
        VSSTLRACGRIMCKFGG  +H                              +QKVFDD+ EKNVVSWNSILSGYVKIGNLVDAQKVFDEMP KDVISWNS
Subjt:  VSSTLRACGRIMCKFGGSYVH------------------------------SQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNS

Query:  MLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL---------------------------
        ML+GF+NSGN+D+A CLFQQMREKSSASWNAMISGYVNCGD+K+ARNLFD MP RNNV+ ITLIAGYSKLGE+                           
Subjt:  MLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL---------------------------

Query:  ---------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMI
                                   +  AS  S    LGNL+YGTWIESYMEKLGIELD++LATALVDLYAKSGNI+RAFELFN LKKRDLVAYSAMI
Subjt:  ---------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMI

Query:  FGCG
        FGCG
Subjt:  FGCG

A0A6J1DT09 pentatricopeptide repeat-containing protein At4g227601.0e-14067.33Show/hide
Query:  KILLNN--SVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPST
        +ILLN   SVNVKQA+QIH+ I+VNG RHLE+LLVRQITRSEFTCARIVS YL++IL +SQNP AFSWGCAVRFFSQNGQF+EAFSHYVQMQ LGL PST
Subjt:  KILLNN--SVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPST

Query:  FAVSSTLRACGRIMCKFGGSYVH------------------------------SQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISW
        F VSSTLRAC RIMCKFGGS+VH                              +QKVFDD+ EKNVVSWNSILSG+VKIGNLVDAQKVFDEMPEKDVISW
Subjt:  FAVSSTLRACGRIMCKFGGSYVH------------------------------SQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISW

Query:  NSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL-------------------------
        NSMLSGFANSGN+D+ASCLFQQMREKSSASWNAMISGY+N GDIKSARNLFD MPQRNNVSWITLIAGYSKLG++                         
Subjt:  NSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL-------------------------

Query:  ---------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMI
                                   +  AS  S    LGNLNYG+W+ESYMEKLGI+LD++LATALVDLYAKSGNI+RAFELF+GLKK+DLVAYSAMI
Subjt:  ---------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMI

Query:  FGCG
        FGCG
Subjt:  FGCG

A0A6J1E2U4 pentatricopeptide repeat-containing protein At4g227605.6e-13966.67Show/hide
Query:  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFA
        KI LN  VNVKQA QIHAHILVNG R+LES LVRQITRSEFTCARIVSRYL++IL +SQNP +FSWGCAVRFFS+NGQFMEA SHYVQMQRLGL PSTFA
Subjt:  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFA

Query:  VSSTLRACGRIMCKFGGSYVHSQ------------------------------KVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNS
        VSSTLRACGRI+CKF GS VH+Q                              KVFDD+ EKNVVSWNSILSGYVKIGNL DAQKVFDEMP KDVISWNS
Subjt:  VSSTLRACGRIMCKFGGSYVHSQ------------------------------KVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNS

Query:  MLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL---------------------------
        ML+GFANSGN+D+ASCLFQQ+ E+SSASWNAMISGYVNCGD+KSARN+FD MP RNNVSWITLIAGYSKLGE+                           
Subjt:  MLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL---------------------------

Query:  -------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFG
                                 +  AS  S    LGNLNYG WIESYMEKLGIELD++LATALVD YAKSGNIERAFELFN LKK+D V+YSAMIFG
Subjt:  -------------------------VLLAS--SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFG

Query:  CG
        CG
Subjt:  CG

A0A6J1HXF1 pentatricopeptide repeat-containing protein At4g227601.5e-13666.17Show/hide
Query:  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFA
        KI LN  VNVKQATQIHA ILVNG R+LES LVRQITRSEF+ ARIVSRYL++IL +SQNP +FSWGCAVRFFSQNGQFME  SHYVQMQRLGL PSTFA
Subjt:  KILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFA

Query:  VSSTLRACGRIMCKFGGSYVHSQ------------------------------KVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNS
        VSSTLRACGRI+CKFGGS VH+Q                              KVFDD+ EKNVVSWNSILSGYVKIG L DAQKVFDEMP KDVISWNS
Subjt:  VSSTLRACGRIMCKFGGSYVHSQ------------------------------KVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNS

Query:  MLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL-------------------VLLAS---
        ML+GFANSGN+D+ASCLFQQ+ E+SSASWNAMISGYVNCGD+KSARN+FD MP RNNVSWITLIAGYSKLGE+                    L+A    
Subjt:  MLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGEL-------------------VLLAS---

Query:  --------------------------------SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFG
                                        S    LGNLNYG WIESYMEKLGIELD++LATALVD YAKSGNI+RAFELFN LKK+DLV+YSAMIFG
Subjt:  --------------------------------SLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFG

Query:  CG
        CG
Subjt:  CG

SwissProt top hitse value%identityAlignment
O49399 Pentatricopeptide repeat-containing protein At4g188401.7e-3930.63Show/hide
Query:  NVKQATQIHAHILVNGFRH----LESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSST
        ++ +  Q HA +L  G  H       L+    T  E    + VS Y   IL+   +P  F+    +R ++ +     A + + +M    + P  ++ +  
Subjt:  NVKQATQIHAHILVNGFRH----LESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSST

Query:  LRACGRIMCKFGGSYVHSQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMIS
        L+AC        G  +H   +   L+  +V   N++++ Y + G    A+KV D MP +D +SWNS+LS +   G +D+A  LF +M E++  SWN MIS
Subjt:  LRACGRIMCKFGGSYVHSQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMIS

Query:  GYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLG-----------------------ELVLLASSLTRWLGNLNYGTWIESYMEKLGIELDEYLATA
        GY   G +K A+ +FD+MP R+ VSW  ++  Y+ +G                        LV + S+    LG+L+ G W+  Y++K GIE++ +LATA
Subjt:  GYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLG-----------------------ELVLLASSLTRWLGNLNYGTWIESYMEKLGIELDEYLATA

Query:  LVDLYAKSGNIERAFELFNGLKKRDLVAYSAMI
        LVD+Y+K G I++A E+F    KRD+  ++++I
Subjt:  LVDLYAKSGNIERAFELFNGLKKRDLVAYSAMI

P0C8Q5 Pentatricopeptide repeat-containing protein At4g227601.0e-7638.37Show/hide
Query:  RAKILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPST
        + +  L   V ++QA Q+HA ++VN + HLE +LV Q        +R +  Y+++IL       +FSWGC VRF SQ+ +F E    Y+ M   G+ PS+
Subjt:  RAKILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPST

Query:  FAVSSTLRACGRIMCKFGGSYVHSQ------------------------------KVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISW
         AV+S LRACG++     G  +H+Q                              K FDD+ EKN VSWNS+L GY++ G L +A++VFD++PEKD +SW
Subjt:  FAVSSTLRACGRIMCKFGGSYVHSQ------------------------------KVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISW

Query:  NSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGE--------------------------
        N ++S +A  G++  A  LF  M  KS ASWN +I GYVNC ++K AR  FDAMPQ+N VSWIT+I+GY+KLG+                          
Subjt:  NSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGE--------------------------

Query:  ---------LVLLASSLTR-------------------WLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMI
                 L L A  L R                    LGN ++GTW+ESY+ + GI++D+ L+T+L+DLY K G+  +AF++F+ L K+D V+YSAMI
Subjt:  ---------LVLLASSLTR-------------------WLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMI

Query:  FGCG
         GCG
Subjt:  FGCG

Q1PEU4 Pentatricopeptide repeat-containing protein At2g448805.9e-3728.21Show/hide
Query:  QNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQR-LGLRPSTFAVSSTLRACGRIMCKFGGSYVHSQ------------------------------KVFD
        Q   +F     ++ + +  Q+ ++F+ Y  +++     P  F  ++  ++C   MC + G  +HSQ                                FD
Subjt:  QNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQR-LGLRPSTFAVSSTLRACGRIMCKFGGSYVHSQ------------------------------KVFD

Query:  DLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPE-KDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRN
        ++  ++ VSW +++SGY++ G L  A K+FD+MP  KDV+ +N+M+ GF  SG++  A  LF +M  K+  +W  MI GY N  DI +AR LFDAMP+RN
Subjt:  DLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPE-KDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRN

Query:  NVSWITLIAGYSKLGE----------------------LVLLASSLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKK
         VSW T+I GY +  +                       +L         G L+ G W   ++++  ++    + TA++D+Y+K G IE+A  +F+ + +
Subjt:  NVSWITLIAGYSKLGE----------------------LVLLASSLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKK

Query:  RDLVAYSAMIFG
        + + +++AMI G
Subjt:  RDLVAYSAMIFG

Q9LS72 Pentatricopeptide repeat-containing protein At3g292303.5e-4532.16Show/hide
Query:  LNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSS
        L    N+ Q  Q+HA I+       E L +     S  +  R  +  +R + +  Q P        +R  +QN Q  +AF  + +MQR GL    F    
Subjt:  LNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSS

Query:  TLRAC-----------------------------GRIMC--KFGGSYVH-SQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNSM
         L+AC                               I C  + GG  V  + K+F+ + E++ VSWNS+L G VK G L DA+++FDEMP++D+ISWN+M
Subjt:  TLRAC-----------------------------GRIMC--KFGGSYVH-SQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNSM

Query:  LSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAM--PQRNNVSWITLIAGYSKLGEL--------VLLASSL-----------
        L G+A    + +A  LF++M E+++ SW+ M+ GY   GD++ AR +FD M  P +N V+W  +IAGY++ G L         ++AS L           
Subjt:  LSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAM--PQRNNVSWITLIAGYSKLGEL--------VLLASSL-----------

Query:  --TRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFGCG
              G L+ G  I S +++  +  + Y+  AL+D+YAK GN+++AF++FN + K+DLV+++ M+ G G
Subjt:  --TRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFGCG

Q9MAT2 Pentatricopeptide repeat-containing protein At1g048403.5e-3728.14Show/hide
Query:  IHAHILVNGFRHLESLLVRQITRSEFTCARIVS--------RYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRA
        IHA       RH+ + ++R+   S    A++VS         Y   I   S+    F     +R  ++N +F  +  H++ M RLG++P        L++
Subjt:  IHAHILVNGFRHLESLLVRQITRSEFTCARIVS--------RYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRA

Query:  CGRIMCKFGGSYVHSQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEK----DVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMI
          ++  ++ G  +H+     + ++ +     S++  Y K G L  A +VF+E P++     ++ WN +++G+  + ++  A+ LF+ M E++S SW+ +I
Subjt:  CGRIMCKFGGSYVHSQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEK----DVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMI

Query:  SGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGELVLLASSLTRWL---------------------GNLNYGTWIESYMEKLGIELDEYLATAL
         GYV+ G++  A+ LF+ MP++N VSW TLI G+S+ G+     S+    L                     G L  G  I  Y+   GI+LD  + TAL
Subjt:  SGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGELVLLASSLTRWL---------------------GNLNYGTWIESYMEKLGIELDEYLATAL

Query:  VDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFG
        VD+YAK G ++ A  +F+ +  +D+++++AMI G
Subjt:  VDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFG

Arabidopsis top hitse value%identityAlignment
AT1G04840.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.5e-3828.14Show/hide
Query:  IHAHILVNGFRHLESLLVRQITRSEFTCARIVS--------RYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRA
        IHA       RH+ + ++R+   S    A++VS         Y   I   S+    F     +R  ++N +F  +  H++ M RLG++P        L++
Subjt:  IHAHILVNGFRHLESLLVRQITRSEFTCARIVS--------RYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSSTLRA

Query:  CGRIMCKFGGSYVHSQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEK----DVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMI
          ++  ++ G  +H+     + ++ +     S++  Y K G L  A +VF+E P++     ++ WN +++G+  + ++  A+ LF+ M E++S SW+ +I
Subjt:  CGRIMCKFGGSYVHSQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEK----DVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMI

Query:  SGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGELVLLASSLTRWL---------------------GNLNYGTWIESYMEKLGIELDEYLATAL
         GYV+ G++  A+ LF+ MP++N VSW TLI G+S+ G+     S+    L                     G L  G  I  Y+   GI+LD  + TAL
Subjt:  SGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGELVLLASSLTRWL---------------------GNLNYGTWIESYMEKLGIELDEYLATAL

Query:  VDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFG
        VD+YAK G ++ A  +F+ +  +D+++++AMI G
Subjt:  VDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFG

AT1G13410.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.8e-4237.67Show/hide
Query:  GSYVHSQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSAR
        G    + KVF +++EKNVV W S+++GY+   +LV A++ FD  PE+D++ WN+M+SG+   GN+ +A  LF QM  +   SWN ++ GY N GD+++  
Subjt:  GSYVHSQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSAR

Query:  NLFDAMPQRNNVSWITLIAGYSKLGELVLLASSLTRW----------------------LGNLNYGTWIESYMEKLGI-ELDEYLATALVDLYAKSGNIE
         +FD MP+RN  SW  LI GY++ G +  +  S  R                       LG  ++G W+  Y E LG  ++D  +  AL+D+Y K G IE
Subjt:  NLFDAMPQRNNVSWITLIAGYSKLGELVLLASSLTRW----------------------LGNLNYGTWIESYMEKLGI-ELDEYLATALVDLYAKSGNIE

Query:  RAFELFNGLKKRDLVAYSAMIFG
         A E+F G+K+RDL++++ MI G
Subjt:  RAFELFNGLKKRDLVAYSAMIFG

AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.5e-4632.16Show/hide
Query:  LNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSS
        L    N+ Q  Q+HA I+       E L +     S  +  R  +  +R + +  Q P        +R  +QN Q  +AF  + +MQR GL    F    
Subjt:  LNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSS

Query:  TLRAC-----------------------------GRIMC--KFGGSYVH-SQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNSM
         L+AC                               I C  + GG  V  + K+F+ + E++ VSWNS+L G VK G L DA+++FDEMP++D+ISWN+M
Subjt:  TLRAC-----------------------------GRIMC--KFGGSYVH-SQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNSM

Query:  LSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAM--PQRNNVSWITLIAGYSKLGEL--------VLLASSL-----------
        L G+A    + +A  LF++M E+++ SW+ M+ GY   GD++ AR +FD M  P +N V+W  +IAGY++ G L         ++AS L           
Subjt:  LSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAM--PQRNNVSWITLIAGYSKLGEL--------VLLASSL-----------

Query:  --TRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFGCG
              G L+ G  I S +++  +  + Y+  AL+D+YAK GN+++AF++FN + K+DLV+++ M+ G G
Subjt:  --TRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMIFGCG

AT4G18840.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.2e-4030.63Show/hide
Query:  NVKQATQIHAHILVNGFRH----LESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSST
        ++ +  Q HA +L  G  H       L+    T  E    + VS Y   IL+   +P  F+    +R ++ +     A + + +M    + P  ++ +  
Subjt:  NVKQATQIHAHILVNGFRH----LESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPSTFAVSST

Query:  LRACGRIMCKFGGSYVHSQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMIS
        L+AC        G  +H   +   L+  +V   N++++ Y + G    A+KV D MP +D +SWNS+LS +   G +D+A  LF +M E++  SWN MIS
Subjt:  LRACGRIMCKFGGSYVHSQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMREKSSASWNAMIS

Query:  GYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLG-----------------------ELVLLASSLTRWLGNLNYGTWIESYMEKLGIELDEYLATA
        GY   G +K A+ +FD+MP R+ VSW  ++  Y+ +G                        LV + S+    LG+L+ G W+  Y++K GIE++ +LATA
Subjt:  GYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLG-----------------------ELVLLASSLTRWLGNLNYGTWIESYMEKLGIELDEYLATA

Query:  LVDLYAKSGNIERAFELFNGLKKRDLVAYSAMI
        LVD+Y+K G I++A E+F    KRD+  ++++I
Subjt:  LVDLYAKSGNIERAFELFNGLKKRDLVAYSAMI

AT4G22760.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.1e-7838.37Show/hide
Query:  RAKILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPST
        + +  L   V ++QA Q+HA ++VN + HLE +LV Q        +R +  Y+++IL       +FSWGC VRF SQ+ +F E    Y+ M   G+ PS+
Subjt:  RAKILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEAFSHYVQMQRLGLRPST

Query:  FAVSSTLRACGRIMCKFGGSYVHSQ------------------------------KVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISW
         AV+S LRACG++     G  +H+Q                              K FDD+ EKN VSWNS+L GY++ G L +A++VFD++PEKD +SW
Subjt:  FAVSSTLRACGRIMCKFGGSYVHSQ------------------------------KVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISW

Query:  NSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGE--------------------------
        N ++S +A  G++  A  LF  M  KS ASWN +I GYVNC ++K AR  FDAMPQ+N VSWIT+I+GY+KLG+                          
Subjt:  NSMLSGFANSGNLDQASCLFQQMREKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGE--------------------------

Query:  ---------LVLLASSLTR-------------------WLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMI
                 L L A  L R                    LGN ++GTW+ESY+ + GI++D+ L+T+L+DLY K G+  +AF++F+ L K+D V+YSAMI
Subjt:  ---------LVLLASSLTR-------------------WLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGLKKRDLVAYSAMI

Query:  FGCG
         GCG
Subjt:  FGCG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCCTATGATTACTAAGGAAGCAGACACGTGTACGGCAAAGATGAGAAGCGGTTCGAATTCTGATGCTTGCTTCCGAGCTAAAATATTGTTGAACAACTCTGTAAA
TGTAAAGCAGGCAACTCAAATTCATGCCCACATTCTCGTCAATGGGTTCCGACATCTCGAGTCTCTCTTGGTTCGTCAAATCACTCGCTCTGAGTTCACTTGCGCCAGAA
TTGTATCCCGTTATCTCCGACAAATCCTTCACTATTCGCAAAACCCAGTTGCTTTCTCATGGGGTTGCGCCGTTCGATTCTTTTCCCAGAACGGTCAATTCATGGAAGCT
TTCTCTCATTATGTTCAGATGCAGAGATTGGGACTGCGTCCAAGCACATTTGCTGTATCTTCGACTTTGAGAGCTTGCGGTAGAATTATGTGCAAGTTTGGAGGGAGTTA
TGTTCACTCTCAGAAGGTTTTTGATGATCTGATCGAGAAAAATGTGGTTTCGTGGAATTCAATCTTGTCTGGTTATGTGAAAATTGGGAACTTAGTTGACGCTCAGAAAG
TGTTCGATGAAATGCCTGAGAAAGATGTCATATCTTGGAATTCGATGTTGTCGGGATTCGCCAACTCTGGAAATTTGGATCAAGCGTCCTGTTTATTCCAACAAATGCGG
GAGAAAAGTTCAGCTTCTTGGAACGCAATGATCAGTGGTTACGTGAACTGTGGAGACATAAAGTCTGCAAGAAACCTGTTTGATGCAATGCCTCAAAGAAATAATGTTTC
CTGGATTACATTGATTGCTGGGTATTCGAAGCTTGGGGAGTTGGTGCTGCTCGCGAGCTCTTTGACAAGATGGCTGGGGAATTTGAATTATGGTACTTGGATTGAATCGT
ATATGGAAAAACTTGGGATTGAATTGGATGAATATTTGGCCACTGCATTGGTAGACTTGTATGCAAAATCCGGGAACATCGAAAGGGCGTTTGAGCTGTTCAATGGTCTG
AAAAAGAGGGATTTAGTTGCTTATTCAGCTATGATCTTTGGATGTGGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGCCTATGATTACTAAGGAAGCAGACACGTGTACGGCAAAGATGAGAAGCGGTTCGAATTCTGATGCTTGCTTCCGAGCTAAAATATTGTTGAACAACTCTGTAAA
TGTAAAGCAGGCAACTCAAATTCATGCCCACATTCTCGTCAATGGGTTCCGACATCTCGAGTCTCTCTTGGTTCGTCAAATCACTCGCTCTGAGTTCACTTGCGCCAGAA
TTGTATCCCGTTATCTCCGACAAATCCTTCACTATTCGCAAAACCCAGTTGCTTTCTCATGGGGTTGCGCCGTTCGATTCTTTTCCCAGAACGGTCAATTCATGGAAGCT
TTCTCTCATTATGTTCAGATGCAGAGATTGGGACTGCGTCCAAGCACATTTGCTGTATCTTCGACTTTGAGAGCTTGCGGTAGAATTATGTGCAAGTTTGGAGGGAGTTA
TGTTCACTCTCAGAAGGTTTTTGATGATCTGATCGAGAAAAATGTGGTTTCGTGGAATTCAATCTTGTCTGGTTATGTGAAAATTGGGAACTTAGTTGACGCTCAGAAAG
TGTTCGATGAAATGCCTGAGAAAGATGTCATATCTTGGAATTCGATGTTGTCGGGATTCGCCAACTCTGGAAATTTGGATCAAGCGTCCTGTTTATTCCAACAAATGCGG
GAGAAAAGTTCAGCTTCTTGGAACGCAATGATCAGTGGTTACGTGAACTGTGGAGACATAAAGTCTGCAAGAAACCTGTTTGATGCAATGCCTCAAAGAAATAATGTTTC
CTGGATTACATTGATTGCTGGGTATTCGAAGCTTGGGGAGTTGGTGCTGCTCGCGAGCTCTTTGACAAGATGGCTGGGGAATTTGAATTATGGTACTTGGATTGAATCGT
ATATGGAAAAACTTGGGATTGAATTGGATGAATATTTGGCCACTGCATTGGTAGACTTGTATGCAAAATCCGGGAACATCGAAAGGGCGTTTGAGCTGTTCAATGGTCTG
AAAAAGAGGGATTTAGTTGCTTATTCAGCTATGATCTTTGGATGTGGATAA
Protein sequenceShow/hide protein sequence
MAPMITKEADTCTAKMRSGSNSDACFRAKILLNNSVNVKQATQIHAHILVNGFRHLESLLVRQITRSEFTCARIVSRYLRQILHYSQNPVAFSWGCAVRFFSQNGQFMEA
FSHYVQMQRLGLRPSTFAVSSTLRACGRIMCKFGGSYVHSQKVFDDLIEKNVVSWNSILSGYVKIGNLVDAQKVFDEMPEKDVISWNSMLSGFANSGNLDQASCLFQQMR
EKSSASWNAMISGYVNCGDIKSARNLFDAMPQRNNVSWITLIAGYSKLGELVLLASSLTRWLGNLNYGTWIESYMEKLGIELDEYLATALVDLYAKSGNIERAFELFNGL
KKRDLVAYSAMIFGCG