CuGenDBv2

Sgr023410 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr023410
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat superfamily protein
Genome location: tig00000892:3063722..3068428
RNA-Seq Expression: Sgr023410
Synteny: Sgr023410
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
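
The genome location above follows a contig:start..end pattern. As a minimal sketch (not part of CuGenDB), the string can be split into its parts in Python. The function name parse_location is hypothetical, and the span length assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split 'contig:start..end' into (contig, start, end, span)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive,
    # so the span covers end - start + 1 bases.
    return contig, start, end, end - start + 1


print(parse_location("tig00000892:3063722..3068428"))
# ('tig00000892', 3063722, 3068428, 4707)
```

Under that assumption the gene region spans 4707 bases, which is longer than the 1489-nucleotide CDS listed further down.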


Homology
GenBank top hits | e-value | % identity | Alignment
KAG6570725.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | 2.4e-153 | 69.66
Query:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------
        RSLRTP++D R+FRK+CTWRRNLEE +ENDSQFVY +EQIVRGKQ+W+IAFNNA IS  LKPHHVEKVL++T DD                         
Subjt:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------

Query:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC
                                                      SSSGFDMLIQ+YVQNKRE DGVL+INLMR++GL PEVRTLSALLNALARIRKFC
Subjt:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC

Query:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR
        Q LELFD  VNAGVKPD+YIYTVVV+CLCELKDF+KA +II + +  GCGLSIVTYNVFIHGLCKS+RVWEAVEIKRLLGEKGLKAD+VTYCTLVLGLCR
Subjt:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR

Query:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI
        +QEFEVGVEV  EMIELG+VPSE+AVSGV+EGLRRMG+IE AF LLNKVGKLGV+PNLFVYNS+INSLCK+GKL+EAE LFSVMT+RGLFPNDVTYTILI
Subjt:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI

Query:  DGFGRSAKLDVA
        DGFGRSAKLDVA
Subjt:  DGFGRSAKLDVA

KAG7010569.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | 2.4e-153 | 69.66
Query:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------
        RSLRTP++D R+FRK+CTWRRNLEE +ENDSQFVY +EQIVRGKQ+W+IAFNNA IS  LKPHHVEKVL++T DD                         
Subjt:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------

Query:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC
                                                      SSSGFDMLIQ+YVQNKRE DGVL+INLMR++GL PEVRTLSALLNALARIRKFC
Subjt:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC

Query:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR
        Q LELFD  VNAGVKPD+YIYTVVV+CLCELKDF+KA +II + +  GCGLSIVTYNVFIHGLCKS+RVWEAVEIKRLLGEKGLKAD+VTYCTLVLGLCR
Subjt:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR

Query:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI
        +QEFEVGVEV  EMIELG+VPSE+AVSGV+EGLRRMG+IE AF LLNKVGKLGV+PNLFVYNS+INSLCK+GKL+EAE LFSVMT+RGLFPNDVTYTILI
Subjt:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI

Query:  DGFGRSAKLDVA
        DGFGRSAKLDVA
Subjt:  DGFGRSAKLDVA

XP_022148372.1 putative pentatricopeptide repeat-containing protein At5g59900 isoform X1 [Momordica charantia] | 6.3e-154 | 71.12
Query:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------
        RSLRTP+L RR+F+KYCTWRRNLEE  ENDSQF+Y LEQIVRGKQSWKIAF+NAFISGTLKPHHVE VLI+TLDD                         
Subjt:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------

Query:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC
                                                      SSSGFDMLIQ+YVQNKR MDGVLVINLMR+HG+ PEVRTLSALLNALARIRKF 
Subjt:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC

Query:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR
        Q LELFDT VNAGVKPDSYIYTV VRCLCELK F KAKE+I + +  GCGL+IVTYNVFIHGLCKSKRV EA+EIKRLLGEKGLKADLVTYCTLVLGLCR
Subjt:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR

Query:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI
        +QEFE+G+EV  EMI LGF PSE+AVSGVIEGLRRMGNI+ AF LL KVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTY+ILI
Subjt:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI

Query:  DGFGRSAKLDVA
        +GFGR A+LDVA
Subjt:  DGFGRSAKLDVA

XP_022148373.1 putative pentatricopeptide repeat-containing protein At5g59900 isoform X2 [Momordica charantia] | 6.3e-154 | 71.12
Query:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------
        RSLRTP+L RR+F+KYCTWRRNLEE  ENDSQF+Y LEQIVRGKQSWKIAF+NAFISGTLKPHHVE VLI+TLDD                         
Subjt:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------

Query:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC
                                                      SSSGFDMLIQ+YVQNKR MDGVLVINLMR+HG+ PEVRTLSALLNALARIRKF 
Subjt:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC

Query:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR
        Q LELFDT VNAGVKPDSYIYTV VRCLCELK F KAKE+I + +  GCGL+IVTYNVFIHGLCKSKRV EA+EIKRLLGEKGLKADLVTYCTLVLGLCR
Subjt:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR

Query:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI
        +QEFE+G+EV  EMI LGF PSE+AVSGVIEGLRRMGNI+ AF LL KVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTY+ILI
Subjt:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI

Query:  DGFGRSAKLDVA
        +GFGR A+LDVA
Subjt:  DGFGRSAKLDVA

XP_038901679.1 putative pentatricopeptide repeat-containing protein At5g59900 [Benincasa hispida] | 5.1e-156 | 70.57
Query:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------
        R LRTP++DRR+FRK+CTWRR+LEE +ENDS FVYVLEQIVRG QSWKIAFNNA ISG LKPHHVEKVLI+TLDD                         
Subjt:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------

Query:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC
                                                      SSSGFD+LIQ+YVQNKRE+D VLV+NLMRE+GLLPEVRTLSALLNALARIRKFC
Subjt:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC

Query:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR
        Q LELFDT VNAGVKPDSYIYTVVVRC CELKDFDKAKEII + +  GC LSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR
Subjt:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR

Query:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI
        +QEFEVGVE+  EMIELG+VPSE+AVSGVI+GLRR+G+I GAF  L+KVGKLGVVPNLFVYNS+INSLCKSGKLEEAESLF+VMTER L+PNDVTYTILI
Subjt:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI

Query:  DGFGRSAKLDVASISSRK
        DGFGR  KLDVAS   +K
Subjt:  DGFGRSAKLDVASISSRK
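
In the alignment blocks above, the unlabeled middle line follows the usual BLAST protein convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, assuming that convention holds for this page, the counts for a single block can be tallied in Python. The function name midline_counts is hypothetical, and the result describes only the block passed in, not the page-level % identity, which covers the whole alignment.

```python
def midline_counts(query, midline):
    """Return (identical, positives, columns) for one alignment block."""
    # Trailing spaces on the midline are often lost in copied text,
    # so pad it back to the width of the query line.
    midline = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in midline)
    # Positives include identities plus '+' (conservative) columns.
    positives = identical + midline.count("+")
    return identical, positives, len(query)


# Last block of the XP_038901679.1 alignment shown above.
print(midline_counts("DGFGRSAKLDVASISSRK", "DGFGR  KLDVAS   +K"))
# (12, 13, 18)
```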

TrEMBL top hits | e-value | % identity | Alignment
A0A6J1D3X3 putative pentatricopeptide repeat-containing protein At5g59900 isoform X2 | 6.8e-154 | 71.12
Query:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------
        RSLRTP+L RR+F+KYCTWRRNLEE  ENDSQF+Y LEQIVRGKQSWKIAF+NAFISGTLKPHHVE VLI+TLDD                         
Subjt:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------

Query:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC
                                                      SSSGFDMLIQ+YVQNKR MDGVLVINLMR+HG+ PEVRTLSALLNALARIRKF 
Subjt:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC

Query:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR
        Q LELFDT VNAGVKPDSYIYTV VRCLCELK F KAKE+I + +  GCGL+IVTYNVFIHGLCKSKRV EA+EIKRLLGEKGLKADLVTYCTLVLGLCR
Subjt:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR

Query:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI
        +QEFE+G+EV  EMI LGF PSE+AVSGVIEGLRRMGNI+ AF LL KVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTY+ILI
Subjt:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI

Query:  DGFGRSAKLDVA
        +GFGR A+LDVA
Subjt:  DGFGRSAKLDVA

A0A6J1D4W4 putative pentatricopeptide repeat-containing protein At5g59900 isoform X1 | 6.8e-154 | 71.12
Query:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------
        RSLRTP+L RR+F+KYCTWRRNLEE  ENDSQF+Y LEQIVRGKQSWKIAF+NAFISGTLKPHHVE VLI+TLDD                         
Subjt:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------

Query:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC
                                                      SSSGFDMLIQ+YVQNKR MDGVLVINLMR+HG+ PEVRTLSALLNALARIRKF 
Subjt:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC

Query:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR
        Q LELFDT VNAGVKPDSYIYTV VRCLCELK F KAKE+I + +  GCGL+IVTYNVFIHGLCKSKRV EA+EIKRLLGEKGLKADLVTYCTLVLGLCR
Subjt:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR

Query:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI
        +QEFE+G+EV  EMI LGF PSE+AVSGVIEGLRRMGNI+ AF LL KVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTY+ILI
Subjt:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI

Query:  DGFGRSAKLDVA
        +GFGR A+LDVA
Subjt:  DGFGRSAKLDVA

A0A6J1FVS2 putative pentatricopeptide repeat-containing protein At5g59900 isoform X1 | 7.5e-153 | 69.42
Query:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------
        RSLRTP++D   FRK+CTWRRNLEE +ENDSQFVY +EQIVRGKQ+W+IAFNNA IS  LKPHHVEKVL++T DD                         
Subjt:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------

Query:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC
                                                      SSSGFDMLIQ+YVQNKRE DGVL+INLMR++GL PEVRTLSALLNALARIRKFC
Subjt:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC

Query:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR
        Q LELFD  VNAGVKPD+YIYTVVV+CLCELKDF+KA +II + +  GCGLSIVTYNVFIHGLCKS+RVWEAVEIKRLLGEKGLKAD+VTYCTLVLGLCR
Subjt:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR

Query:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI
        +QEFEVGVEV  EMIELG+VPSE+AVSGV+EGLRRMG+IE AF LLNKVGKLGV+PNLFVYNS+INSLCK+GKL+EAE LFSVMT+RGLFPNDVTYTILI
Subjt:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI

Query:  DGFGRSAKLDVA
        DGFGRSAKLDVA
Subjt:  DGFGRSAKLDVA

A0A6J1FY36 putative pentatricopeptide repeat-containing protein At5g59900 isoform X2 | 7.5e-153 | 69.42
Query:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------
        RSLRTP++D   FRK+CTWRRNLEE +ENDSQFVY +EQIVRGKQ+W+IAFNNA IS  LKPHHVEKVL++T DD                         
Subjt:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------

Query:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC
                                                      SSSGFDMLIQ+YVQNKRE DGVL+INLMR++GL PEVRTLSALLNALARIRKFC
Subjt:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC

Query:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR
        Q LELFD  VNAGVKPD+YIYTVVV+CLCELKDF+KA +II + +  GCGLSIVTYNVFIHGLCKS+RVWEAVEIKRLLGEKGLKAD+VTYCTLVLGLCR
Subjt:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR

Query:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI
        +QEFEVGVEV  EMIELG+VPSE+AVSGV+EGLRRMG+IE AF LLNKVGKLGV+PNLFVYNS+INSLCK+GKL+EAE LFSVMT+RGLFPNDVTYTILI
Subjt:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI

Query:  DGFGRSAKLDVA
        DGFGRSAKLDVA
Subjt:  DGFGRSAKLDVA

A0A6J1J8G9 putative pentatricopeptide repeat-containing protein At5g59900 isoform X2 | 1.7e-152 | 69.17
Query:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------
        RSLRTP++D R+FRK+CTWRRNLEE +ENDSQFVY +EQIVRGKQ+W+IAFNNA IS  LKPHHVEKVLI+T DD                         
Subjt:  RSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD-------------------------

Query:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC
                                                      S+SGFDMLIQ+YVQNKRE DGVL+INLMR++GL PEVRTLSALLNALARIRKFC
Subjt:  ----------------------------------------------SSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFC

Query:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR
        Q LELFD  VNAGVKPDSYIYTVVV+CLCELKDF+KA +II + +  GCGLSIVTYNVFIHGLCKS+RVWEAVEIKRLLGEKGLKAD+VTYCTLVLGLCR
Subjt:  QALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCR

Query:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI
        +QEFEVG+EV  EMIELG+VPSE+ VSGV+EGLR+MG+IE AF LLNKVGKLGV+PNLFVYNS+INSLCK+GKL+EAE LFSVMT+RGLFPNDVTYTILI
Subjt:  VQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILI

Query:  DGFGRSAKLDVA
        DGFGRSAKLDVA
Subjt:  DGFGRSAKLDVA

SwissProt top hits | e-value | % identity | Alignment
Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial | 1.5e-33 | 32
Query:  EMDGV-LVINLMREHGLLPEVRTLSALLNALARIRKFCQALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVDGCGLS--IVTYNVFIHG
        E+D V  +I +M+  GL P      +++  L RI K  +A E F   +  G+ PD+ +YT ++   C+  D   A +    +    ++  ++TY   I G
Subjt:  EMDGV-LVINLMREHGLLPEVRTLSALLNALARIRKFCQALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVDGCGLS--IVTYNVFIHG

Query:  LCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCRVQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYN
         C+   + EA ++   +  KGL+ D VT+  L+ G C+    +    V + MI+ G  P+    + +I+GL + G+++ A  LL+++ K+G+ PN+F YN
Subjt:  LCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCRVQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYN

Query:  SVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILIDGFGRSAKLDVA
        S++N LCKSG +EEA  L       GL  + VTYT L+D + +S ++D A
Subjt:  SVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILIDGFGRSAKLDVA

Q9CAN0 Pentatricopeptide repeat-containing protein At1g63130, mitochondrial | 8.8e-34 | 27.9
Query:  VLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDDSSSG-------FDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKF
        +L+++ +GK    +   N  I       +V   L    +  + G       ++ LI+      R  D   +++ M E  + P V T SAL++A  +  K 
Subjt:  VLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDDSSSG-------FDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKF

Query:  CQALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRV--DGCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLC
         +A +L+D  +   + PD + Y+ ++   C     D+AK + + +    C  ++VTYN  I G CK+KRV E +E+ R + ++GL  + VTY TL+ G  
Subjt:  CQALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRV--DGCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLC

Query:  RVQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTIL
        + +E +    V  +M+  G +P     S +++GL   G +E A  +   + +  + P+++ YN +I  +CK+GK+E+   LF  ++ +G+ PN VTYT +
Subjt:  RVQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTIL

Query:  IDGFGRSAKLDVASISSRK
        + GF R    + A    R+
Subjt:  IDGFGRSAKLDVASISSRK

Q9FJE6 Putative pentatricopeptide repeat-containing protein At5g59900 | 2.6e-86 | 47.14
Query:  DSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD------------------------------------------------------
        D QFV  +++IVRGK+SW+IA ++  +S  LK  HVE++LI T+DD                                                      
Subjt:  DSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD------------------------------------------------------

Query:  -----------------SSSGFDMLIQHYVQNKREMDGVLVINLM-REHGLLPEVRTLSALLNALARIRKFCQALELFDTFVNAGVKPDSYIYTVVVRCL
                         SSS FD+LIQHYV+++R +DGVLV  +M  +  LLPEVRTLSALL+ L + R F  A+ELF+  V+ G++PD YIYT V+R L
Subjt:  -----------------SSSGFDMLIQHYVQNKREMDGVLVINLM-REHGLLPEVRTLSALLNALARIRKFCQALELFDTFVNAGVKPDSYIYTVVVRCL

Query:  CELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCRVQEFEVGVEVTHEMIELGFVPSESAVSG
        CELKD  +AKE+I  ++  GC ++IV YNV I GLCK ++VWEAV IK+ L  K LK D+VTYCTLV GLC+VQEFE+G+E+  EM+ L F PSE+AVS 
Subjt:  CELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCRVQEFEVGVEVTHEMIELGFVPSESAVSG

Query:  VIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILIDGFGRSAKLDVA
        ++EGLR+ G IE A  L+ +V   GV PNLFVYN++I+SLCK  K  EAE LF  M + GL PNDVTY+ILID F R  KLD A
Subjt:  VIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILIDGFGRSAKLDVA

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic | 1.9e-36 | 29.69
Query:  LEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDDSSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNAL
        +EEGD + +  + + EQ+V    SW     N  + G  K   VE  L                +++Q     D           G  P+  T + L+N L
Subjt:  LEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDDSSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNAL

Query:  ARIRKFCQALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRV--DGCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCT
         +      A+E+ D  +  G  PD Y Y  V+  LC+L +  +A E++ ++    C  + VTYN  I  LCK  +V EA E+ R+L  KG+  D+ T+ +
Subjt:  ARIRKFCQALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRV--DGCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCT

Query:  LVLGLCRVQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPND
        L+ GLC  +   V +E+  EM   G  P E   + +I+ L   G ++ A  +L ++   G   ++  YN++I+  CK+ K  EAE +F  M   G+  N 
Subjt:  LVLGLCRVQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPND

Query:  VTYTILIDGFGRSAKLDVAS
        VTY  LIDG  +S +++ A+
Subjt:  VTYTILIDGFGRSAKLDVAS

Q9LQ16 Pentatricopeptide repeat-containing protein At1g62910 | 2.6e-33 | 31.13
Query:  LIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFCQALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVDGCGL--SI
        L+  Y  +KR  D V +++ M E G  P+  T + L++ L    K  +A+ L D  V  G +PD   Y  VV  LC+  D D A  ++K+++   +   +
Subjt:  LIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFCQALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVDGCGL--SI

Query:  VTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCRVQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLG
        V YN  I GLCK K + +A+ +   +  KG++ D+ TY +L+  LC    +     +  +MIE    P+    S +I+   + G +  A  L +++ K  
Subjt:  VTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCRVQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLG

Query:  VVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILIDGFGRSAKLD
        + P++F Y+S+IN  C   +L+EA+ +F +M  +  FPN VTY+ LI GF ++ +++
Subjt:  VVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILIDGFGRSAKLD

Arabidopsis top hits | e-value | % identity | Alignment
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein | 1.1e-34 | 32
Query:  EMDGV-LVINLMREHGLLPEVRTLSALLNALARIRKFCQALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVDGCGLS--IVTYNVFIHG
        E+D V  +I +M+  GL P      +++  L RI K  +A E F   +  G+ PD+ +YT ++   C+  D   A +    +    ++  ++TY   I G
Subjt:  EMDGV-LVINLMREHGLLPEVRTLSALLNALARIRKFCQALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVDGCGLS--IVTYNVFIHG

Query:  LCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCRVQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYN
         C+   + EA ++   +  KGL+ D VT+  L+ G C+    +    V + MI+ G  P+    + +I+GL + G+++ A  LL+++ K+G+ PN+F YN
Subjt:  LCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCRVQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYN

Query:  SVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILIDGFGRSAKLDVA
        S++N LCKSG +EEA  L       GL  + VTYT L+D + +S ++D A
Subjt:  SVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILIDGFGRSAKLDVA

AT1G05670.2 Pentatricopeptide repeat (PPR-like) superfamily protein | 1.1e-34 | 32
Query:  EMDGV-LVINLMREHGLLPEVRTLSALLNALARIRKFCQALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVDGCGLS--IVTYNVFIHG
        E+D V  +I +M+  GL P      +++  L RI K  +A E F   +  G+ PD+ +YT ++   C+  D   A +    +    ++  ++TY   I G
Subjt:  EMDGV-LVINLMREHGLLPEVRTLSALLNALARIRKFCQALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVDGCGLS--IVTYNVFIHG

Query:  LCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCRVQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYN
         C+   + EA ++   +  KGL+ D VT+  L+ G C+    +    V + MI+ G  P+    + +I+GL + G+++ A  LL+++ K+G+ PN+F YN
Subjt:  LCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCRVQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYN

Query:  SVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILIDGFGRSAKLDVA
        S++N LCKSG +EEA  L       GL  + VTYT L+D + +S ++D A
Subjt:  SVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILIDGFGRSAKLDVA

AT1G63130.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 6.3e-35 | 27.9
Query:  VLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDDSSSG-------FDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKF
        +L+++ +GK    +   N  I       +V   L    +  + G       ++ LI+      R  D   +++ M E  + P V T SAL++A  +  K 
Subjt:  VLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDDSSSG-------FDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKF

Query:  CQALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRV--DGCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLC
         +A +L+D  +   + PD + Y+ ++   C     D+AK + + +    C  ++VTYN  I G CK+KRV E +E+ R + ++GL  + VTY TL+ G  
Subjt:  CQALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRV--DGCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLC

Query:  RVQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTIL
        + +E +    V  +M+  G +P     S +++GL   G +E A  +   + +  + P+++ YN +I  +CK+GK+E+   LF  ++ +G+ PN VTYT +
Subjt:  RVQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTIL

Query:  IDGFGRSAKLDVASISSRK
        + GF R    + A    R+
Subjt:  IDGFGRSAKLDVASISSRK

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein | 1.3e-37 | 29.69
Query:  LEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDDSSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNAL
        +EEGD + +  + + EQ+V    SW     N  + G  K   VE  L                +++Q     D           G  P+  T + L+N L
Subjt:  LEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDDSSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNAL

Query:  ARIRKFCQALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRV--DGCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCT
         +      A+E+ D  +  G  PD Y Y  V+  LC+L +  +A E++ ++    C  + VTYN  I  LCK  +V EA E+ R+L  KG+  D+ T+ +
Subjt:  ARIRKFCQALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRV--DGCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCT

Query:  LVLGLCRVQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPND
        L+ GLC  +   V +E+  EM   G  P E   + +I+ L   G ++ A  +L ++   G   ++  YN++I+  CK+ K  EAE +F  M   G+  N 
Subjt:  LVLGLCRVQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPND

Query:  VTYTILIDGFGRSAKLDVAS
        VTY  LIDG  +S +++ A+
Subjt:  VTYTILIDGFGRSAKLDVAS

AT5G59900.1 Pentatricopeptide repeat (PPR) superfamily protein | 1.9e-87 | 47.14
Query:  DSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD------------------------------------------------------
        D QFV  +++IVRGK+SW+IA ++  +S  LK  HVE++LI T+DD                                                      
Subjt:  DSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLIQTLDD------------------------------------------------------

Query:  -----------------SSSGFDMLIQHYVQNKREMDGVLVINLM-REHGLLPEVRTLSALLNALARIRKFCQALELFDTFVNAGVKPDSYIYTVVVRCL
                         SSS FD+LIQHYV+++R +DGVLV  +M  +  LLPEVRTLSALL+ L + R F  A+ELF+  V+ G++PD YIYT V+R L
Subjt:  -----------------SSSGFDMLIQHYVQNKREMDGVLVINLM-REHGLLPEVRTLSALLNALARIRKFCQALELFDTFVNAGVKPDSYIYTVVVRCL

Query:  CELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCRVQEFEVGVEVTHEMIELGFVPSESAVSG
        CELKD  +AKE+I  ++  GC ++IV YNV I GLCK ++VWEAV IK+ L  K LK D+VTYCTLV GLC+VQEFE+G+E+  EM+ L F PSE+AVS 
Subjt:  CELKDFDKAKEIIKRVD--GCGLSIVTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCRVQEFEVGVEVTHEMIELGFVPSESAVSG

Query:  VIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILIDGFGRSAKLDVA
        ++EGLR+ G IE A  L+ +V   GV PNLFVYN++I+SLCK  K  EAE LF  M + GL PNDVTY+ILID F R  KLD A
Subjt:  VIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNSVINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILIDGFGRSAKLDVA


Sequences
CDS sequence
AAACGGCGTGCTTGGCCAACTCACCGGGAAGAACGAGCCTCACGGCCGTTGGATCTCACGGGAAGTTATAGTGGCTTCTCAAGATGTCGTTGATGAAGCTGTTCATGATG
CCCATGGCCTTGCTGGAAATTCCGATGTCCGGATGGACTTGCTTCAGCACCTTGAAGATGTAGATCTTGTACGTCTCCGTACTCTTCTTCGTCCTCTTCTTCTTCTTGTC
TCCGGCAGAGGCCCCGCCTTCTTTCGGGAGCTTCTTCCCTGCCTTGGGCTTCTTCTCCGCCGGGGCTTTCTCGGCTACGGTCTTCTTCTCCTCCGCTGGCTTGTCGGAGG
CCGGCTTTTTCTCGGCGGGTTTCTTCTCGGCCTTCGGTGCCATCTGAAATTGCCTTGCAAGTTAGCAGAAGGAGAGTCGGTTGACGCAAAAGAGCATCTCGATGAAGCTC
ATTCAGCTAGTCGATCGCTGAGAACTCCTAGTTTAGACAGAAGAAAATTCAGGAAGTATTGTACATGGAGAAGGAACCTCGAAGAGGGCGACGAAAATGATTCGCAGTTC
GTTTATGTACTTGAGCAAATTGTGCGAGGAAAGCAAAGCTGGAAGATTGCCTTCAACAACGCATTCATTTCAGGCACTTTAAAGCCCCATCACGTAGAAAAGGTTTTGAT
CCAAACTCTTGACGACTCCAGCTCAGGTTTTGATATGTTGATTCAGCATTACGTGCAGAACAAGAGAGAAATGGATGGTGTTCTGGTCATAAATCTCATGAGGGAGCACG
GGCTGTTGCCTGAAGTTAGAACTTTGAGTGCTTTGTTAAATGCTCTCGCGCGAATCAGGAAATTCTGCCAAGCCTTGGAACTCTTTGATACCTTTGTGAATGCAGGTGTT
AAGCCCGACAGTTATATCTACACGGTGGTTGTTCGGTGCTTGTGTGAATTGAAGGACTTTGACAAGGCCAAGGAAATAATTAAACGTGTCGATGGATGTGGTTTGAGTAT
TGTAACATATAATGTGTTTATCCATGGGCTCTGCAAGAGCAAGAGAGTTTGGGAGGCTGTTGAGATCAAGAGATTGCTGGGTGAAAAGGGTTTGAAAGCAGATTTGGTTA
CATATTGTACGTTAGTATTGGGATTGTGCAGAGTACAGGAATTTGAGGTTGGTGTGGAGGTGACGCATGAAATGATTGAGCTGGGTTTTGTTCCAAGCGAATCTGCTGTT
TCAGGAGTCATAGAGGGGTTGAGGAGAATGGGGAATATTGAAGGTGCTTTTGGGTTGCTAAACAAGGTTGGGAAACTTGGGGTAGTGCCTAATCTATTTGTTTATAATTC
AGTGATCAATTCATTGTGCAAAAGTGGGAAATTGGAAGAAGCCGAGTCTCTTTTTAGTGTAATGACTGAAAGGGGTTTGTTTCCCAATGATGTCACATATACTATATTGA
TCGATGGATTTGGAAGAAGCGCCAAACTGGATGTTGCTTCTATTTCTTCAAGAAAATGA
mRNA sequence
AAACGGCGTGCTTGGCCAACTCACCGGGAAGAACGAGCCTCACGGCCGTTGGATCTCACGGGAAGTTATAGTGGCTTCTCAAGATGTCGTTGATGAAGCTGTTCATGATG
CCCATGGCCTTGCTGGAAATTCCGATGTCCGGATGGACTTGCTTCAGCACCTTGAAGATGTAGATCTTGTACGTCTCCGTACTCTTCTTCGTCCTCTTCTTCTTCTTGTC
TCCGGCAGAGGCCCCGCCTTCTTTCGGGAGCTTCTTCCCTGCCTTGGGCTTCTTCTCCGCCGGGGCTTTCTCGGCTACGGTCTTCTTCTCCTCCGCTGGCTTGTCGGAGG
CCGGCTTTTTCTCGGCGGGTTTCTTCTCGGCCTTCGGTGCCATCTGAAATTGCCTTGCAAGTTAGCAGAAGGAGAGTCGGTTGACGCAAAAGAGCATCTCGATGAAGCTC
ATTCAGCTAGTCGATCGCTGAGAACTCCTAGTTTAGACAGAAGAAAATTCAGGAAGTATTGTACATGGAGAAGGAACCTCGAAGAGGGCGACGAAAATGATTCGCAGTTC
GTTTATGTACTTGAGCAAATTGTGCGAGGAAAGCAAAGCTGGAAGATTGCCTTCAACAACGCATTCATTTCAGGCACTTTAAAGCCCCATCACGTAGAAAAGGTTTTGAT
CCAAACTCTTGACGACTCCAGCTCAGGTTTTGATATGTTGATTCAGCATTACGTGCAGAACAAGAGAGAAATGGATGGTGTTCTGGTCATAAATCTCATGAGGGAGCACG
GGCTGTTGCCTGAAGTTAGAACTTTGAGTGCTTTGTTAAATGCTCTCGCGCGAATCAGGAAATTCTGCCAAGCCTTGGAACTCTTTGATACCTTTGTGAATGCAGGTGTT
AAGCCCGACAGTTATATCTACACGGTGGTTGTTCGGTGCTTGTGTGAATTGAAGGACTTTGACAAGGCCAAGGAAATAATTAAACGTGTCGATGGATGTGGTTTGAGTAT
TGTAACATATAATGTGTTTATCCATGGGCTCTGCAAGAGCAAGAGAGTTTGGGAGGCTGTTGAGATCAAGAGATTGCTGGGTGAAAAGGGTTTGAAAGCAGATTTGGTTA
CATATTGTACGTTAGTATTGGGATTGTGCAGAGTACAGGAATTTGAGGTTGGTGTGGAGGTGACGCATGAAATGATTGAGCTGGGTTTTGTTCCAAGCGAATCTGCTGTT
TCAGGAGTCATAGAGGGGTTGAGGAGAATGGGGAATATTGAAGGTGCTTTTGGGTTGCTAAACAAGGTTGGGAAACTTGGGGTAGTGCCTAATCTATTTGTTTATAATTC
AGTGATCAATTCATTGTGCAAAAGTGGGAAATTGGAAGAAGCCGAGTCTCTTTTTAGTGTAATGACTGAAAGGGGTTTGTTTCCCAATGATGTCACATATACTATATTGA
TCGATGGATTTGGAAGAAGCGCCAAACTGGATGTTGCTTCTATTTCTTCAAGAAAATGA
Protein sequence
NGVLGQLTGKNEPHGRWISREVIVASQDVVDEAVHDAHGLAGNSDVRMDLLQHLEDVDLVRLRTLLRPLLLLVSGRGPAFFRELLPCLGLLLRRGFLGYGLLLLRWLVGG
RLFLGGFLLGLRCHLKLPCKLAEGESVDAKEHLDEAHSASRSLRTPSLDRRKFRKYCTWRRNLEEGDENDSQFVYVLEQIVRGKQSWKIAFNNAFISGTLKPHHVEKVLI
QTLDDSSSGFDMLIQHYVQNKREMDGVLVINLMREHGLLPEVRTLSALLNALARIRKFCQALELFDTFVNAGVKPDSYIYTVVVRCLCELKDFDKAKEIIKRVDGCGLSI
VTYNVFIHGLCKSKRVWEAVEIKRLLGEKGLKADLVTYCTLVLGLCRVQEFEVGVEVTHEMIELGFVPSESAVSGVIEGLRRMGNIEGAFGLLNKVGKLGVVPNLFVYNS
VINSLCKSGKLEEAESLFSVMTERGLFPNDVTYTILIDGFGRSAKLDVASISSRK
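
The first residues of the protein sequence above (NGVLGQ) match a translation of the listed CDS that starts at its second nucleotide (AAC GGC GTG CTT GGC CAA) under the standard genetic code. A minimal sketch of that check in Python follows. The names translate and offset are hypothetical, and the use of the standard code (NCBI translation table 1) is an assumption, since the page does not say which table was used.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, offset=0, to_stop=True):
    """Translate DNA starting at a 0-based offset; unknown codons become 'X'."""
    seq = dna.upper()
    protein = []
    for i in range(offset, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop at the first in-frame stop codon
        protein.append(aa)
    return "".join(protein)


# First 19 nucleotides of the CDS listed above, read from the second base.
print(translate("AAACGGCGTGCTTGGCCAA", offset=1))
# NGVLGQ
```

The same call applied to the full CDS string with offset=1 can be compared against the listed protein sequence to confirm the reading frame over its whole length.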