; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC11G220190 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC11G220190
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCmU531Chr11:28591082..28595109
RNA-Seq ExpressionCmUC11G220190
SyntenyCmUC11G220190
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0009507 - chloroplast (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0063347.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]6.6e-28088.09Show/hide
Query:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD
        G +  +K L KRKKLEVFKDAADEA+QK WRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSE                    
Subjt:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD

Query:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
            ILEWLRTQSWWNFSEMDF+MLITAYGKLGDFNRAEKVL+LMNKKGY PNV+SHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
Subjt:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV

Query:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL
        EGSKFKEAEELFDSLLNKE+ VLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGL+PDVVSYALL
Subjt:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL

Query:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRD-----------RCNPDICSYTTMLSAFVNASDMEGAENFFRR
        ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISG+VEQAK VFKSM+RD           RCNPDICSYTTMLSA+VNASDMEGAE FFRR
Subjt:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRD-----------RCNPDICSYTTMLSAFVNASDMEGAENFFRR

Query:  LKQDGFRPNVVTYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEAN
        LKQDGFRPNVVTYGTLIKGYAKINNLEKMIK+YEEMKVNGIRVNQTILTTIMDAYGKNKDF SAVIWFKEI SCGLRPDQKAKNILLSLAKT EEL+EAN
Subjt:  LKQDGFRPNVVTYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEAN

Query:  QLVGYSSQSSNPQRAGKFSRSVAEDEEEEDDELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAKI
        QLVGYSSQSSNPQR GKFSRS+A+DEEEEDDELDY DDVI HTNQR EKIILNGIHQ+NLEQNLEGLCAKI
Subjt:  QLVGYSSQSSNPQRAGKFSRSVAEDEEEEDDELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAKI

XP_004149531.1 pentatricopeptide repeat-containing protein At3g59040 isoform X1 [Cucumis sativus]8.9e-28590.54Show/hide
Query:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD
        G +  +K L KRKKLEVFKD ADEA+QKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSE                    
Subjt:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD

Query:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
            ILEWLRTQSWWNFSEMDF+MLITAYGKLGDFNRAEKVL+LMNKKGY PNV+SHTALMEAYGRG RYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
Subjt:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV

Query:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL
        EGSKFKEAEELFDSLLNKE+ VLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL
Subjt:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL

Query:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVV
        ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSM+RDRC+PDICSYTTMLSA+VNASDMEGAENFFRRLKQDGFRPNVV
Subjt:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVV

Query:  TYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSN
        TYGTLIKGYAKINNLEKMIK+YEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWF EI SCGLRPDQKAKNILLSLAKT EELDEANQLVGYSSQSS+
Subjt:  TYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSN

Query:  PQRAGKFSRSVAEDEEEEDDELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAKI
        PQR GKFSRS+A+DEEEE+DELDYADDVIPHTNQR EKIILNGIHQQNLEQNLEGLCAKI
Subjt:  PQRAGKFSRSVAEDEEEEDDELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAKI

XP_008464650.1 PREDICTED: pentatricopeptide repeat-containing protein At3g59040 isoform X1 [Cucumis melo]1.9e-28289.82Show/hide
Query:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD
        G +  +K L KRKKLEVFKDAADEA+QK WRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSE                    
Subjt:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD

Query:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
            ILEWLRTQSWWNFSEMDF+MLITAYGKLGDFNRAEKVL+LMNKKGY PNV+SHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
Subjt:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV

Query:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL
        EGSKFKEAEELFDSLLNKE+ VLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGL+PDVVSYALL
Subjt:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL

Query:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVV
        ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISG+VEQAK VFKSM+RDRCNPDICSYTTMLSA+VNASDMEGAE FFRRLKQDGFRPNVV
Subjt:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVV

Query:  TYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSN
        TYGTLIKGYAKINNLEKMIK+YEEMKVNGIRVNQTILTTIMDAYGKNKDF SAVIWFKEI SCGLRPDQKAKNILLSLAKT EEL+EANQLVGYSSQSSN
Subjt:  TYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSN

Query:  PQRAGKFSRSVAEDEEEEDDELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAKI
        PQR GKFSRS+A+DEEEEDDELDY DDVI HTNQR EKIILNGIHQ+NLEQNLEGLCAKI
Subjt:  PQRAGKFSRSVAEDEEEEDDELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAKI

XP_022139934.1 pentatricopeptide repeat-containing protein At3g59040 [Momordica charantia]5.6e-27187.52Show/hide
Query:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD
        G +  +K L KRKKLEVFKDAADEA+QKNWRRLM EIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLV E                    
Subjt:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD

Query:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
            ILEWLRTQSWW+FSEMDFLMLITAYGKLGDFNRAEKVL+LMNKKGY PNV+SHTALMEAYGRG RYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
Subjt:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV

Query:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL
        EG KFKEAEELFDSLLN ERAVLKPDQKMFHM+IYMFKKAGNYEKARKVFAEMAAR VPQTTVTYNSLMSFETNYKEVSKIYDQMQRAG++PDVVSYALL
Subjt:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL

Query:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVV
        ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIV KSMRRDRC+PDICSYTTMLSA+VNASDMEGAENFFRRLKQDGFRPNVV
Subjt:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVV

Query:  TYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSN
        TYGTLIKGYAKINNLEKMI KYEEMKVNGIR NQTILTTIMDAYGKNKDFGSAVIWFKEI SCGL PDQKAKNILLSLAKT EELDEANQLVGY +QSSN
Subjt:  TYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSN

Query:  PQRAGKFSRSVAEDEEEEDD--ELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAK
        P+RA KFSRSV E+EEE+DD  ELDYAD+VIPH NQR EKIILN IHQ    QNLEGLCAK
Subjt:  PQRAGKFSRSVAEDEEEEDD--ELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAK

XP_038899508.1 pentatricopeptide repeat-containing protein At3g59040 [Benincasa hispida]2.1e-28690.89Show/hide
Query:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD
        G +  +K L KRKKLEVFKDAADEA+QKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSE                    
Subjt:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD

Query:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
            ILEWLRTQSWWNFSEMDF+MLITAYGKLGDFNRAEKVL+LMNKKGYPPNV+SHTALMEAYGRGGRY+NAEAIFRRMQSGGPEPSALTYQIMLKTFV
Subjt:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV

Query:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL
        EGSKFKEAEELF+SLLNKE+ VLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL
Subjt:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL

Query:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVV
        ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGM+EQAKIVFKSMRRDRC+PDICSYTTMLSA++NASDM+GAENFFRRLKQDGFRPNVV
Subjt:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVV

Query:  TYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSN
        TYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEI SCGLRPDQKAKNILLSLAKT EELDEANQLVGYSSQSSN
Subjt:  TYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSN

Query:  PQRAGKFSRSVAEDEEEEDDELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAKI
        PQRAGKF  SV EDEEE+DDELDYADDVIPHTNQR EKIILNGIHQQNLEQNLEGLCAKI
Subjt:  PQRAGKFSRSVAEDEEEEDDELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAKI

TrEMBL top hitse value%identityAlignment
A0A0A0L2Y5 Uncharacterized protein4.3e-28590.54Show/hide
Query:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD
        G +  +K L KRKKLEVFKD ADEA+QKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSE                    
Subjt:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD

Query:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
            ILEWLRTQSWWNFSEMDF+MLITAYGKLGDFNRAEKVL+LMNKKGY PNV+SHTALMEAYGRG RYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
Subjt:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV

Query:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL
        EGSKFKEAEELFDSLLNKE+ VLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL
Subjt:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL

Query:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVV
        ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSM+RDRC+PDICSYTTMLSA+VNASDMEGAENFFRRLKQDGFRPNVV
Subjt:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVV

Query:  TYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSN
        TYGTLIKGYAKINNLEKMIK+YEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWF EI SCGLRPDQKAKNILLSLAKT EELDEANQLVGYSSQSS+
Subjt:  TYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSN

Query:  PQRAGKFSRSVAEDEEEEDDELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAKI
        PQR GKFSRS+A+DEEEE+DELDYADDVIPHTNQR EKIILNGIHQQNLEQNLEGLCAKI
Subjt:  PQRAGKFSRSVAEDEEEEDDELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAKI

A0A1S3CNI8 pentatricopeptide repeat-containing protein At3g59040 isoform X19.0e-28389.82Show/hide
Query:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD
        G +  +K L KRKKLEVFKDAADEA+QK WRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSE                    
Subjt:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD

Query:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
            ILEWLRTQSWWNFSEMDF+MLITAYGKLGDFNRAEKVL+LMNKKGY PNV+SHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
Subjt:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV

Query:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL
        EGSKFKEAEELFDSLLNKE+ VLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGL+PDVVSYALL
Subjt:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL

Query:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVV
        ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISG+VEQAK VFKSM+RDRCNPDICSYTTMLSA+VNASDMEGAE FFRRLKQDGFRPNVV
Subjt:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVV

Query:  TYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSN
        TYGTLIKGYAKINNLEKMIK+YEEMKVNGIRVNQTILTTIMDAYGKNKDF SAVIWFKEI SCGLRPDQKAKNILLSLAKT EEL+EANQLVGYSSQSSN
Subjt:  TYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSN

Query:  PQRAGKFSRSVAEDEEEEDDELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAKI
        PQR GKFSRS+A+DEEEEDDELDY DDVI HTNQR EKIILNGIHQ+NLEQNLEGLCAKI
Subjt:  PQRAGKFSRSVAEDEEEEDDELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAKI

A0A5D3DW60 Pentatricopeptide repeat-containing protein3.2e-28088.09Show/hide
Query:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD
        G +  +K L KRKKLEVFKDAADEA+QK WRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSE                    
Subjt:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD

Query:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
            ILEWLRTQSWWNFSEMDF+MLITAYGKLGDFNRAEKVL+LMNKKGY PNV+SHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
Subjt:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV

Query:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL
        EGSKFKEAEELFDSLLNKE+ VLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGL+PDVVSYALL
Subjt:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL

Query:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRD-----------RCNPDICSYTTMLSAFVNASDMEGAENFFRR
        ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISG+VEQAK VFKSM+RD           RCNPDICSYTTMLSA+VNASDMEGAE FFRR
Subjt:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRD-----------RCNPDICSYTTMLSAFVNASDMEGAENFFRR

Query:  LKQDGFRPNVVTYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEAN
        LKQDGFRPNVVTYGTLIKGYAKINNLEKMIK+YEEMKVNGIRVNQTILTTIMDAYGKNKDF SAVIWFKEI SCGLRPDQKAKNILLSLAKT EEL+EAN
Subjt:  LKQDGFRPNVVTYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEAN

Query:  QLVGYSSQSSNPQRAGKFSRSVAEDEEEEDDELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAKI
        QLVGYSSQSSNPQR GKFSRS+A+DEEEEDDELDY DDVI HTNQR EKIILNGIHQ+NLEQNLEGLCAKI
Subjt:  QLVGYSSQSSNPQRAGKFSRSVAEDEEEEDDELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAKI

A0A6J1CEC4 pentatricopeptide repeat-containing protein At3g590402.7e-27187.52Show/hide
Query:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD
        G +  +K L KRKKLEVFKDAADEA+QKNWRRLM EIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLV E                    
Subjt:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD

Query:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
            ILEWLRTQSWW+FSEMDFLMLITAYGKLGDFNRAEKVL+LMNKKGY PNV+SHTALMEAYGRG RYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
Subjt:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV

Query:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL
        EG KFKEAEELFDSLLN ERAVLKPDQKMFHM+IYMFKKAGNYEKARKVFAEMAAR VPQTTVTYNSLMSFETNYKEVSKIYDQMQRAG++PDVVSYALL
Subjt:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL

Query:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVV
        ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIV KSMRRDRC+PDICSYTTMLSA+VNASDMEGAENFFRRLKQDGFRPNVV
Subjt:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVV

Query:  TYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSN
        TYGTLIKGYAKINNLEKMI KYEEMKVNGIR NQTILTTIMDAYGKNKDFGSAVIWFKEI SCGL PDQKAKNILLSLAKT EELDEANQLVGY +QSSN
Subjt:  TYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSN

Query:  PQRAGKFSRSVAEDEEEEDD--ELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAK
        P+RA KFSRSV E+EEE+DD  ELDYAD+VIPH NQR EKIILN IHQ    QNLEGLCAK
Subjt:  PQRAGKFSRSVAEDEEEEDD--ELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAK

A0A6J1FJ05 pentatricopeptide repeat-containing protein At3g590403.4e-26685.36Show/hide
Query:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD
        G +  +K L KRKKLEVFKDAADEA+QKNWR+LMNEIEETGSAVSVL+S RIKNEAI KDLVLGTLVRFKQLKKWN+VSE                    
Subjt:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQLD

Query:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
            ILEWLRTQSWWNFSEMD+LMLITAYGKLG+FNRAEKVL+LMNKKGY PNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
Subjt:  YFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV

Query:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL
        EG KFKEAEELFDSLLNKE  VLKPDQKMFHM+IYMFKKAGNYEKARK+F+EMAARGVPQ+T+TYNSLMSFETNYKEVS+IYDQMQRAGLQPDVVSYALL
Subjt:  EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALL

Query:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVV
        I AYGKARREEEALAVFEEMLDAG+RPT KAYNILLDAFAISGMVEQAKIVFKSMRRDRC+PDICSYTTMLSA+VNASDMEGAE FFR+LKQDGFRPNVV
Subjt:  ISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVV

Query:  TYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSN
        TYGTLIKGYAKINNL+KM+KKYEEMK+NGIRVNQ ILTTIMDA+GKNKDFGSAVIWFKEI SCG+RPDQKAKNILLSLA T EELDEANQLVGY +QSS+
Subjt:  TYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSN

Query:  PQRAGKFSRSVAEDEEEEDDELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAKI
         +RAGKFSRS+ EDE+EED+ELDYADDVI H NQR EKIILNGIHQ    QNLEGLCAKI
Subjt:  PQRAGKFSRSVAEDEEEEDDELDYADDVIPHTNQRGEKIILNGIHQQNLEQNLEGLCAKI

SwissProt top hitse value%identityAlignment
O64624 Pentatricopeptide repeat-containing protein At2g18940, chloroplastic1.5e-3725.05Show/hide
Query:  AADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRF-----KQLKKWNLVSEVQSIDICVLNYLAAQFGQLDYFISILEWLRTQSWW
        + D +  K    +  E E  GS   + + E +   +I +  + G L RF      +L + +LVS V+ +D           G  +  + + EWL   S  
Subjt:  AADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRF-----KQLKKWNLVSEVQSIDICVLNYLAAQFGQLDYFISILEWLRTQSWW

Query:  NFSEMDFLML---ITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTF-------------
           ++D  ++   +   G+   ++ A K+L  +  + Y  +V ++T ++ AY R G+Y  A  +F RM+  GP P+ +TY ++L  F             
Subjt:  NFSEMDFLML---ITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTF-------------

Query:  -----VEGSKF------------------KEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETN--
              +G KF                  +EA+E F  L   +    +P    ++ ++ +F KAG Y +A  V  EM     P  +VTYN L++      
Subjt:  -----VEGSKF------------------KEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETN--

Query:  -YKEVSKIYDQMQRAGLQPDVVSYALLISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSA
          KE + + + M + G+ P+ ++Y  +I AYGKA +E+EAL +F  M +AG  P    YN +L          +   +   M+ + C+P+  ++ TML+ 
Subjt:  -YKEVSKIYDQMQRAGLQPDVVSYALLISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSA

Query:  FVNASDMEGAENFFRRLKQDGFRPNVVTYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKN
          N    +     FR +K  GF P+  T+ TLI  Y +  +     K Y EM   G     T    +++A  +  D+ S      ++ S G +P + + +
Subjt:  FVNASDMEGAENFFRRLKQDGFRPNVVTYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKN

Query:  ILL
        ++L
Subjt:  ILL

Q8L844 Pentatricopeptide repeat-containing protein At5g42310, chloroplastic6.9e-3825.99Show/hide
Query:  LITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFVEGSKFKEAEELFDSLLNKERAVLK
        +I  + K GD ++A ++L +    G      +  +++ A    GR   AEA+F  ++  G +P    Y  +LK +V+    K+AE +   +   E+  + 
Subjt:  LITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFVEGSKFKEAEELFDSLLNKERAVLK

Query:  PDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMS---FETNYKEVSKIYDQMQRAGLQPDVVSYALLISAYGKARREEEALAVFEEML
        PD+  + ++I  +  AG +E AR V  EM A  V   +  ++ L++       +++  ++  +M+  G++PD   Y ++I  +GK    + A+  F+ ML
Subjt:  PDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMS---FETNYKEVSKIYDQMQRAGLQPDVVSYALLISAYGKARREEEALAVFEEML

Query:  DAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVVTYGTLIKGYAKINNLEKMIKK
          GI P    +N L+D     G    A+ +F++M R  C P   +Y  M++++ +    +  +    ++K  G  PNVVT+ TL+  Y K       I+ 
Subjt:  DAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVVTYGTLIKGYAKINNLEKMIKK

Query:  YEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQS
         EEMK  G++ + T+   +++AY +      AV  F+ + S GL+P   A N L++         EA  ++ Y  ++
Subjt:  YEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQS

Q9LYT2 Pentatricopeptide repeat-containing protein At3g590405.2e-20367.1Show/hide
Query:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIK-NEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQL
        G +  +K L KR+K+EVFKDAADE DQK WR LM EIE TGSAV VLR  +   ++ +P+DLVLGTLVRFKQLKKWNLVSE                   
Subjt:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIK-NEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQL

Query:  DYFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTF
             ILEWLR Q+WWNFSE+DFLMLITAYGKLG+FN AE+VLS+++K G  PNVIS+TALME+YGRGG+ NNAEAIFRRMQS GPEPSA+TYQI+LKTF
Subjt:  DYFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTF

Query:  VEGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYAL
        VEG KFKEAEE+F++LL+++++ LKPDQKM+HM+IYM+KKAGNYEKARKVF+ M  +GVPQ+TVTYNSLMSFET+YKEVSKIYDQMQR+ +QPDVVSYAL
Subjt:  VEGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYAL

Query:  LISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNV
        LI AYG+ARREEEAL+VFEEMLDAG+RPTHKAYNILLDAFAISGMVEQAK VFKSMRRDR  PD+ SYTTMLSA+VNASDMEGAE FF+R+K DGF PN+
Subjt:  LISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNV

Query:  VTYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSS
        VTYGTLIKGYAK N++EKM++ YE+M+++GI+ NQTILTTIMDA G+ K+FGSA+ W+KE+ SCG+ PDQKAKN+LLSLA T +EL+EA +L G  ++++
Subjt:  VTYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSS

Query:  -----------NPQRAGKFSRSVAEDEEEEDDELDYADDVI
                   + +     S   ++DE+E DD+ D A + +
Subjt:  -----------NPQRAGKFSRSVAEDEEEEDDELDYADDVI

Q9LYZ9 Pentatricopeptide repeat-containing protein At5g028604.3e-4027.37Show/hide
Query:  MLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFVEGSKFKEAEELFDSLLNKERA-V
        ++I+  GK G  + A  + + + + G+  +V S+T+L+ A+   GRY  A  +F++M+  G +P+ +TY ++L  F    K         SL+ K ++  
Subjt:  MLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFVEGSKFKEAEELFDSLLNKERA-V

Query:  LKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSF---ETNYKEVSKIYDQMQRAGLQPDVVSYALLISAYGKARREEEALAVFEE
        + PD   ++ +I   K+   +++A +VF EM A G     VTYN+L+         KE  K+ ++M   G  P +V+Y  LISAY +    +EA+ +  +
Subjt:  LKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSF---ETNYKEVSKIYDQMQRAGLQPDVVSYALLISAYGKARREEEALAVFEE

Query:  MLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVVTYGTLIKGYAKINNLEKMI
        M + G +P    Y  LL  F  +G VE A  +F+ MR   C P+IC++   +  + N          F  +   G  P++VT+ TL+  + +     ++ 
Subjt:  MLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVVTYGTLIKGYAKINNLEKMI

Query:  KKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLS
          ++EMK  G    +    T++ AY +   F  A+  ++ +   G+ PD    N +L+
Subjt:  KKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLS

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic8.2e-3926.76Show/hide
Query:  LITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFVEGS-KFKEAEELFDSLLNKERAVL
        +I+  G+ G    A+++       GY   V + +AL+ AYGR G +  A ++F  M+  G  P+ +TY  ++    +G  +FK+  + FD +   +R  +
Subjt:  LITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFVEGS-KFKEAEELFDSLLNKERAVL

Query:  KPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVS---KIYDQMQRAGLQPDVVSYALLISAYGKARREEEALAVFEEM
        +PD+  F+ ++ +  + G +E AR +F EM  R + Q   +YN+L+       ++    +I  QM    + P+VVSY+ +I  + KA R +EAL +F EM
Subjt:  KPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVS---KIYDQMQRAGLQPDVVSYALLISAYGKARREEEALAVFEEM

Query:  LDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVVTYGTLIKGYAKINNLEKMIK
           GI     +YN LL  +   G  E+A  + + M       D+ +Y  +L  +      +  +  F  +K++   PN++TY TLI GY+K    ++ ++
Subjt:  LDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVVTYGTLIKGYAKINNLEKMIK

Query:  KYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSNPQRAGKFSRSVAEDEEEEDD
         + E K  G+R +  + + ++DA  KN   GSAV    E+   G+ P+    N ++     +  +D +     YS+  S P  +   S   A  E E + 
Subjt:  KYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSNPQRAGKFSRSVAEDEEEEDD

Query:  ELDYADDVIPHTNQRGEKIILNGIHQ
         +     +   +N R  K    G+ +
Subjt:  ELDYADDVIPHTNQRGEKIILNGIHQ

Arabidopsis top hitse value%identityAlignment
AT2G31400.1 genomes uncoupled 15.8e-4026.76Show/hide
Query:  LITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFVEGS-KFKEAEELFDSLLNKERAVL
        +I+  G+ G    A+++       GY   V + +AL+ AYGR G +  A ++F  M+  G  P+ +TY  ++    +G  +FK+  + FD +   +R  +
Subjt:  LITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFVEGS-KFKEAEELFDSLLNKERAVL

Query:  KPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVS---KIYDQMQRAGLQPDVVSYALLISAYGKARREEEALAVFEEM
        +PD+  F+ ++ +  + G +E AR +F EM  R + Q   +YN+L+       ++    +I  QM    + P+VVSY+ +I  + KA R +EAL +F EM
Subjt:  KPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVS---KIYDQMQRAGLQPDVVSYALLISAYGKARREEEALAVFEEM

Query:  LDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVVTYGTLIKGYAKINNLEKMIK
           GI     +YN LL  +   G  E+A  + + M       D+ +Y  +L  +      +  +  F  +K++   PN++TY TLI GY+K    ++ ++
Subjt:  LDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVVTYGTLIKGYAKINNLEKMIK

Query:  KYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSNPQRAGKFSRSVAEDEEEEDD
         + E K  G+R +  + + ++DA  KN   GSAV    E+   G+ P+    N ++     +  +D +     YS+  S P  +   S   A  E E + 
Subjt:  KYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSNPQRAGKFSRSVAEDEEEEDD

Query:  ELDYADDVIPHTNQRGEKIILNGIHQ
         +     +   +N R  K    G+ +
Subjt:  ELDYADDVIPHTNQRGEKIILNGIHQ

AT3G59040.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.7e-20467.1Show/hide
Query:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIK-NEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQL
        G +  +K L KR+K+EVFKDAADE DQK WR LM EIE TGSAV VLR  +   ++ +P+DLVLGTLVRFKQLKKWNLVSE                   
Subjt:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIK-NEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQL

Query:  DYFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTF
             ILEWLR Q+WWNFSE+DFLMLITAYGKLG+FN AE+VLS+++K G  PNVIS+TALME+YGRGG+ NNAEAIFRRMQS GPEPSA+TYQI+LKTF
Subjt:  DYFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTF

Query:  VEGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYAL
        VEG KFKEAEE+F++LL+++++ LKPDQKM+HM+IYM+KKAGNYEKARKVF+ M  +GVPQ+TVTYNSLMSFET+YKEVSKIYDQMQR+ +QPDVVSYAL
Subjt:  VEGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYAL

Query:  LISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNV
        LI AYG+ARREEEAL+VFEEMLDAG+RPTHKAYNILLDAFAISGMVEQAK VFKSMRRDR  PD+ SYTTMLSA+VNASDMEGAE FF+R+K DGF PN+
Subjt:  LISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNV

Query:  VTYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSS
        VTYGTLIKGYAK N++EKM++ YE+M+++GI+ NQTILTTIMDA G+ K+FGSA+ W+KE+ SCG+ PDQKAKN+LLSLA T +EL+EA +L G  ++++
Subjt:  VTYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSS

Query:  -----------NPQRAGKFSRSVAEDEEEEDDELDYADDVI
                   + +     S   ++DE+E DD+ D A + +
Subjt:  -----------NPQRAGKFSRSVAEDEEEEDDELDYADDVI

AT3G59040.2 Tetratricopeptide repeat (TPR)-like superfamily protein3.7e-20467.1Show/hide
Query:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIK-NEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQL
        G +  +K L KR+K+EVFKDAADE DQK WR LM EIE TGSAV VLR  +   ++ +P+DLVLGTLVRFKQLKKWNLVSE                   
Subjt:  GYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIK-NEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLNYLAAQFGQL

Query:  DYFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTF
             ILEWLR Q+WWNFSE+DFLMLITAYGKLG+FN AE+VLS+++K G  PNVIS+TALME+YGRGG+ NNAEAIFRRMQS GPEPSA+TYQI+LKTF
Subjt:  DYFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTF

Query:  VEGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYAL
        VEG KFKEAEE+F++LL+++++ LKPDQKM+HM+IYM+KKAGNYEKARKVF+ M  +GVPQ+TVTYNSLMSFET+YKEVSKIYDQMQR+ +QPDVVSYAL
Subjt:  VEGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYAL

Query:  LISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNV
        LI AYG+ARREEEAL+VFEEMLDAG+RPTHKAYNILLDAFAISGMVEQAK VFKSMRRDR  PD+ SYTTMLSA+VNASDMEGAE FF+R+K DGF PN+
Subjt:  LISAYGKARREEEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNV

Query:  VTYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSS
        VTYGTLIKGYAK N++EKM++ YE+M+++GI+ NQTILTTIMDA G+ K+FGSA+ W+KE+ SCG+ PDQKAKN+LLSLA T +EL+EA +L G  ++++
Subjt:  VTYGTLIKGYAKINNLEKMIKKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSS

Query:  -----------NPQRAGKFSRSVAEDEEEEDDELDYADDVI
                   + +     S   ++DE+E DD+ D A + +
Subjt:  -----------NPQRAGKFSRSVAEDEEEEDDELDYADDVI

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein3.1e-4127.37Show/hide
Query:  MLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFVEGSKFKEAEELFDSLLNKERA-V
        ++I+  GK G  + A  + + + + G+  +V S+T+L+ A+   GRY  A  +F++M+  G +P+ +TY ++L  F    K         SL+ K ++  
Subjt:  MLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFVEGSKFKEAEELFDSLLNKERA-V

Query:  LKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSF---ETNYKEVSKIYDQMQRAGLQPDVVSYALLISAYGKARREEEALAVFEE
        + PD   ++ +I   K+   +++A +VF EM A G     VTYN+L+         KE  K+ ++M   G  P +V+Y  LISAY +    +EA+ +  +
Subjt:  LKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSF---ETNYKEVSKIYDQMQRAGLQPDVVSYALLISAYGKARREEEALAVFEE

Query:  MLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVVTYGTLIKGYAKINNLEKMI
        M + G +P    Y  LL  F  +G VE A  +F+ MR   C P+IC++   +  + N          F  +   G  P++VT+ TL+  + +     ++ 
Subjt:  MLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVVTYGTLIKGYAKINNLEKMI

Query:  KKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLS
          ++EMK  G    +    T++ AY +   F  A+  ++ +   G+ PD    N +L+
Subjt:  KKYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLS

AT5G42310.1 Pentatricopeptide repeat (PPR-like) superfamily protein4.9e-3925.99Show/hide
Query:  LITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFVEGSKFKEAEELFDSLLNKERAVLK
        +I  + K GD ++A ++L +    G      +  +++ A    GR   AEA+F  ++  G +P    Y  +LK +V+    K+AE +   +   E+  + 
Subjt:  LITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFVEGSKFKEAEELFDSLLNKERAVLK

Query:  PDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMS---FETNYKEVSKIYDQMQRAGLQPDVVSYALLISAYGKARREEEALAVFEEML
        PD+  + ++I  +  AG +E AR V  EM A  V   +  ++ L++       +++  ++  +M+  G++PD   Y ++I  +GK    + A+  F+ ML
Subjt:  PDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMS---FETNYKEVSKIYDQMQRAGLQPDVVSYALLISAYGKARREEEALAVFEEML

Query:  DAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVVTYGTLIKGYAKINNLEKMIKK
          GI P    +N L+D     G    A+ +F++M R  C P   +Y  M++++ +    +  +    ++K  G  PNVVT+ TL+  Y K       I+ 
Subjt:  DAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVVTYGTLIKGYAKINNLEKMIKK

Query:  YEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQS
         EEMK  G++ + T+   +++AY +      AV  F+ + S GL+P   A N L++         EA  ++ Y  ++
Subjt:  YEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTACACTAATTTTGATCTACTGTTATGTTTATTCGCTGATTGGAGGTTACCTGTATGGGTATGTTGACACCAAGAAAGTTCTTACAAAAAGGAAGAAGTTGGAAGT
ATTTAAGGATGCAGCTGACGAGGCAGATCAGAAGAATTGGAGGAGACTGATGAATGAAATAGAGGAGACAGGCTCTGCTGTTTCTGTGCTCAGAAGCGAAAGGATTAAAA
ATGAGGCTATTCCGAAGGACTTGGTGTTGGGCACCTTGGTTAGATTTAAACAGCTAAAGAAATGGAACCTAGTCAGTGAGGTACAGAGCATAGACATCTGTGTGTTGAAT
TACCTAGCCGCTCAATTTGGTCAGTTGGATTACTTTATTTCAATTCTTGAATGGCTCCGGACCCAGAGCTGGTGGAACTTTAGTGAAATGGATTTCTTGATGCTTATTAC
AGCTTATGGCAAGCTAGGGGACTTCAATCGTGCAGAAAAAGTTCTAAGCTTGATGAATAAGAAGGGTTATCCCCCAAATGTGATCTCTCATACTGCTCTGATGGAAGCAT
ATGGAAGAGGTGGCAGGTATAATAATGCCGAAGCAATCTTTAGAAGGATGCAATCTGGTGGCCCTGAACCTTCTGCGTTGACATATCAAATAATGCTAAAAACTTTTGTT
GAGGGATCTAAATTCAAGGAAGCTGAAGAACTCTTCGACTCCCTCTTGAACAAGGAAAGGGCAGTGTTGAAGCCAGACCAAAAGATGTTCCACATGATAATTTACATGTT
CAAGAAGGCAGGGAATTATGAAAAGGCTCGTAAAGTATTTGCGGAAATGGCAGCACGAGGAGTTCCACAGACTACAGTTACTTATAATAGTTTGATGTCTTTTGAAACTA
ACTACAAGGAGGTCTCGAAGATTTATGATCAGATGCAAAGAGCTGGACTTCAGCCAGATGTTGTTAGCTATGCTCTACTCATAAGTGCTTATGGTAAAGCTAGAAGGGAA
GAAGAAGCACTAGCAGTTTTTGAGGAAATGCTTGACGCTGGTATTAGACCAACTCACAAAGCTTATAATATTTTGCTTGATGCATTTGCAATCTCTGGAATGGTCGAGCA
AGCTAAGATCGTGTTCAAGAGTATGAGAAGGGACAGATGCAATCCAGATATTTGCTCTTATACAACCATGTTATCAGCTTTTGTTAATGCATCTGACATGGAGGGAGCTG
AAAATTTTTTCAGACGACTGAAACAAGATGGTTTTAGACCCAACGTTGTTACTTATGGTACATTGATCAAAGGATATGCTAAAATAAATAATCTTGAAAAGATGATTAAA
AAATATGAAGAAATGAAGGTCAATGGTATTCGAGTGAATCAGACGATTTTGACGACAATCATGGATGCATATGGAAAGAACAAGGATTTTGGTAGTGCTGTTATTTGGTT
CAAGGAAATTGGATCTTGTGGCCTTCGGCCTGATCAAAAAGCGAAAAATATCCTGTTATCTTTGGCAAAAACAACGGAAGAGCTCGATGAAGCGAATCAACTCGTAGGAT
ATTCAAGTCAGAGTAGCAATCCTCAAAGAGCTGGCAAGTTTTCCAGGTCTGTTGCTGAGGATGAGGAAGAAGAAGATGATGAATTAGATTATGCAGATGATGTAATCCCT
CACACTAACCAAAGAGGTGAGAAAATTATTTTAAATGGCATTCATCAACAAAACTTGGAGCAAAACTTGGAGGGGTTATGTGCTAAGATTTACTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTACACTAATTTTGATCTACTGTTATGTTTATTCGCTGATTGGAGGTTACCTGTATGGGTATGTTGACACCAAGAAAGTTCTTACAAAAAGGAAGAAGTTGGAAGT
ATTTAAGGATGCAGCTGACGAGGCAGATCAGAAGAATTGGAGGAGACTGATGAATGAAATAGAGGAGACAGGCTCTGCTGTTTCTGTGCTCAGAAGCGAAAGGATTAAAA
ATGAGGCTATTCCGAAGGACTTGGTGTTGGGCACCTTGGTTAGATTTAAACAGCTAAAGAAATGGAACCTAGTCAGTGAGGTACAGAGCATAGACATCTGTGTGTTGAAT
TACCTAGCCGCTCAATTTGGTCAGTTGGATTACTTTATTTCAATTCTTGAATGGCTCCGGACCCAGAGCTGGTGGAACTTTAGTGAAATGGATTTCTTGATGCTTATTAC
AGCTTATGGCAAGCTAGGGGACTTCAATCGTGCAGAAAAAGTTCTAAGCTTGATGAATAAGAAGGGTTATCCCCCAAATGTGATCTCTCATACTGCTCTGATGGAAGCAT
ATGGAAGAGGTGGCAGGTATAATAATGCCGAAGCAATCTTTAGAAGGATGCAATCTGGTGGCCCTGAACCTTCTGCGTTGACATATCAAATAATGCTAAAAACTTTTGTT
GAGGGATCTAAATTCAAGGAAGCTGAAGAACTCTTCGACTCCCTCTTGAACAAGGAAAGGGCAGTGTTGAAGCCAGACCAAAAGATGTTCCACATGATAATTTACATGTT
CAAGAAGGCAGGGAATTATGAAAAGGCTCGTAAAGTATTTGCGGAAATGGCAGCACGAGGAGTTCCACAGACTACAGTTACTTATAATAGTTTGATGTCTTTTGAAACTA
ACTACAAGGAGGTCTCGAAGATTTATGATCAGATGCAAAGAGCTGGACTTCAGCCAGATGTTGTTAGCTATGCTCTACTCATAAGTGCTTATGGTAAAGCTAGAAGGGAA
GAAGAAGCACTAGCAGTTTTTGAGGAAATGCTTGACGCTGGTATTAGACCAACTCACAAAGCTTATAATATTTTGCTTGATGCATTTGCAATCTCTGGAATGGTCGAGCA
AGCTAAGATCGTGTTCAAGAGTATGAGAAGGGACAGATGCAATCCAGATATTTGCTCTTATACAACCATGTTATCAGCTTTTGTTAATGCATCTGACATGGAGGGAGCTG
AAAATTTTTTCAGACGACTGAAACAAGATGGTTTTAGACCCAACGTTGTTACTTATGGTACATTGATCAAAGGATATGCTAAAATAAATAATCTTGAAAAGATGATTAAA
AAATATGAAGAAATGAAGGTCAATGGTATTCGAGTGAATCAGACGATTTTGACGACAATCATGGATGCATATGGAAAGAACAAGGATTTTGGTAGTGCTGTTATTTGGTT
CAAGGAAATTGGATCTTGTGGCCTTCGGCCTGATCAAAAAGCGAAAAATATCCTGTTATCTTTGGCAAAAACAACGGAAGAGCTCGATGAAGCGAATCAACTCGTAGGAT
ATTCAAGTCAGAGTAGCAATCCTCAAAGAGCTGGCAAGTTTTCCAGGTCTGTTGCTGAGGATGAGGAAGAAGAAGATGATGAATTAGATTATGCAGATGATGTAATCCCT
CACACTAACCAAAGAGGTGAGAAAATTATTTTAAATGGCATTCATCAACAAAACTTGGAGCAAAACTTGGAGGGGTTATGTGCTAAGATTTACTAATGTTATATATTACC
GCAGTTTTCAGTTTGTATACCCTTTATGTTGGTATATTGTAATACAAAAATCACTTAGCTGGAGAAATTTAAGCTGCAAAAACTTTGGTGGCCATTTTTGTTGTAATATA
TCCATTCGTTTTTCACACATCTTTTACTCTTTTTGGTTGTTA
Protein sequenceShow/hide protein sequence
MSTLILIYCYVYSLIGGYLYGYVDTKKVLTKRKKLEVFKDAADEADQKNWRRLMNEIEETGSAVSVLRSERIKNEAIPKDLVLGTLVRFKQLKKWNLVSEVQSIDICVLN
YLAAQFGQLDYFISILEWLRTQSWWNFSEMDFLMLITAYGKLGDFNRAEKVLSLMNKKGYPPNVISHTALMEAYGRGGRYNNAEAIFRRMQSGGPEPSALTYQIMLKTFV
EGSKFKEAEELFDSLLNKERAVLKPDQKMFHMIIYMFKKAGNYEKARKVFAEMAARGVPQTTVTYNSLMSFETNYKEVSKIYDQMQRAGLQPDVVSYALLISAYGKARRE
EEALAVFEEMLDAGIRPTHKAYNILLDAFAISGMVEQAKIVFKSMRRDRCNPDICSYTTMLSAFVNASDMEGAENFFRRLKQDGFRPNVVTYGTLIKGYAKINNLEKMIK
KYEEMKVNGIRVNQTILTTIMDAYGKNKDFGSAVIWFKEIGSCGLRPDQKAKNILLSLAKTTEELDEANQLVGYSSQSSNPQRAGKFSRSVAEDEEEEDDELDYADDVIP
HTNQRGEKIILNGIHQQNLEQNLEGLCAKIY