; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022850 (gene) of Snake gourd v1 genome

Gene IDTan0022850
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionpentatricopeptide repeat-containing protein At4g18975, chloroplastic
Genome locationLG02:35876117..35897248
RNA-Seq ExpressionTan0022850
SyntenyTan0022850
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008455250.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucumis melo]7.8e-13984.49Show/hide
Query:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT
        MLIRRF+RAA+WATPL+R  TVG+ MELGVSRLQVG S+YCTMIQ QM KQ+ADK  KNKDV+NSKAL   SEQNIGDIRKH+IGENV+RKDKI+FLVNT
Subjt:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT

Query:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        LLDLRDSKEAVY ALD WVAWEQDFPIASLK  LAALEKEQQWHR+VQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM
        CRSMI+IYYRNKMLEDLVKLFKDLEAFGR+PP+KSIVQRVADA EMLGLLEEKER+L KYK LF DEK+ S+KKYKR+SFEK  RKRKSTKG+EDN +L+
Subjt:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM

Query:  KAQ
        K++
Subjt:  KAQ

XP_022154414.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Momordica charantia]5.2e-14384.16Show/hide
Query:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT
        ML+RRFHRA +W TPL+R+LT GQIM+LGVSRLQVGNS YCTM+QAQMC+Q+AD+ MKNKDVNNSKALCQ SEQN GD+RKHQIGENV+RKDKINFLV T
Subjt:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT

Query:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L+DLR SKEAVY ALD WVAWEQ+FPIASLKQ LA LEKEQQWHRVVQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEE+HKFWVMKIG+DLHSVPWQL
Subjt:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM
        CRSMISIYYRNKML++LVKLFKDLEAFGR+PPEKSIVQRVADAYEMLGL EEKER+L KYKDLFTDE+KG I+KY +ISFEKS R+RK TK ++DNGDL 
Subjt:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM

Query:  KAQ
        K Q
Subjt:  KAQ

XP_022154416.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X3 [Momordica charantia]5.2e-14384.16Show/hide
Query:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT
        ML+RRFHRA +W TPL+R+LT GQIM+LGVSRLQVGNS YCTM+QAQMC+Q+AD+ MKNKDVNNSKALCQ SEQN GD+RKHQIGENV+RKDKINFLV T
Subjt:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT

Query:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L+DLR SKEAVY ALD WVAWEQ+FPIASLKQ LA LEKEQQWHRVVQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEE+HKFWVMKIG+DLHSVPWQL
Subjt:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM
        CRSMISIYYRNKML++LVKLFKDLEAFGR+PPEKSIVQRVADAYEMLGL EEKER+L KYKDLFTDE+KG I+KY +ISFEKS R+RK TK ++DNGDL 
Subjt:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM

Query:  KAQ
        K Q
Subjt:  KAQ

XP_022967610.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucurbita maxima]2.3e-13884.39Show/hide
Query:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT
        MLIRRFHRAA+WATPL+R+ TVGQ+MELGV++LQ+GNS YCTM+Q QM K+ ADK M +KDVNNSK L QTSE+NIGDIRKHQIGENV+RKDKINFLVNT
Subjt:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT

Query:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L+DLRDSKEAVY ALD WVAWEQDFPIASLK ALA LEKE QWHRVVQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM
        CRSMISIYYRNKMLEDLVKLFKDLEAFGR+PPEKSIVQRVADA EMLGL+EEKER+L KY  LFTDEKKGSIKKYK         KRKSTKG +DN DLM
Subjt:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM

Query:  K
        K
Subjt:  K

XP_038887984.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Benincasa hispida]8.0e-14486.47Show/hide
Query:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT
        ML+RRFHRA +WATPL+R+LTVGQIMELGVSRLQVG+  YCTMIQ QM KQ+A K +KNKD NNSKAL QTSEQNIGD+RKHQIG+NV RKDKINFLVNT
Subjt:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT

Query:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        LLDLRDSKEAVY ALD WVAWEQDFPI SLK  L  LEKEQQWHRVVQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM
        CRSMI+IYYRNKMLEDLVKLFKDLEAFGR+PPEKSIVQRVADA E+LGLLEEKER+L KYK LFTDEK+GSIKKYKR+SFEKS  KRKSTK TEDN +LM
Subjt:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM

Query:  KAQ
        KAQ
Subjt:  KAQ

TrEMBL top hitse value%identityAlignment
A0A1S3C174 pentatricopeptide repeat-containing protein At4g18975, chloroplastic3.8e-13984.49Show/hide
Query:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT
        MLIRRF+RAA+WATPL+R  TVG+ MELGVSRLQVG S+YCTMIQ QM KQ+ADK  KNKDV+NSKAL   SEQNIGDIRKH+IGENV+RKDKI+FLVNT
Subjt:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT

Query:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        LLDLRDSKEAVY ALD WVAWEQDFPIASLK  LAALEKEQQWHR+VQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM
        CRSMI+IYYRNKMLEDLVKLFKDLEAFGR+PP+KSIVQRVADA EMLGLLEEKER+L KYK LF DEK+ S+KKYKR+SFEK  RKRKSTKG+EDN +L+
Subjt:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM

Query:  KAQ
        K++
Subjt:  KAQ

A0A6J1DM10 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X12.5e-14384.16Show/hide
Query:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT
        ML+RRFHRA +W TPL+R+LT GQIM+LGVSRLQVGNS YCTM+QAQMC+Q+AD+ MKNKDVNNSKALCQ SEQN GD+RKHQIGENV+RKDKINFLV T
Subjt:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT

Query:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L+DLR SKEAVY ALD WVAWEQ+FPIASLKQ LA LEKEQQWHRVVQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEE+HKFWVMKIG+DLHSVPWQL
Subjt:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM
        CRSMISIYYRNKML++LVKLFKDLEAFGR+PPEKSIVQRVADAYEMLGL EEKER+L KYKDLFTDE+KG I+KY +ISFEKS R+RK TK ++DNGDL 
Subjt:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM

Query:  KAQ
        K Q
Subjt:  KAQ

A0A6J1DNN5 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X32.5e-14384.16Show/hide
Query:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT
        ML+RRFHRA +W TPL+R+LT GQIM+LGVSRLQVGNS YCTM+QAQMC+Q+AD+ MKNKDVNNSKALCQ SEQN GD+RKHQIGENV+RKDKINFLV T
Subjt:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT

Query:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L+DLR SKEAVY ALD WVAWEQ+FPIASLKQ LA LEKEQQWHRVVQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEE+HKFWVMKIG+DLHSVPWQL
Subjt:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM
        CRSMISIYYRNKML++LVKLFKDLEAFGR+PPEKSIVQRVADAYEMLGL EEKER+L KYKDLFTDE+KG I+KY +ISFEKS R+RK TK ++DNGDL 
Subjt:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM

Query:  KAQ
        K Q
Subjt:  KAQ

A0A6J1HGC4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X12.1e-13783.72Show/hide
Query:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT
        MLIRRFHRAA+WATPL+R+ TVGQIMELGV++LQ+GNS YCTM+Q QM K+  DK M +KDVNNSK L QTSE+NIGDIRKHQIGENV+RKDKI+FLVNT
Subjt:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT

Query:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L+DLRDSKEAVY ALD WVAWEQDFPIASLK ALA LEKE QWHRVVQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM
        CRSMISIYYRNKMLEDLVKLFK+LEAFGR+PPEKSIVQRVADA EMLGL+EEKER+L KY  LFTDEKKGSIKKYK         KRKSTKG +DN DLM
Subjt:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM

Query:  K
        K
Subjt:  K

A0A6J1HUZ4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X11.1e-13884.39Show/hide
Query:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT
        MLIRRFHRAA+WATPL+R+ TVGQ+MELGV++LQ+GNS YCTM+Q QM K+ ADK M +KDVNNSK L QTSE+NIGDIRKHQIGENV+RKDKINFLVNT
Subjt:  MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNT

Query:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L+DLRDSKEAVY ALD WVAWEQDFPIASLK ALA LEKE QWHRVVQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM
        CRSMISIYYRNKMLEDLVKLFKDLEAFGR+PPEKSIVQRVADA EMLGL+EEKER+L KY  LFTDEKKGSIKKYK         KRKSTKG +DN DLM
Subjt:  CRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLM

Query:  K
        K
Subjt:  K

SwissProt top hitse value%identityAlignment
Q2V3H0 Pentatricopeptide repeat-containing protein At4g18975, chloroplastic1.6e-3342.08Show/hide
Query:  LVNTLLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSV
        LV  L  L + KEAVY AL+ WVAWE +FPI +  +AL  L K  QWHRV+Q+ KWMLSKGQG TMG Y  L+ A DMD RA+EA   W M + +   S+
Subjt:  LVNTLLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSV

Query:  PWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKE----RILAKYKDLFTDEKKGSIKKY
        P +L   MI++Y  + + + ++++F D+E     P E S  +RVA A+  L   E ++    R L++YK ++ + ++  +K+Y
Subjt:  PWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKE----RILAKYKDLFTDEKKGSIKKY

Q8LG95 Pentatricopeptide repeat-containing protein At4g211906.2e-3039.63Show/hide
Query:  LVNTLLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSV
        ++  +  L + KE VY ALD+++AWE +FP+  +K+AL  LE E++W +++QV KWMLSKGQG TMG Y  L+ AL  D+R +EA + W       L   
Subjt:  LVNTLLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSV

Query:  PWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKY
        P +    MISIYY+  M + L ++F D+E  G + P  +IV  V   +  L + ++ E+++ KY
Subjt:  PWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKY

Arabidopsis top hitse value%identityAlignment
AT1G04590.1 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G21190.1)8.1e-7862.11Show/hide
Query:  KAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNTLLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWML
        + +KN+D  +      + + N  + RKHQIGEN+ +KDKI FLVNTLLD+ D+KEAVY ALD WVAWE++FPIASLK  +A+LEKE QWHR+VQVIKW+L
Subjt:  KAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNTLLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWML

Query:  SKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKE
        SKGQG TMG YGQLIRALDMD RAEEAH  W  K+G+DLHSVPWQLC  M+ IY+RN ML++LVKLFKDLE++ R+PP+K IVQ VADAYE+LG+L+EKE
Subjt:  SKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKE

Query:  RILAKYKDLF----TDEKKGSIKKYKR
        R++ KY  L     +D+K     + K+
Subjt:  RILAKYKDLF----TDEKKGSIKKYKR

AT1G04590.2 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G18975.4)3.4e-7661.3Show/hide
Query:  KAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNTLLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWML
        + +KN+D  +      + + N  + RKHQIGEN+ +KDKI FLVNTLLD+ D+KEAVY ALD WVAWE++FPIASLK  +A+LEKE QWHR+VQVIKW+L
Subjt:  KAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNTLLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWML

Query:  SKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLV---KLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLE
        SKGQG TMG YGQLIRALDMD RAEEAH  W  K+G+DLHSVPWQLC  M+ IY+RN ML++LV   KLFKDLE++ R+PP+K IVQ VADAYE+LG+L+
Subjt:  SKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLV---KLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLE

Query:  EKERILAKYKDLF----TDEKKGSIKKYKR
        EKER++ KY  L     +D+K     + K+
Subjt:  EKERILAKYKDLF----TDEKKGSIKKYKR

AT4G18975.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-3442.08Show/hide
Query:  LVNTLLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSV
        LV  L  L + KEAVY AL+ WVAWE +FPI +  +AL  L K  QWHRV+Q+ KWMLSKGQG TMG Y  L+ A DMD RA+EA   W M + +   S+
Subjt:  LVNTLLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSV

Query:  PWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKE----RILAKYKDLFTDEKKGSIKKY
        P +L   MI++Y  + + + ++++F D+E     P E S  +RVA A+  L   E ++    R L++YK ++ + ++  +K+Y
Subjt:  PWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKE----RILAKYKDLFTDEKKGSIKKY

AT4G18975.2 Pentatricopeptide repeat (PPR) superfamily protein1.1e-3442.08Show/hide
Query:  LVNTLLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSV
        LV  L  L + KEAVY AL+ WVAWE +FPI +  +AL  L K  QWHRV+Q+ KWMLSKGQG TMG Y  L+ A DMD RA+EA   W M + +   S+
Subjt:  LVNTLLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSV

Query:  PWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKE----RILAKYKDLFTDEKKGSIKKY
        P +L   MI++Y  + + + ++++F D+E     P E S  +RVA A+  L   E ++    R L++YK ++ + ++  +K+Y
Subjt:  PWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKE----RILAKYKDLFTDEKKGSIKKY

AT4G18975.3 Pentatricopeptide repeat (PPR) superfamily protein1.1e-3442.08Show/hide
Query:  LVNTLLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSV
        LV  L  L + KEAVY AL+ WVAWE +FPI +  +AL  L K  QWHRV+Q+ KWMLSKGQG TMG Y  L+ A DMD RA+EA   W M + +   S+
Subjt:  LVNTLLDLRDSKEAVYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSV

Query:  PWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKE----RILAKYKDLFTDEKKGSIKKY
        P +L   MI++Y  + + + ++++F D+E     P E S  +RVA A+  L   E ++    R L++YK ++ + ++  +K+Y
Subjt:  PWQLCRSMISIYYRNKMLEDLVKLFKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKE----RILAKYKDLFTDEKKGSIKKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCATCCGGAGGTTTCATCGAGCAGCGTCATGGGCAACGCCTCTAATGCGAGAACTAACCGTAGGACAAATAATGGAGCTTGGGGTCAGCAGGCTGCAAGTTGGGAA
CTCTTTTTACTGTACAATGATACAAGCTCAAATGTGTAAACAGATTGCTGATAAAGCTATGAAAAATAAGGATGTTAACAATAGTAAAGCTTTGTGCCAGACTTCAGAGC
AAAATATTGGAGACATTAGAAAGCACCAAATTGGGGAAAATGTTGCACGGAAGGACAAAATTAACTTCCTTGTAAATACGCTTCTCGATTTAAGAGATAGTAAGGAAGCT
GTTTATGCTGCTCTTGATACCTGGGTTGCATGGGAGCAAGACTTTCCAATAGCATCCCTTAAGCAGGCATTGGCTGCCCTTGAGAAGGAACAACAGTGGCATAGAGTTGT
TCAGGTAATCAAATGGATGTTAAGCAAGGGGCAGGGAACTACAATGGGAGTCTATGGGCAGTTAATACGGGCTTTAGACATGGACCATCGAGCGGAAGAAGCACACAAGT
TTTGGGTCATGAAAATTGGTTCGGATCTACATTCAGTCCCTTGGCAATTGTGCAGAAGCATGATATCAATATACTACCGAAATAAAATGCTAGAAGATCTTGTAAAGCTT
TTTAAGGATCTTGAAGCTTTCGGACGTAGACCACCAGAAAAATCAATAGTACAGAGGGTAGCAGATGCTTATGAGATGCTAGGCTTGCTTGAAGAGAAAGAGAGGATATT
AGCGAAGTACAAAGACCTTTTTACAGATGAGAAGAAAGGGTCCATCAAGAAATATAAGAGGATTTCGTTTGAGAAATCAAACAGAAAAAGAAAATCGACCAAGGGCACTG
AAGACAATGGCGATCTTATGAAGGCTCAATGA
mRNA sequenceShow/hide mRNA sequence
GTTTGATTTGGAGCATTTTGCCCCATTACCAGTTGGGACCGTGGCGCAAACATCTGACATTTGAAGGCAGCGCATTGCTCTACCATGCTCATCCGGAGGTTTCATCGAGC
AGCGTCATGGGCAACGCCTCTAATGCGAGAACTAACCGTAGGACAAATAATGGAGCTTGGGGTCAGCAGGCTGCAAGTTGGGAACTCTTTTTACTGTACAATGATACAAG
CTCAAATGTGTAAACAGATTGCTGATAAAGCTATGAAAAATAAGGATGTTAACAATAGTAAAGCTTTGTGCCAGACTTCAGAGCAAAATATTGGAGACATTAGAAAGCAC
CAAATTGGGGAAAATGTTGCACGGAAGGACAAAATTAACTTCCTTGTAAATACGCTTCTCGATTTAAGAGATAGTAAGGAAGCTGTTTATGCTGCTCTTGATACCTGGGT
TGCATGGGAGCAAGACTTTCCAATAGCATCCCTTAAGCAGGCATTGGCTGCCCTTGAGAAGGAACAACAGTGGCATAGAGTTGTTCAGGTAATCAAATGGATGTTAAGCA
AGGGGCAGGGAACTACAATGGGAGTCTATGGGCAGTTAATACGGGCTTTAGACATGGACCATCGAGCGGAAGAAGCACACAAGTTTTGGGTCATGAAAATTGGTTCGGAT
CTACATTCAGTCCCTTGGCAATTGTGCAGAAGCATGATATCAATATACTACCGAAATAAAATGCTAGAAGATCTTGTAAAGCTTTTTAAGGATCTTGAAGCTTTCGGACG
TAGACCACCAGAAAAATCAATAGTACAGAGGGTAGCAGATGCTTATGAGATGCTAGGCTTGCTTGAAGAGAAAGAGAGGATATTAGCGAAGTACAAAGACCTTTTTACAG
ATGAGAAGAAAGGGTCCATCAAGAAATATAAGAGGATTTCGTTTGAGAAATCAAACAGAAAAAGAAAATCGACCAAGGGCACTGAAGACAATGGCGATCTTATGAAGGCT
CAATGAACGAAATTAACTGTTAGTTATCAAATATAAAGAAGAATTATGTATTAGCAATTATGTATTGCATAAATTGACCTTTTCATTTATTTCACCAACTCAGGCTATGA
TTACGTCTCCTGAAGACCTCACCGACTGTTCTGGGTATTGACTTTGGCTCTGTAAATAATTGTTTTTGTTATTTGAATTATAAATTTAGTTCTCGAAATTTAGTAGGGAC
GTGTATTTTGCTCATAAACTGTTACATTCTTTGAATCCATGTAGTTTTTTTTGGCATTAACATCATTAATTAAACGATAATGTGCCATGTAAGCAAATATTAATTGATTT
GGTAGGTGGTGTTACTACGAATGATCAAAAGTACATATGGAC
Protein sequenceShow/hide protein sequence
MLIRRFHRAASWATPLMRELTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQIADKAMKNKDVNNSKALCQTSEQNIGDIRKHQIGENVARKDKINFLVNTLLDLRDSKEA
VYAALDTWVAWEQDFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMGVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLEDLVKL
FKDLEAFGRRPPEKSIVQRVADAYEMLGLLEEKERILAKYKDLFTDEKKGSIKKYKRISFEKSNRKRKSTKGTEDNGDLMKAQ