; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0014658 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0014658
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionpentatricopeptide repeat-containing protein At4g18975, chloroplastic
Genome locationchr12:3187436..3197665
RNA-Seq ExpressionLag0014658
SyntenyLag0014658
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571702.1 Mediator of RNA polymerase II transcription subunit 15a, partial [Cucurbita argyrosperma subsp. sororia]8.0e-13387.32Show/hide
Query:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT
        MLIRR HRAATWATPLLRD TVGQIMELGV++LQ+GNS YCTM+Q QM K+ ADKDM +KD+N++K L QTSE+N GD+RKHQIGENVSRKDKI+FLVNT
Subjt:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL
        L+DLRDSKEAVYGALDAWVAWEQ+FPIASLK ALA LEKE QWHRVVQVIKWMLSKGQGTT+NVYGQLIRALDMDHRAEEAHKFWVM IGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYK
        CRSMISIYYRNKMLE+LVKLFKDLEAFGRKPPEKSIVQRVADA EMLGL+EEKERVLVKY  LFTDEKK SIKKYK
Subjt:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYK

XP_022154414.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Momordica charantia]1.3e-13886.16Show/hide
Query:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT
        ML+RR HRA TW TPLLRDLT GQIM+LGVSRLQVGNS YCTM+QAQMC+QLAD+DM NKD+N++KALCQ SE+N GDMRKHQIGENVSRKDKINFLV T
Subjt:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL
        L+DLR SKEAVYGALDAWVAWEQNFPIASLKQ LA LEKEQQWHRVVQVIKWMLSKGQGTT+ VYGQLIRALDMDHRAEE+HKFWVM IG+DLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYKRISSEKSKRKKKI
        CRSMISIYYRNKML+NLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGL EEKERVL KYKDLFTDE+K  I+KY +IS EKSKR++K+
Subjt:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYKRISSEKSKRKKKI

XP_022154416.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X3 [Momordica charantia]1.3e-13886.16Show/hide
Query:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT
        ML+RR HRA TW TPLLRDLT GQIM+LGVSRLQVGNS YCTM+QAQMC+QLAD+DM NKD+N++KALCQ SE+N GDMRKHQIGENVSRKDKINFLV T
Subjt:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL
        L+DLR SKEAVYGALDAWVAWEQNFPIASLKQ LA LEKEQQWHRVVQVIKWMLSKGQGTT+ VYGQLIRALDMDHRAEE+HKFWVM IG+DLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYKRISSEKSKRKKKI
        CRSMISIYYRNKML+NLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGL EEKERVL KYKDLFTDE+K  I+KY +IS EKSKR++K+
Subjt:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYKRISSEKSKRKKKI

XP_022967610.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucurbita maxima]4.7e-13387.32Show/hide
Query:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT
        MLIRR HRAATWATPLLRD TVGQ+MELGV++LQ+GNS YCTM+Q QM K+ ADKDM +KD+N++K L QTSE+N GD+RKHQIGENVSRKDKINFLVNT
Subjt:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL
        L+DLRDSKEAVYGALDAWVAWEQ+FPIASLK ALA LEKE QWHRVVQVIKWMLSKGQGTT+NVYGQLIRALDMDHRAEEAHKFWVM IGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYK
        CRSMISIYYRNKMLE+LVKLFKDLEAFGRKPPEKSIVQRVADA EMLGL+EEKERVLVKY  LFTDEKK SIKKYK
Subjt:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYK

XP_038887984.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Benincasa hispida]2.1e-13385.76Show/hide
Query:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT
        ML+RR HRA  WATPLLRDLTVGQIMELGVSRLQVG+  YCTMIQ QM KQLA KD+ NKD N++KAL QTSE+N GD+RKHQIG+NV RKDKINFLVNT
Subjt:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL
        LLDLRDSKEAVYGALDAWVAWEQ+FPI SLK  L  LEKEQQWHRVVQVIKWMLSKGQGTT+NVYGQLIRALDMDHRAEEAHKFWVM IGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYKRISSEKSKRKKK
        CRSMI+IYYRNKMLE+LVKLFKDLEAFGRKPPEKSIVQRVADA E+LGLLEEKERVL+KYK LFTDEK+ SIKKYKR+S EKSK K+K
Subjt:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYKRISSEKSKRKKK

TrEMBL top hitse value%identityAlignment
A0A6J1DJJ2 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X21.3e-13183.74Show/hide
Query:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT
        ML+RR HRA TW TPLLRDLT GQIM+LGVSRLQVGNS YCTM+QAQMC+QLAD+DM NK           SE+N GDMRKHQIGENVSRKDKINFLV T
Subjt:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL
        L+DLR SKEAVYGALDAWVAWEQNFPIASLKQ LA LEKEQQWHRVVQVIKWMLSKGQGTT+ VYGQLIRALDMDHRAEE+HKFWVM IG+DLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYKRISSEKSKRKKKI
        CRSMISIYYRNKML+NLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGL EEKERVL KYKDLFTDE+K  I+KY +IS EKSKR++K+
Subjt:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYKRISSEKSKRKKKI

A0A6J1DM10 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X16.2e-13986.16Show/hide
Query:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT
        ML+RR HRA TW TPLLRDLT GQIM+LGVSRLQVGNS YCTM+QAQMC+QLAD+DM NKD+N++KALCQ SE+N GDMRKHQIGENVSRKDKINFLV T
Subjt:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL
        L+DLR SKEAVYGALDAWVAWEQNFPIASLKQ LA LEKEQQWHRVVQVIKWMLSKGQGTT+ VYGQLIRALDMDHRAEE+HKFWVM IG+DLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYKRISSEKSKRKKKI
        CRSMISIYYRNKML+NLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGL EEKERVL KYKDLFTDE+K  I+KY +IS EKSKR++K+
Subjt:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYKRISSEKSKRKKKI

A0A6J1DNN5 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X36.2e-13986.16Show/hide
Query:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT
        ML+RR HRA TW TPLLRDLT GQIM+LGVSRLQVGNS YCTM+QAQMC+QLAD+DM NKD+N++KALCQ SE+N GDMRKHQIGENVSRKDKINFLV T
Subjt:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL
        L+DLR SKEAVYGALDAWVAWEQNFPIASLKQ LA LEKEQQWHRVVQVIKWMLSKGQGTT+ VYGQLIRALDMDHRAEE+HKFWVM IG+DLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYKRISSEKSKRKKKI
        CRSMISIYYRNKML+NLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGL EEKERVL KYKDLFTDE+K  I+KY +IS EKSKR++K+
Subjt:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYKRISSEKSKRKKKI

A0A6J1HGC4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X14.3e-13286.59Show/hide
Query:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT
        MLIRR HRAATWATPLLRD TVGQIMELGV++LQ+GNS YCTM+Q QM K+  DKDM +KD+N++K L QTSE+N GD+RKHQIGENVSRKDKI+FLVNT
Subjt:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL
        L+DLRDSKEAVYGALDAWVAWEQ+FPIASLK ALA LEKE QWHRVVQVIKWMLSKGQGTT+NVYGQLIRALDMDHRAEEAHKFWVM IGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYK
        CRSMISIYYRNKMLE+LVKLFK+LEAFGRKPPEKSIVQRVADA EMLGL+EEKERVLVKY  LFTDEKK SIKKYK
Subjt:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYK

A0A6J1HUZ4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X12.3e-13387.32Show/hide
Query:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT
        MLIRR HRAATWATPLLRD TVGQ+MELGV++LQ+GNS YCTM+Q QM K+ ADKDM +KD+N++K L QTSE+N GD+RKHQIGENVSRKDKINFLVNT
Subjt:  MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL
        L+DLRDSKEAVYGALDAWVAWEQ+FPIASLK ALA LEKE QWHRVVQVIKWMLSKGQGTT+NVYGQLIRALDMDHRAEEAHKFWVM IGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQL

Query:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYK
        CRSMISIYYRNKMLE+LVKLFKDLEAFGRKPPEKSIVQRVADA EMLGL+EEKERVLVKY  LFTDEKK SIKKYK
Subjt:  CRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYK

SwissProt top hitse value%identityAlignment
Q2V3H0 Pentatricopeptide repeat-containing protein At4g18975, chloroplastic1.7e-3235.5Show/hide
Query:  QLADKDMNNKDINHNKALCQTSEKNTGDMRKH--QIGENVSRKDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQ
        + ++K+    D  +   +     K  G    H  +  ++     K   LV  L  L + KEAVYGAL+ WVAWE  FPI +  +AL  L K  QWHRV+Q
Subjt:  QLADKDMNNKDINHNKALCQTSEKNTGDMRKH--QIGENVSRKDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQ

Query:  VIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLG
        + KWMLSKGQG T+  Y  L+ A DMD RA+EA   W M + +   S+P +L   MI++Y  + + + ++++F D+E     P E S  +RVA A+  L 
Subjt:  VIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLG

Query:  LLEEKE----RVLVKYKDLFTDEKKASIKKY
          E ++    R L +YK ++ + ++  +K+Y
Subjt:  LLEEKE----RVLVKYKDLFTDEKKASIKKY

Q8LG95 Pentatricopeptide repeat-containing protein At4g211902.3e-2935.03Show/hide
Query:  HNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLN
        +N  +C          R  +  + +    K   ++  +  L + KE VYGALD+++AWE  FP+  +K+AL  LE E++W +++QV KWMLSKGQG T+ 
Subjt:  HNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLN

Query:  VYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKY
         Y  L+ AL  D+R +EA + W       L   P +    MISIYY+  M + L ++F D+E  G K P  +IV  V   +  L + ++ E+++ KY
Subjt:  VYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKY

Arabidopsis top hitse value%identityAlignment
AT1G04590.1 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G21190.1)1.3e-7760.71Show/hide
Query:  IQAQMCKQLADKDMNNKDINHN---KALCQTSEK-NTGDMRKHQIGENVSRKDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEK
        +Q+   + +AD   + K I  N   +    +S+K N  + RKHQIGEN+ +KDKI FLVNTLLD+ D+KEAVYGALDAWVAWE+NFPIASLK  +A+LEK
Subjt:  IQAQMCKQLADKDMNNKDINHN---KALCQTSEK-NTGDMRKHQIGENVSRKDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEK

Query:  EQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQR
        E QWHR+VQVIKW+LSKGQG T+  YGQLIRALDMD RAEEAH  W   +G+DLHSVPWQLC  M+ IY+RN ML+ LVKLFKDLE++ RKPP+K IVQ 
Subjt:  EQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQR

Query:  VADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYKRISSEKSKRKKKIDE
        VADAYE+LG+L+EKERV+ KY  L       S  K  R S +K K + +I E
Subjt:  VADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYKRISSEKSKRKKKIDE

AT1G04590.2 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G18975.4)5.6e-7660Show/hide
Query:  IQAQMCKQLADKDMNNKDINHN---KALCQTSEK-NTGDMRKHQIGENVSRKDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEK
        +Q+   + +AD   + K I  N   +    +S+K N  + RKHQIGEN+ +KDKI FLVNTLLD+ D+KEAVYGALDAWVAWE+NFPIASLK  +A+LEK
Subjt:  IQAQMCKQLADKDMNNKDINHN---KALCQTSEK-NTGDMRKHQIGENVSRKDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEK

Query:  EQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQLCRSMISIYYRNKMLENLV---KLFKDLEAFGRKPPEKSI
        E QWHR+VQVIKW+LSKGQG T+  YGQLIRALDMD RAEEAH  W   +G+DLHSVPWQLC  M+ IY+RN ML+ LV   KLFKDLE++ RKPP+K I
Subjt:  EQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQLCRSMISIYYRNKMLENLV---KLFKDLEAFGRKPPEKSI

Query:  VQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYKRISSEKSKRKKKIDE
        VQ VADAYE+LG+L+EKERV+ KY  L       S  K  R S +K K + +I E
Subjt:  VQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYKRISSEKSKRKKKIDE

AT4G18975.1 Pentatricopeptide repeat (PPR) superfamily protein1.2e-3335.5Show/hide
Query:  QLADKDMNNKDINHNKALCQTSEKNTGDMRKH--QIGENVSRKDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQ
        + ++K+    D  +   +     K  G    H  +  ++     K   LV  L  L + KEAVYGAL+ WVAWE  FPI +  +AL  L K  QWHRV+Q
Subjt:  QLADKDMNNKDINHNKALCQTSEKNTGDMRKH--QIGENVSRKDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQ

Query:  VIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLG
        + KWMLSKGQG T+  Y  L+ A DMD RA+EA   W M + +   S+P +L   MI++Y  + + + ++++F D+E     P E S  +RVA A+  L 
Subjt:  VIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLG

Query:  LLEEKE----RVLVKYKDLFTDEKKASIKKY
          E ++    R L +YK ++ + ++  +K+Y
Subjt:  LLEEKE----RVLVKYKDLFTDEKKASIKKY

AT4G18975.2 Pentatricopeptide repeat (PPR) superfamily protein4.1e-3435.93Show/hide
Query:  QLADKDMNNKDINHNKALCQTSEKNTGDMRKH--QIGENVSRKDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQ
        Q+ +K+    D  +   +     K  G    H  +  ++     K   LV  L  L + KEAVYGAL+ WVAWE  FPI +  +AL  L K  QWHRV+Q
Subjt:  QLADKDMNNKDINHNKALCQTSEKNTGDMRKH--QIGENVSRKDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQ

Query:  VIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLG
        + KWMLSKGQG T+  Y  L+ A DMD RA+EA   W M + +   S+P +L   MI++Y  + + + ++++F D+E     P E S  +RVA A+  L 
Subjt:  VIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLG

Query:  LLEEKE----RVLVKYKDLFTDEKKASIKKY
          E ++    R L +YK ++ + ++  +K+Y
Subjt:  LLEEKE----RVLVKYKDLFTDEKKASIKKY

AT4G18975.3 Pentatricopeptide repeat (PPR) superfamily protein1.2e-3335.5Show/hide
Query:  QLADKDMNNKDINHNKALCQTSEKNTGDMRKH--QIGENVSRKDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQ
        + ++K+    D  +   +     K  G    H  +  ++     K   LV  L  L + KEAVYGAL+ WVAWE  FPI +  +AL  L K  QWHRV+Q
Subjt:  QLADKDMNNKDINHNKALCQTSEKNTGDMRKH--QIGENVSRKDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQ

Query:  VIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLG
        + KWMLSKGQG T+  Y  L+ A DMD RA+EA   W M + +   S+P +L   MI++Y  + + + ++++F D+E     P E S  +RVA A+  L 
Subjt:  VIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPEKSIVQRVADAYEMLG

Query:  LLEEKE----RVLVKYKDLFTDEKKASIKKY
          E ++    R L +YK ++ + ++  +K+Y
Subjt:  LLEEKE----RVLVKYKDLFTDEKKASIKKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCATCAGGAGGCTTCATCGAGCAGCGACATGGGCAACGCCTCTATTGCGAGACCTAACTGTAGGACAAATAATGGAGCTTGGTGTCAGCAGGCTGCAAGTTGGGAA
CTCCTTTTACTGCACAATGATACAAGCTCAAATGTGTAAGCAGCTTGCTGATAAAGATATGAATAATAAGGATATTAACCATAATAAAGCTCTGTGCCAGACTTCAGAAA
AAAATACTGGAGACATGAGGAAGCACCAAATTGGGGAAAATGTATCACGGAAGGACAAAATTAACTTCCTTGTAAATACGCTTCTCGATCTGAGAGATAGTAAGGAGGCT
GTTTATGGTGCTCTTGATGCCTGGGTTGCATGGGAGCAAAACTTTCCAATAGCATCCCTTAAGCAGGCATTGGCTGCCCTTGAGAAGGAACAGCAGTGGCATAGAGTTGT
TCAGGTAATCAAATGGATGTTAAGCAAGGGACAGGGAACCACATTGAACGTCTATGGGCAGTTAATACGGGCTTTAGACATGGACCATCGAGCGGAAGAGGCACACAAGT
TTTGGGTCATGACAATTGGTTCAGATCTTCATTCAGTCCCTTGGCAATTGTGCAGAAGCATGATATCAATATACTATCGAAATAAAATGCTAGAAAATCTCGTAAAGCTT
TTTAAGGATCTCGAAGCTTTCGGTCGTAAACCCCCAGAAAAATCAATAGTGCAGAGGGTAGCAGATGCTTATGAGATGCTAGGCTTGCTTGAAGAGAAAGAGAGGGTGTT
AGTAAAGTACAAAGACCTTTTTACTGATGAAAAGAAAGCGTCCATAAAAAAATATAAGAGGATTTCGTCTGAGAAATCAAAGAGAAAAAAGAAAATCGATGAAGGGCACT
GA
mRNA sequenceShow/hide mRNA sequence
ATGCTCATCAGGAGGCTTCATCGAGCAGCGACATGGGCAACGCCTCTATTGCGAGACCTAACTGTAGGACAAATAATGGAGCTTGGTGTCAGCAGGCTGCAAGTTGGGAA
CTCCTTTTACTGCACAATGATACAAGCTCAAATGTGTAAGCAGCTTGCTGATAAAGATATGAATAATAAGGATATTAACCATAATAAAGCTCTGTGCCAGACTTCAGAAA
AAAATACTGGAGACATGAGGAAGCACCAAATTGGGGAAAATGTATCACGGAAGGACAAAATTAACTTCCTTGTAAATACGCTTCTCGATCTGAGAGATAGTAAGGAGGCT
GTTTATGGTGCTCTTGATGCCTGGGTTGCATGGGAGCAAAACTTTCCAATAGCATCCCTTAAGCAGGCATTGGCTGCCCTTGAGAAGGAACAGCAGTGGCATAGAGTTGT
TCAGGTAATCAAATGGATGTTAAGCAAGGGACAGGGAACCACATTGAACGTCTATGGGCAGTTAATACGGGCTTTAGACATGGACCATCGAGCGGAAGAGGCACACAAGT
TTTGGGTCATGACAATTGGTTCAGATCTTCATTCAGTCCCTTGGCAATTGTGCAGAAGCATGATATCAATATACTATCGAAATAAAATGCTAGAAAATCTCGTAAAGCTT
TTTAAGGATCTCGAAGCTTTCGGTCGTAAACCCCCAGAAAAATCAATAGTGCAGAGGGTAGCAGATGCTTATGAGATGCTAGGCTTGCTTGAAGAGAAAGAGAGGGTGTT
AGTAAAGTACAAAGACCTTTTTACTGATGAAAAGAAAGCGTCCATAAAAAAATATAAGAGGATTTCGTCTGAGAAATCAAAGAGAAAAAAGAAAATCGATGAAGGGCACT
GA
Protein sequenceShow/hide protein sequence
MLIRRLHRAATWATPLLRDLTVGQIMELGVSRLQVGNSFYCTMIQAQMCKQLADKDMNNKDINHNKALCQTSEKNTGDMRKHQIGENVSRKDKINFLVNTLLDLRDSKEA
VYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTLNVYGQLIRALDMDHRAEEAHKFWVMTIGSDLHSVPWQLCRSMISIYYRNKMLENLVKL
FKDLEAFGRKPPEKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTDEKKASIKKYKRISSEKSKRKKKIDEGH