; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10022849 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10022849
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr05:28954153..28961204
RNA-Seq ExpressionHG10022849
SyntenyHG10022849
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0005840 - ribosome (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001266 - Ribosomal protein S19e
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR018277 - Ribosomal protein S19e, conserved site
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064089.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]3.8e-25889.73Show/hide
Query:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGA--DVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHAL
        MFRS R SLATAAARRFSGEA  AAAENT++EGGA   VVS  GGGRDTLGRRLMSLTFPKRSAVI+IRKWQEEGHT+RKYELN IVRELRKLKRYKHAL
Subjt:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGA--DVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHAL

Query:  EVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL
        EVCEWMTLQKDMKLLPGDYAV LDLIAKIRGLNSAEKFFEDLPDK+R+QS CTALLH YVQ NLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL
Subjt:  EVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL

Query:  EKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTN
        EKVP LI+VLKKNTKPDVVTYNLLLNVCTLQND EAAE+IFLEMKKTK +PDW+SFSTLANLY KKQLTEKAA+TLKEMEKMA +RNR+SFSSLLSLYTN
Subjt:  EKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTN

Query:  LGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLT
        LGDKN V RIWKK+KS FRKMSDSEY CM+SSLVKLNEL EAEKLYTEWESVSGT DTR+ N++LAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLT
Subjt:  LGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLT

Query:  WGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNVQLNDETR
        WGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEG EQLL+ILRNAGHVDTEIYNSLLRTYAKAGKMPL+VAERME DNVQLNDETR
Subjt:  WGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNVQLNDETR

Query:  EFLRLTSKMCGTYHSS
        E LRLTSKMC +  SS
Subjt:  EFLRLTSKMCGTYHSS

KAG6593129.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0089.05Show/hide
Query:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEV
        M RS R  LAT AARRFSGEA   A ENT LE  +      GGGRDTLGRRLMSL FPKRSAVI+IRKWQEEGHT+RKYELNRIVRELRKLKRYKHALE+
Subjt:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEV

Query:  CEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLEK
        CEWMTLQKDMKLLPGDYAVHLDLI+KIRGL+SAEKFF DLPDKMR QSA T+LLHV+VQNNLSEKAEALM KMSE GFLKSPLSFNHMLSL+I+NK LEK
Subjt:  CEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLEK

Query:  VPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLG
        VP L+Q LKKNTKPDV+TYNLLLNVCTLQNDVEAAE+IFLEMK  K EPDWVSFSTLANLYSK+QLTEKAASTLK+MEKMAS+RNRISFSSLLSLYTN G
Subjt:  VPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLG

Query:  DKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWG
        DK+GV RIWKKM S FRKM+DSEYTCMISSLVKL++L EAEKLYTEWESVSGTGDTRVPNILLAAYINKNQ +QAESFY+RM+LKGIVPSYTTWELLTWG
Subjt:  DKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWG

Query:  YLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNVQLNDETREF
        YLKENQMEKVL FFKNAVGSVKKWNADERLV+GVCK+LEEQGN EGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPL+VAERME DNVQLN+E+RE 
Subjt:  YLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNVQLNDETREF

Query:  LRLTSKMCGTYHSSKTLEFEKGGNPSSAIGAFAGLCSLNSTAMATARTVKDVSPHEFVKAYAAHLKRSGKVELPPWADIVKTARFKELAPYDADWYYVRA
        L+LTSKMC  Y SSKTLEFEKGG+PSSAIGAFAGLCSLNSTAMATARTVKDVSPHEFVKAYAAHLKRSGKVELPPW DIVKTARFKELAPYD DWYYVRA
Subjt:  LRLTSKMCGTYHSSKTLEFEKGGNPSSAIGAFAGLCSLNSTAMATARTVKDVSPHEFVKAYAAHLKRSGKVELPPWADIVKTARFKELAPYDADWYYVRA

Query:  ASMARKIYLRGGLGVGAFKRIYGGSKRNGSRPPHFCESSGAIARHILQQLQEMNIVDVDPKGGRRITSSGRRDLDQVAGRIVVAP
        ASMARKIYLRGGLGVGAFKRIYGGSKRNGSRPPHFCESSGAIARHILQQLQEMNIVDVDPKGGRRITSSGRRDLDQVAGRIVVAP
Subjt:  ASMARKIYLRGGLGVGAFKRIYGGSKRNGSRPPHFCESSGAIARHILQQLQEMNIVDVDPKGGRRITSSGRRDLDQVAGRIVVAP

XP_008451368.1 PREDICTED: pentatricopeptide repeat-containing protein At4g02820, mitochondrial [Cucumis melo]7.1e-25789.15Show/hide
Query:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEG--GADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHAL
        MFRS R SLATAAARRFSGEA  AAAENT++EG  G  VVS  GGGRDTLGRRLMSL FPKRSAVI+IRKWQEEGHT+RKYELN IVRELRKLKRYKHAL
Subjt:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEG--GADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHAL

Query:  EVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL
        EVCEWMTLQKDMKLLPGDYAV LDLIAKIRGLNSAEKFFEDLPDK+R+QS CTALLH YVQ NLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL
Subjt:  EVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL

Query:  EKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTN
        EKVP LI+VLKKNTKPDVVTYNLLLNVCTLQND EAAE+IFLEMKKTK +PDW+SFSTLANLY KKQLTEKAA+TLKEMEKMA +RNR+SFSSLLSLY N
Subjt:  EKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTN

Query:  LGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLT
        LGDKN V+RIWKK+KS FRKMSDSEY CM+SSLVKLNEL EAEKLYTEWESVSGT DTR+ N++LAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLT
Subjt:  LGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLT

Query:  WGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNVQLNDETR
        WGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEG EQLL+ILRNAGHVDTEIYNSLLRTYAKAGKMPL+VAERME DNVQLNDETR
Subjt:  WGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNVQLNDETR

Query:  EFLRLTSKMCGTYHSS
        E LRLTSKMC +  SS
Subjt:  EFLRLTSKMCGTYHSS

XP_022150266.1 pentatricopeptide repeat-containing protein At4g02820, mitochondrial isoform X2 [Momordica charantia]2.2e-25087.94Show/hide
Query:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEV
        M RSLR SLAT AARRFSGEA  AA ENTA+EGG+   S  GGGRDTLGRRLMSL FPKRSAVI+IRKWQEEGHT+RKYELNRIVRELRKLKRYKHALE+
Subjt:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEV

Query:  CEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLEK
        CEW T QKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMR QSACTALLHVYVQNNLS+KAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL+K
Subjt:  CEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLEK

Query:  VPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLG
        VP LIQ L+KNTKPDVVTYNLLLNVCTLQNDVEAAE+I LEMKK K E DWV+ STL NLYSKKQLTEKAASTLKEMEKMAS+RNRI+FSSLLSLYTNLG
Subjt:  VPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLG

Query:  DKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWG
        DK+G +RIWKKMK+ FRKMSDSEYTCMISS+VKL+EL EAEKLYTEWESVSGTGDTRVPNILLAAYIN NQMEQAESFYNRMSLKGIVPSYTTWELLTWG
Subjt:  DKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWG

Query:  YLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNVQLNDETREF
        YLKENQMEKVLHFFKNAVGSVKKWNADERLVK VCKKLEE+GNIEGAE+LLI+LRNAGHV+TEIYNSLLRTYAKAGKMPL+VAERME D+V+L++ETRE 
Subjt:  YLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNVQLNDETREF

Query:  LRLTSKMCGTYHSS
        ++LTSKMC +  SS
Subjt:  LRLTSKMCGTYHSS

XP_038900168.1 pentatricopeptide repeat-containing protein At4g02820, mitochondrial [Benincasa hispida]2.7e-26491.7Show/hide
Query:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGADV----------VSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRK
        MFRSLRPSLATAAARRFSGEA  AAAEN  LEGGA V          VS TGGGRDTLGRRLMSLTFPKRSAVI+IRKWQEEGHT+RKYELNRIVRELRK
Subjt:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGADV----------VSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRK

Query:  LKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLS
        LKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQS CTALLH YVQNNL EKAEALMEKMSE GFLK PLSFNHMLS
Subjt:  LKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLS

Query:  LHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFS
        L+ISNKQLEKVPD+IQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAE+IFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLK+MEKMAS+RNRISFS
Subjt:  LHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFS

Query:  SLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPS
        SLLSLYTNLGDKNGV+RIWKKMKS FRKMSDSEYTCMISSLVKLNEL EAEKLY EWESVSGTGDTRV NILLAAYINKNQMEQAE+FYNRMS+KG+VPS
Subjt:  SLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPS

Query:  YTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDN
        YTTWELLTWGYLKENQMEKVLHF KNAVGSVKKWN DERLVK VCKKLEEQGNIEGAEQLL+ILRN GHVDTEIYNSLLRTYAKAGKMPL+VAERMEMDN
Subjt:  YTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDN

Query:  VQLNDETREFLRLTSKMC
        VQLNDETRE LRLTSKMC
Subjt:  VQLNDETREFLRLTSKMC

TrEMBL top hitse value%identityAlignment
A0A0A0K7E2 Uncharacterized protein5.0e-24886.43Show/hide
Query:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEG--GADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHAL
        MFRS RPSLATAAARRFSGEA  AA+ENTALEG  G  VVS  GGGRDTLGRRLMSL FPKRSAV +IRKWQEEG T+RKYELNR VRELRKLKRYKHAL
Subjt:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEG--GADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHAL

Query:  EVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL
        EVCEWMTLQKDM+L+PGDYAVHLDLI KIRGLN AEKFFEDLPDK+R+QS CT+LLH YVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL
Subjt:  EVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL

Query:  EKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTN
        EKVP LI+ LKKNTKPDVVTYNLLLNVCTLQND EAAE+IFLEMKKTK +PDWVSFSTLANLY K QLTEKAA+TLKEMEKMA + NR+S SSLLSLYTN
Subjt:  EKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTN

Query:  LGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLT
        LGDKN VYRIWKK+KS FRKMSD EY CMISSLVKLNEL EAEKLYTEWESVSGT DTRV N++L AYI KNQ+EQAESFYNRM  KG VPSYTTWELLT
Subjt:  LGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLT

Query:  WGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNVQLNDETR
        WGYLKENQMEKVLHFF+ AV  VKKWNADERLVKGVCKKLEEQGNI G EQLL+ILRNAGHVDTEIYNSLLRTYAKAGKMPL+VAERME DNVQLNDETR
Subjt:  WGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNVQLNDETR

Query:  EFLRLTSKMCGTYHSS
        E LRLTSKMC +  SS
Subjt:  EFLRLTSKMCGTYHSS

A0A1S3BRD9 pentatricopeptide repeat-containing protein At4g02820, mitochondrial3.4e-25789.15Show/hide
Query:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEG--GADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHAL
        MFRS R SLATAAARRFSGEA  AAAENT++EG  G  VVS  GGGRDTLGRRLMSL FPKRSAVI+IRKWQEEGHT+RKYELN IVRELRKLKRYKHAL
Subjt:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEG--GADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHAL

Query:  EVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL
        EVCEWMTLQKDMKLLPGDYAV LDLIAKIRGLNSAEKFFEDLPDK+R+QS CTALLH YVQ NLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL
Subjt:  EVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL

Query:  EKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTN
        EKVP LI+VLKKNTKPDVVTYNLLLNVCTLQND EAAE+IFLEMKKTK +PDW+SFSTLANLY KKQLTEKAA+TLKEMEKMA +RNR+SFSSLLSLY N
Subjt:  EKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTN

Query:  LGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLT
        LGDKN V+RIWKK+KS FRKMSDSEY CM+SSLVKLNEL EAEKLYTEWESVSGT DTR+ N++LAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLT
Subjt:  LGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLT

Query:  WGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNVQLNDETR
        WGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEG EQLL+ILRNAGHVDTEIYNSLLRTYAKAGKMPL+VAERME DNVQLNDETR
Subjt:  WGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNVQLNDETR

Query:  EFLRLTSKMCGTYHSS
        E LRLTSKMC +  SS
Subjt:  EFLRLTSKMCGTYHSS

A0A5D3D4M6 Pentatricopeptide repeat-containing protein1.8e-25889.73Show/hide
Query:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGA--DVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHAL
        MFRS R SLATAAARRFSGEA  AAAENT++EGGA   VVS  GGGRDTLGRRLMSLTFPKRSAVI+IRKWQEEGHT+RKYELN IVRELRKLKRYKHAL
Subjt:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGA--DVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHAL

Query:  EVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL
        EVCEWMTLQKDMKLLPGDYAV LDLIAKIRGLNSAEKFFEDLPDK+R+QS CTALLH YVQ NLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL
Subjt:  EVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL

Query:  EKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTN
        EKVP LI+VLKKNTKPDVVTYNLLLNVCTLQND EAAE+IFLEMKKTK +PDW+SFSTLANLY KKQLTEKAA+TLKEMEKMA +RNR+SFSSLLSLYTN
Subjt:  EKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTN

Query:  LGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLT
        LGDKN V RIWKK+KS FRKMSDSEY CM+SSLVKLNEL EAEKLYTEWESVSGT DTR+ N++LAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLT
Subjt:  LGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLT

Query:  WGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNVQLNDETR
        WGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEG EQLL+ILRNAGHVDTEIYNSLLRTYAKAGKMPL+VAERME DNVQLNDETR
Subjt:  WGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNVQLNDETR

Query:  EFLRLTSKMCGTYHSS
        E LRLTSKMC +  SS
Subjt:  EFLRLTSKMCGTYHSS

A0A6J1D809 pentatricopeptide repeat-containing protein At4g02820, mitochondrial isoform X15.5e-24784.49Show/hide
Query:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALE-
        M RSLR SLAT AARRFSGEA  AA ENTA+EGG+   S  GGGRDTLGRRLMSL FPKRSAVI+IRKWQEEGHT+RKYELNRIVRELRKLKRYKHALE 
Subjt:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALE-

Query:  --------------------VCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFL
                            +CEW T QKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMR QSACTALLHVYVQNNLS+KAEALMEKMSECGFL
Subjt:  --------------------VCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFL

Query:  KSPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEK
        KSPLSFNHMLSLHISNKQL+KVP LIQ L+KNTKPDVVTYNLLLNVCTLQNDVEAAE+I LEMKK K E DWV+ STL NLYSKKQLTEKAASTLKEMEK
Subjt:  KSPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEK

Query:  MASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFY
        MAS+RNRI+FSSLLSLYTNLGDK+G +RIWKKMK+ FRKMSDSEYTCMISS+VKL+EL EAEKLYTEWESVSGTGDTRVPNILLAAYIN NQMEQAESFY
Subjt:  MASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFY

Query:  NRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMP
        NRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVK VCKKLEE+GNIEGAE+LLI+LRNAGHV+TEIYNSLLRTYAKAGKMP
Subjt:  NRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMP

Query:  LVVAERMEMDNVQLNDETREFLRLTSKMCGTYHSS
        L+VAERME D+V+L++ETRE ++LTSKMC +  SS
Subjt:  LVVAERMEMDNVQLNDETREFLRLTSKMCGTYHSS

A0A6J1DB09 pentatricopeptide repeat-containing protein At4g02820, mitochondrial isoform X21.1e-25087.94Show/hide
Query:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEV
        M RSLR SLAT AARRFSGEA  AA ENTA+EGG+   S  GGGRDTLGRRLMSL FPKRSAVI+IRKWQEEGHT+RKYELNRIVRELRKLKRYKHALE+
Subjt:  MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEV

Query:  CEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLEK
        CEW T QKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMR QSACTALLHVYVQNNLS+KAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL+K
Subjt:  CEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLEK

Query:  VPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLG
        VP LIQ L+KNTKPDVVTYNLLLNVCTLQNDVEAAE+I LEMKK K E DWV+ STL NLYSKKQLTEKAASTLKEMEKMAS+RNRI+FSSLLSLYTNLG
Subjt:  VPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLG

Query:  DKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWG
        DK+G +RIWKKMK+ FRKMSDSEYTCMISS+VKL+EL EAEKLYTEWESVSGTGDTRVPNILLAAYIN NQMEQAESFYNRMSLKGIVPSYTTWELLTWG
Subjt:  DKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWG

Query:  YLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNVQLNDETREF
        YLKENQMEKVLHFFKNAVGSVKKWNADERLVK VCKKLEE+GNIEGAE+LLI+LRNAGHV+TEIYNSLLRTYAKAGKMPL+VAERME D+V+L++ETRE 
Subjt:  YLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNVQLNDETREF

Query:  LRLTSKMCGTYHSS
        ++LTSKMC +  SS
Subjt:  LRLTSKMCGTYHSS

SwissProt top hitse value%identityAlignment
O22714 Pentatricopeptide repeat-containing protein At1g607704.9e-6734.11Show/hide
Query:  KYELNRIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECG
        K+E+   +++LR    Y  AL++ E M  ++ M     D A+HLDL+AK R + + E +F DLP+  + +    +LL+ Y +  L+EKAE L+ KM E  
Subjt:  KYELNRIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECG

Query:  FLKSPLSFNHMLSLHISNKQLEKVPDLIQVLK-KNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKT-KTEPDWVSFSTLANLYSKKQLTEKAASTLK
           S +S+N +++L+    + EKVP +IQ LK +N  PD  TYN+ +      ND+   E +  EM +  +  PDW ++S +A++Y    L++KA   L+
Subjt:  FLKSPLSFNHMLSLHISNKQLEKVPDLIQVLK-KNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKT-KTEPDWVSFSTLANLYSKKQLTEKAASTLK

Query:  EMEKMASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQA
        E+E   +QR+  ++  L++LY  LG    VYRIW+ ++    K S+  Y  MI  LVKLN+L  AE L+ EW++   T D R+ N+L+ AY  +  +++A
Subjt:  EMEKMASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQA

Query:  ESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV----GSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRN-AGHVDTEIYNSLLR
             +   +G   +  TWE+    Y+K   M + L     AV    G   KW      V+ +    E++ ++ GAE LL IL+N   ++  EI+  L+R
Subjt:  ESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV----GSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRN-AGHVDTEIYNSLLR

Query:  TYAKAGKMPLVVAERMEMDNVQLNDETREFL
        TYA AGK    +  R++M+NV++N+ T++ L
Subjt:  TYAKAGKMPLVVAERMEMDNVQLNDETREFL

Q3E911 Pentatricopeptide repeat-containing protein At5g274607.3e-7135.24Show/hide
Query:  TALEGGADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIR
        ++L  G+D  SV    R++L + ++    P+RS    +++  + GH +   EL  I + L +  RY  AL++ EWM  QKD++    D A+ LDLI K  
Subjt:  TALEGGADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIR

Query:  GLNSAEKFFEDL---PDKMR-DQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKP-DVVTYNLLL
        GL   E++FE L      MR  +SA   LL  YV+N + ++AEALMEK++  GFL +P  FN M+ L+ ++ Q EKV  ++ ++K N  P +V++YNL +
Subjt:  GLNSAEKFFEDL---PDKMR-DQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKP-DVVTYNLLL

Query:  NVCTLQNDVEAAESIFLEMKKTKT-EPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDS
        N C   + V A E+++ EM   K+ E  W S  TLAN+Y K    EKA   L++ EKM ++ NR+ +  L++LY +LG+K GV R+W+  KS+  ++S  
Subjt:  NVCTLQNDVEAAESIFLEMKKTKT-EPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDS

Query:  EYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVK
         Y C++SSLVK  +L EAE++++EWE+     D RV N+LL AY+   ++ +AES +  +  +G  P+Y TWE+L  G++K   MEK +         ++
Subjt:  EYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVK

Query:  K--WNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNV
        +  W     +V  + +  E++  IE A   +  L   G     +Y  LLR +  A +    + E M++D +
Subjt:  K--WNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNV

Q8LPS6 Pentatricopeptide repeat-containing protein At1g021505.1e-7234.82Show/hide
Query:  RRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKD-MKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQS
        +++  +  P+  A   + +W++ G  L K+EL R+V+ELRK KR   ALEV +WM  + +  +L   D A+ LDLI K+RG+  AE+FF  LP+  +D+ 
Subjt:  RRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKD-MKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQS

Query:  ACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLEKVPDLI-QVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKK-TK
           +LL+ YV+    EKAEAL+  M + G+   PL FN M++L+++ ++ +KV  ++ ++ +K+ + D+ +YN+ L+ C     VE  E ++ +MK    
Subjt:  ACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLEKVPDLI-QVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKK-TK

Query:  TEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTE
          P+W +FST+A +Y K   TEKA   L+++E   + RNRI +  LLSLY +LG+K  +YR+W   KS+   + +  Y  ++SSLV++ ++  AEK+Y E
Subjt:  TEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTE

Query:  WESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGS--VKKWNADERLVKGVCKKLEEQGNI
        W  V  + D R+PN+L+ AY+  +Q+E AE  ++ M   G  PS +TWE+L  G+ ++  + + L   +NA  +     W     ++ G  K  EE+ ++
Subjt:  WESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGS--VKKWNADERLVKGVCKKLEEQGNI

Query:  EGAEQLLIILRNAGHVDTEIYNSLL
           E +L +LR +G ++ + Y +L+
Subjt:  EGAEQLLIILRNAGHVDTEIYNSLL

Q9SGA6 40S ribosomal protein S19-11.1e-6683.22Show/hide
Query:  MATARTVKDVSPHEFVKAYAAHLKRSGKVELPPWADIVKTARFKELAPYDADWYYVRAASMARKIYLRGGLGVGAFKRIYGGSKRNGSRPPHFCESSGAI
        MAT +TVKDVSPH+FVKAYA+HLKRSGK+ELP W DIVKT + KELAPYD DWYY+RAASMARK+YLRGGLGVGAF+RIYGGSKRNGSRPPHFC+SSG I
Subjt:  MATARTVKDVSPHEFVKAYAAHLKRSGKVELPPWADIVKTARFKELAPYDADWYYVRAASMARKIYLRGGLGVGAFKRIYGGSKRNGSRPPHFCESSGAI

Query:  ARHILQQLQEMNIVDVDPKGGRRITSSGRRDLDQVAGRIVVAP
        ARHILQQL+ MNIV++D KGGRRITSSG+RDLDQVAGRI V P
Subjt:  ARHILQQLQEMNIVDVDPKGGRRITSSGRRDLDQVAGRIVVAP

Q9SY07 Pentatricopeptide repeat-containing protein At4g02820, mitochondrial3.2e-17560.34Show/hide
Query:  MFRSLRPSLATAAARRFSGEAHKAAAENTAL-----------EGG--ADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRE
        + RS RP+LA +  R FS  A  AA  +TA            +GG  A+      GGRDTLG RL+SL + KRSAV++IRKW+EEGH++RKYELNRIVRE
Subjt:  MFRSLRPSLATAAARRFSGEAHKAAAENTAL-----------EGG--ADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRE

Query:  LRKLKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNH
        LRK+KRYKHALE+CEWM +Q+D+KL  GDYAVHLDLI+KIRGLNSAEKFFED+PD+MR  +ACT+LLH YVQN LS+KAEAL EKM ECGFLKS L +NH
Subjt:  LRKLKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNH

Query:  MLSLHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRI
        MLS++IS  Q EKVP LI+ LK  T PD+VTYNL L      NDVE AE ++L+ K+ K  PDWV++S L NLY+K    EKA   LKEMEK+ S++NR+
Subjt:  MLSLHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRI

Query:  SFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGI
        +++SL+SL+ NLGDK+GV   WKK+KS F+KM+D+EY  MIS++VKL E  +A+ LY EWESVSGTGD R+PN++LA Y+N++++   E FY R+  KGI
Subjt:  SFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGI

Query:  VPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERME
         PSY+TWE+LTW YLK   MEKVL  F  A+ SVKKW  + RLVKG CK+LEEQGN++GAE+L+ +L+ AG+V+T++YNSLLRTYAKAG+M L+V ERM 
Subjt:  VPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERME

Query:  MDNVQLNDETREFLRLTSKMCGTYHSS
         DNV+L++ET+E +RLTS+M  T  SS
Subjt:  MDNVQLNDETREFLRLTSKMCGTYHSS

Arabidopsis top hitse value%identityAlignment
AT1G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.6e-7334.82Show/hide
Query:  RRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKD-MKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQS
        +++  +  P+  A   + +W++ G  L K+EL R+V+ELRK KR   ALEV +WM  + +  +L   D A+ LDLI K+RG+  AE+FF  LP+  +D+ 
Subjt:  RRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKD-MKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQS

Query:  ACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLEKVPDLI-QVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKK-TK
           +LL+ YV+    EKAEAL+  M + G+   PL FN M++L+++ ++ +KV  ++ ++ +K+ + D+ +YN+ L+ C     VE  E ++ +MK    
Subjt:  ACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLEKVPDLI-QVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKK-TK

Query:  TEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTE
          P+W +FST+A +Y K   TEKA   L+++E   + RNRI +  LLSLY +LG+K  +YR+W   KS+   + +  Y  ++SSLV++ ++  AEK+Y E
Subjt:  TEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTE

Query:  WESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGS--VKKWNADERLVKGVCKKLEEQGNI
        W  V  + D R+PN+L+ AY+  +Q+E AE  ++ M   G  PS +TWE+L  G+ ++  + + L   +NA  +     W     ++ G  K  EE+ ++
Subjt:  WESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGS--VKKWNADERLVKGVCKKLEEQGNI

Query:  EGAEQLLIILRNAGHVDTEIYNSLL
           E +L +LR +G ++ + Y +L+
Subjt:  EGAEQLLIILRNAGHVDTEIYNSLL

AT1G60770.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.5e-6834.11Show/hide
Query:  KYELNRIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECG
        K+E+   +++LR    Y  AL++ E M  ++ M     D A+HLDL+AK R + + E +F DLP+  + +    +LL+ Y +  L+EKAE L+ KM E  
Subjt:  KYELNRIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECG

Query:  FLKSPLSFNHMLSLHISNKQLEKVPDLIQVLK-KNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKT-KTEPDWVSFSTLANLYSKKQLTEKAASTLK
           S +S+N +++L+    + EKVP +IQ LK +N  PD  TYN+ +      ND+   E +  EM +  +  PDW ++S +A++Y    L++KA   L+
Subjt:  FLKSPLSFNHMLSLHISNKQLEKVPDLIQVLK-KNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKT-KTEPDWVSFSTLANLYSKKQLTEKAASTLK

Query:  EMEKMASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQA
        E+E   +QR+  ++  L++LY  LG    VYRIW+ ++    K S+  Y  MI  LVKLN+L  AE L+ EW++   T D R+ N+L+ AY  +  +++A
Subjt:  EMEKMASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQA

Query:  ESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV----GSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRN-AGHVDTEIYNSLLR
             +   +G   +  TWE+    Y+K   M + L     AV    G   KW      V+ +    E++ ++ GAE LL IL+N   ++  EI+  L+R
Subjt:  ESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV----GSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRN-AGHVDTEIYNSLLR

Query:  TYAKAGKMPLVVAERMEMDNVQLNDETREFL
        TYA AGK    +  R++M+NV++N+ T++ L
Subjt:  TYAKAGKMPLVVAERMEMDNVQLNDETREFL

AT3G02080.1 Ribosomal protein S19e family protein7.8e-6883.22Show/hide
Query:  MATARTVKDVSPHEFVKAYAAHLKRSGKVELPPWADIVKTARFKELAPYDADWYYVRAASMARKIYLRGGLGVGAFKRIYGGSKRNGSRPPHFCESSGAI
        MAT +TVKDVSPH+FVKAYA+HLKRSGK+ELP W DIVKT + KELAPYD DWYY+RAASMARK+YLRGGLGVGAF+RIYGGSKRNGSRPPHFC+SSG I
Subjt:  MATARTVKDVSPHEFVKAYAAHLKRSGKVELPPWADIVKTARFKELAPYDADWYYVRAASMARKIYLRGGLGVGAFKRIYGGSKRNGSRPPHFCESSGAI

Query:  ARHILQQLQEMNIVDVDPKGGRRITSSGRRDLDQVAGRIVVAP
        ARHILQQL+ MNIV++D KGGRRITSSG+RDLDQVAGRI V P
Subjt:  ARHILQQLQEMNIVDVDPKGGRRITSSGRRDLDQVAGRIVVAP

AT4G02820.1 Pentatricopeptide repeat (PPR) superfamily protein2.3e-17660.34Show/hide
Query:  MFRSLRPSLATAAARRFSGEAHKAAAENTAL-----------EGG--ADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRE
        + RS RP+LA +  R FS  A  AA  +TA            +GG  A+      GGRDTLG RL+SL + KRSAV++IRKW+EEGH++RKYELNRIVRE
Subjt:  MFRSLRPSLATAAARRFSGEAHKAAAENTAL-----------EGG--ADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRE

Query:  LRKLKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNH
        LRK+KRYKHALE+CEWM +Q+D+KL  GDYAVHLDLI+KIRGLNSAEKFFED+PD+MR  +ACT+LLH YVQN LS+KAEAL EKM ECGFLKS L +NH
Subjt:  LRKLKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNH

Query:  MLSLHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRI
        MLS++IS  Q EKVP LI+ LK  T PD+VTYNL L      NDVE AE ++L+ K+ K  PDWV++S L NLY+K    EKA   LKEMEK+ S++NR+
Subjt:  MLSLHISNKQLEKVPDLIQVLKKNTKPDVVTYNLLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRI

Query:  SFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGI
        +++SL+SL+ NLGDK+GV   WKK+KS F+KM+D+EY  MIS++VKL E  +A+ LY EWESVSGTGD R+PN++LA Y+N++++   E FY R+  KGI
Subjt:  SFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGI

Query:  VPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERME
         PSY+TWE+LTW YLK   MEKVL  F  A+ SVKKW  + RLVKG CK+LEEQGN++GAE+L+ +L+ AG+V+T++YNSLLRTYAKAG+M L+V ERM 
Subjt:  VPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERME

Query:  MDNVQLNDETREFLRLTSKMCGTYHSS
         DNV+L++ET+E +RLTS+M  T  SS
Subjt:  MDNVQLNDETREFLRLTSKMCGTYHSS

AT5G27460.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.2e-7235.24Show/hide
Query:  TALEGGADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIR
        ++L  G+D  SV    R++L + ++    P+RS    +++  + GH +   EL  I + L +  RY  AL++ EWM  QKD++    D A+ LDLI K  
Subjt:  TALEGGADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKDMKLLPGDYAVHLDLIAKIR

Query:  GLNSAEKFFEDL---PDKMR-DQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKP-DVVTYNLLL
        GL   E++FE L      MR  +SA   LL  YV+N + ++AEALMEK++  GFL +P  FN M+ L+ ++ Q EKV  ++ ++K N  P +V++YNL +
Subjt:  GLNSAEKFFEDL---PDKMR-DQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKP-DVVTYNLLL

Query:  NVCTLQNDVEAAESIFLEMKKTKT-EPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDS
        N C   + V A E+++ EM   K+ E  W S  TLAN+Y K    EKA   L++ EKM ++ NR+ +  L++LY +LG+K GV R+W+  KS+  ++S  
Subjt:  NVCTLQNDVEAAESIFLEMKKTKT-EPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDS

Query:  EYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVK
         Y C++SSLVK  +L EAE++++EWE+     D RV N+LL AY+   ++ +AES +  +  +G  P+Y TWE+L  G++K   MEK +         ++
Subjt:  EYTCMISSLVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVK

Query:  K--WNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNV
        +  W     +V  + +  E++  IE A   +  L   G     +Y  LLR +  A +    + E M++D +
Subjt:  K--WNADERLVKGVCKKLEEQGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCCGCTCGTTGCGGCCTTCTCTAGCGACGGCCGCCGCCCGCAGATTTTCCGGGGAAGCCCACAAGGCGGCGGCGGAGAACACAGCACTAGAAGGCGGTGCCGACGT
CGTCTCCGTCACAGGCGGTGGTCGAGACACGCTTGGACGAAGGCTCATGAGCCTCACTTTCCCCAAACGCAGCGCCGTGATTTCCATTCGAAAATGGCAAGAAGAGGGCC
ACACTCTTCGCAAGTACGAGCTCAATCGCATCGTTCGGGAGCTTCGCAAGCTCAAGCGCTACAAGCACGCACTTGAGGTGTGTGAATGGATGACATTACAGAAAGATATG
AAGTTGCTACCTGGTGACTATGCAGTTCATCTGGATTTGATTGCAAAAATCCGAGGCCTGAATAGCGCAGAAAAGTTTTTTGAAGATCTCCCTGATAAAATGAGAGATCA
ATCAGCCTGCACAGCTCTTCTTCACGTGTACGTTCAAAATAATCTATCTGAAAAGGCTGAGGCTTTAATGGAGAAAATGTCTGAATGTGGTTTCTTAAAAAGTCCTCTTT
CTTTCAACCACATGCTATCTCTTCACATCTCAAACAAGCAACTAGAGAAGGTTCCTGATCTGATTCAAGTATTAAAGAAGAACACCAAACCAGATGTGGTAACATATAAT
CTTTTGTTGAATGTTTGCACTTTGCAAAATGACGTTGAAGCTGCAGAAAGCATTTTCCTTGAGATGAAGAAGACGAAAACCGAACCAGATTGGGTATCATTTAGCACATT
AGCTAACTTGTATTCCAAAAAACAACTTACCGAAAAAGCAGCGTCTACGTTGAAGGAGATGGAGAAAATGGCATCTCAAAGAAACAGAATCTCATTTTCGTCTCTTCTTA
GCTTGTATACCAATTTGGGGGATAAGAATGGAGTTTACAGAATATGGAAAAAGATGAAGTCATTGTTTCGCAAGATGAGTGATAGTGAGTACACTTGCATGATATCCTCT
CTTGTGAAACTTAATGAGCTTGGGGAAGCTGAGAAACTATATACCGAATGGGAGTCAGTATCCGGGACGGGTGATACTCGGGTTCCAAATATATTGCTTGCAGCGTATAT
CAACAAAAACCAAATGGAACAAGCCGAGAGTTTCTACAATCGGATGTCACTAAAAGGAATAGTTCCATCTTACACTACTTGGGAGCTCCTTACATGGGGTTATTTGAAAG
AGAACCAGATGGAGAAAGTGCTGCATTTCTTCAAGAATGCAGTTGGCAGCGTGAAGAAATGGAATGCGGACGAGAGGTTGGTTAAAGGAGTCTGTAAGAAACTCGAGGAG
CAGGGTAACATTGAAGGGGCAGAGCAGTTGTTGATTATTCTTAGGAATGCTGGTCATGTGGATACTGAGATATACAATTCTCTCTTGCGGACCTATGCAAAAGCTGGTAA
AATGCCACTTGTTGTTGCTGAAAGAATGGAAATGGACAACGTTCAGTTGAACGACGAGACTCGAGAGTTTCTAAGGTTGACCAGCAAGATGTGTGGCACTTATCACTCTT
CGAAAACCCTAGAGTTTGAGAAAGGCGGAAACCCTAGTTCAGCAATCGGAGCTTTTGCCGGCCTTTGCTCACTGAATTCAACAGCCATGGCCACTGCTAGGACCGTCAAG
GACGTCTCTCCTCACGAGTTTGTCAAGGCCTATGCCGCTCATCTCAAGCGATCCGGAAAGGTTGAACTTCCACCATGGGCGGACATTGTGAAAACTGCAAGGTTCAAAGA
GCTCGCTCCATATGATGCGGATTGGTATTACGTGAGAGCTGCATCCATGGCAAGGAAGATCTACTTGAGAGGAGGTCTTGGTGTTGGGGCATTTAAGCGGATTTATGGTG
GAAGCAAGAGGAATGGAAGTCGCCCTCCACACTTTTGTGAAAGCAGTGGAGCCATTGCCCGTCACATTCTACAACAGTTGCAGGAGATGAACATTGTTGATGTGGACCCA
AAGGGTGGAAGGAGAATTACTTCAAGTGGTCGACGAGACCTTGATCAAGTTGCTGGCCGGATTGTTGTTGCCCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCCGCTCGTTGCGGCCTTCTCTAGCGACGGCCGCCGCCCGCAGATTTTCCGGGGAAGCCCACAAGGCGGCGGCGGAGAACACAGCACTAGAAGGCGGTGCCGACGT
CGTCTCCGTCACAGGCGGTGGTCGAGACACGCTTGGACGAAGGCTCATGAGCCTCACTTTCCCCAAACGCAGCGCCGTGATTTCCATTCGAAAATGGCAAGAAGAGGGCC
ACACTCTTCGCAAGTACGAGCTCAATCGCATCGTTCGGGAGCTTCGCAAGCTCAAGCGCTACAAGCACGCACTTGAGGTGTGTGAATGGATGACATTACAGAAAGATATG
AAGTTGCTACCTGGTGACTATGCAGTTCATCTGGATTTGATTGCAAAAATCCGAGGCCTGAATAGCGCAGAAAAGTTTTTTGAAGATCTCCCTGATAAAATGAGAGATCA
ATCAGCCTGCACAGCTCTTCTTCACGTGTACGTTCAAAATAATCTATCTGAAAAGGCTGAGGCTTTAATGGAGAAAATGTCTGAATGTGGTTTCTTAAAAAGTCCTCTTT
CTTTCAACCACATGCTATCTCTTCACATCTCAAACAAGCAACTAGAGAAGGTTCCTGATCTGATTCAAGTATTAAAGAAGAACACCAAACCAGATGTGGTAACATATAAT
CTTTTGTTGAATGTTTGCACTTTGCAAAATGACGTTGAAGCTGCAGAAAGCATTTTCCTTGAGATGAAGAAGACGAAAACCGAACCAGATTGGGTATCATTTAGCACATT
AGCTAACTTGTATTCCAAAAAACAACTTACCGAAAAAGCAGCGTCTACGTTGAAGGAGATGGAGAAAATGGCATCTCAAAGAAACAGAATCTCATTTTCGTCTCTTCTTA
GCTTGTATACCAATTTGGGGGATAAGAATGGAGTTTACAGAATATGGAAAAAGATGAAGTCATTGTTTCGCAAGATGAGTGATAGTGAGTACACTTGCATGATATCCTCT
CTTGTGAAACTTAATGAGCTTGGGGAAGCTGAGAAACTATATACCGAATGGGAGTCAGTATCCGGGACGGGTGATACTCGGGTTCCAAATATATTGCTTGCAGCGTATAT
CAACAAAAACCAAATGGAACAAGCCGAGAGTTTCTACAATCGGATGTCACTAAAAGGAATAGTTCCATCTTACACTACTTGGGAGCTCCTTACATGGGGTTATTTGAAAG
AGAACCAGATGGAGAAAGTGCTGCATTTCTTCAAGAATGCAGTTGGCAGCGTGAAGAAATGGAATGCGGACGAGAGGTTGGTTAAAGGAGTCTGTAAGAAACTCGAGGAG
CAGGGTAACATTGAAGGGGCAGAGCAGTTGTTGATTATTCTTAGGAATGCTGGTCATGTGGATACTGAGATATACAATTCTCTCTTGCGGACCTATGCAAAAGCTGGTAA
AATGCCACTTGTTGTTGCTGAAAGAATGGAAATGGACAACGTTCAGTTGAACGACGAGACTCGAGAGTTTCTAAGGTTGACCAGCAAGATGTGTGGCACTTATCACTCTT
CGAAAACCCTAGAGTTTGAGAAAGGCGGAAACCCTAGTTCAGCAATCGGAGCTTTTGCCGGCCTTTGCTCACTGAATTCAACAGCCATGGCCACTGCTAGGACCGTCAAG
GACGTCTCTCCTCACGAGTTTGTCAAGGCCTATGCCGCTCATCTCAAGCGATCCGGAAAGGTTGAACTTCCACCATGGGCGGACATTGTGAAAACTGCAAGGTTCAAAGA
GCTCGCTCCATATGATGCGGATTGGTATTACGTGAGAGCTGCATCCATGGCAAGGAAGATCTACTTGAGAGGAGGTCTTGGTGTTGGGGCATTTAAGCGGATTTATGGTG
GAAGCAAGAGGAATGGAAGTCGCCCTCCACACTTTTGTGAAAGCAGTGGAGCCATTGCCCGTCACATTCTACAACAGTTGCAGGAGATGAACATTGTTGATGTGGACCCA
AAGGGTGGAAGGAGAATTACTTCAAGTGGTCGACGAGACCTTGATCAAGTTGCTGGCCGGATTGTTGTTGCCCCTTGA
Protein sequenceShow/hide protein sequence
MFRSLRPSLATAAARRFSGEAHKAAAENTALEGGADVVSVTGGGRDTLGRRLMSLTFPKRSAVISIRKWQEEGHTLRKYELNRIVRELRKLKRYKHALEVCEWMTLQKDM
KLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRDQSACTALLHVYVQNNLSEKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLEKVPDLIQVLKKNTKPDVVTYN
LLLNVCTLQNDVEAAESIFLEMKKTKTEPDWVSFSTLANLYSKKQLTEKAASTLKEMEKMASQRNRISFSSLLSLYTNLGDKNGVYRIWKKMKSLFRKMSDSEYTCMISS
LVKLNELGEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKGVCKKLEE
QGNIEGAEQLLIILRNAGHVDTEIYNSLLRTYAKAGKMPLVVAERMEMDNVQLNDETREFLRLTSKMCGTYHSSKTLEFEKGGNPSSAIGAFAGLCSLNSTAMATARTVK
DVSPHEFVKAYAAHLKRSGKVELPPWADIVKTARFKELAPYDADWYYVRAASMARKIYLRGGLGVGAFKRIYGGSKRNGSRPPHFCESSGAIARHILQQLQEMNIVDVDP
KGGRRITSSGRRDLDQVAGRIVVAP