; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028089 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028089
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00153056:3316716..3322864
RNA-Seq ExpressionSgr028089
SyntenySgr028089
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022134787.1 pentatricopeptide repeat-containing protein At2g35030, mitochondrial-like [Momordica charantia]0.0e+0087.8Show/hide
Query:  MKSSLRSIGEKGSHVFNQNLKISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLF
        MKSSLRSIGEKGSHVFN NLKISQLGKSGRIEEAVAVFS MTEKNTV+YNSMISAYAKNGRIE+AR+LF++MP++NLVSWNSMIAGYLHND+V++AAKLF
Subjt:  MKSSLRSIGEKGSHVFNQNLKISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLF

Query:  DKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER
        DKMYKRDLYSWTLMITCYTRIGEL KARELF+LLPDKQDTVC NALIAGYAKKRRFDEAKKLFD MLVKN+VSWNS+LAGYTKNGE+ SGLQFFEAMGER
Subjt:  DKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER

Query:  NVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTT
        NVVSWNLMVDGY+EVGDLDSAWMIFRKIPTPNAVS+VTMFSGLAHYG   EAR+LFD+MPSKNLVSWNAMIGAYVRN+QIDDAFKLF EM+ERDSVSWTT
Subjt:  NVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTT

Query:  MINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQ
        MING VRVGKLSQAREILD+MPYKN+AAQTAMINGYVQSSRMDEANEIF+QIS+RD+VCWNT+I GY HCGRMDEALCLFREM  KDMVSWNTMIAGY Q
Subjt:  MINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQ

Query:  AGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQG-EKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMY
        A Q DKALGIFNEMR RN+VSWNSLITGYVQN  Y EALK F+ +K QG EKPDQSTFVCCLRASANLAAL+VGMQLHHLTIK+GFG++LFVKN IITMY
Subjt:  AGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQG-EKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMY

Query:  TKSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVD
        TKSGRVLDAE+VFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIR VIPDEVTFTGLLSACNHGGFV+QGLK+FKCMTETYLIKP SEHYACVVD
Subjt:  TKSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVD

Query:  LLGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEI
        LLGRVGRL EAMEIVEG KTVSSAKIWGALLWACR HQN ELAKY  ERLLELEP+NASNYVLLS++HAEA RWDMVERVRV M+ENKAEKQPGCSWIE+
Subjt:  LLGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEI

Query:  GNQVHCFLSEAPAEL-RPEICNILQTVTAQIRNTECMY
        GN+VH F+SE P+E  RPEIC IL+T+TAQIRNTE M+
Subjt:  GNQVHCFLSEAPAEL-RPEICNILQTVTAQIRNTECMY

XP_022936051.1 pentatricopeptide repeat-containing protein At4g02750-like [Cucurbita moschata]0.0e+0086.34Show/hide
Query:  MKSSLRSIGEKGSHVFNQNLKISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLF
        MKS++RSIGEKGS VFNQNL+I+QLGKSGRIEEAVAVFS M EKNTVTYNSMIS YAKNGRI NAR+LF+ MP++NLVSWNSMIAGYLHN+LVE+AAKLF
Subjt:  MKSSLRSIGEKGSHVFNQNLKISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLF

Query:  DKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER
        DKM KRDLYSWTLMITCYTRIGELEKAR+LFNLLPDKQD VC NALIAGYAKKRRFDEAK+LFD MLVKN+VSWNS+L+GYTKNGEM SGLQFFEAMGER
Subjt:  DKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER

Query:  NVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTT
        NVVSWNLMVDGYI VGDLDSAWMIFRKIP PN VSWVTMFSG AHYGRI EAR+LFD+MPSKNLVSWNAMIGAYVRN+QIDDA+KLF  M E+DSVSWTT
Subjt:  NVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTT

Query:  MINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQ
        MI+G VRV KLSQA EIL LMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRD+VCWNT+I GY HCGRM+EALCLFREM  KDMVSWNTMIAGY Q
Subjt:  MINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQ

Query:  AGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYT
        AGQ DKALG+FNEM+ERN+VSWNSLITGYVQNGLY EAL  FI MKQQGEKPDQSTF+CCLRASANLA L+VG+QLH+LTIKSGFGN+LFVKN IITMY 
Subjt:  AGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYT

Query:  KSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDL
        KSGRVL AENVF+EINNKDVVSWNS+IAGYALNGYGKEAVELFEEMSIRGV PDEVTFTGLLSACNHGGFV+QGL LFKCMT+TYLIKP SEHYACVVDL
Subjt:  KSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDL

Query:  LGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIG
        LGRVGRLEEAME+VEG + VSSAKIWGALLWACR+H N +LAKY  ERLL LEPQNASNYVLLS+MHAE GRWDMVE +RV M++NKAEKQPGCSWIE+ 
Subjt:  LGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIG

Query:  NQVHCFLSEAPAELRPEICNILQTVTAQIRNT
        NQVHCFL EAP +LRPEICNIL+TVTAQIRNT
Subjt:  NQVHCFLSEAPAELRPEICNILQTVTAQIRNT

XP_022976715.1 pentatricopeptide repeat-containing protein At4g02750-like [Cucurbita maxima]0.0e+0086.2Show/hide
Query:  MKSSLRSIGEKGSHVFNQNLKISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLF
        MKS++RSIG KGS VFNQNL+I+QLGK GRIEEAVAVFS MTEKNTVTYNSMIS YAKNGRI NAR+LF+ MP++NLVSWNSMIAGYLHN+LVE+AAKLF
Subjt:  MKSSLRSIGEKGSHVFNQNLKISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLF

Query:  DKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER
        DKM KRDLYSWTLMITCYTRIGELEKARELFNLLPDKQD VC NALIAGYAKKRRFD+AK+LFD MLVKN+VSWNS+L+GYTKNGEM SGLQFFEAMGER
Subjt:  DKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER

Query:  NVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTT
        NVVSWNLMVDGYI VGDLDSAWMIFR+IPTPN VSWVTM SG AHYGRITEAR+LFD+MPSKNLVSWNAMIGAYVRN+QIDDA+KLF EM E+DSVSWTT
Subjt:  NVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTT

Query:  MINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQ
        MI+G VRVGKLSQA EIL LMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRD+VCWNT+I GY HCGRM+EALCLFREM  KDMVSWNTMI+GY Q
Subjt:  MINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQ

Query:  AGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYT
        AGQ DKALG+FNEM+ERN+VSWNSLITGYVQNGLY EAL  FI MKQQGEKPDQSTF+CCLRASANLA L+VG+QLH+LTIKSGFGN+LFVKN IITMY 
Subjt:  AGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYT

Query:  KSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDL
        KSGRVL AE++F+EINNKDVVSWNS+IAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFV+QGL LFKCMT+TYLIKP SEHYACVVDL
Subjt:  KSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDL

Query:  LGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIG
        LGRVGRLEEA+E+VEG + VSSAKIWGALLWACR+H N +LAKY  ERLL LEPQNASNYVLLS+MHA  GRWDMVE +RV M++NKAEKQPGCSWIEI 
Subjt:  LGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIG

Query:  NQVHCFLSEAPAELRPEICNILQTVTAQIRNT
        NQVHCFLSEAP +LRPEICNIL+TVTAQIRNT
Subjt:  NQVHCFLSEAPAELRPEICNILQTVTAQIRNT

XP_023535919.1 pentatricopeptide repeat-containing protein At4g02750-like [Cucurbita pepo subsp. pepo]0.0e+0086.48Show/hide
Query:  MKSSLRSIGEKGSHVFNQNLKISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLF
        MKS++RSIGEKGS VFNQNL+I+QLGKSGRIEEAVAVFS M EKNTVTYNSMIS YAKNGRI NAR+LF+ MPR+NLVSWNSMIAGYLHN+LVE+AAKLF
Subjt:  MKSSLRSIGEKGSHVFNQNLKISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLF

Query:  DKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER
        DKM KRDLYSWTLMITCYTRIGELEKARELFNLLPDKQD VC NALIAGYAKKRRFDEAK+LFD MLVKN+VSWNS+L+GYTKNGEM SGLQFFEAMGER
Subjt:  DKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER

Query:  NVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTT
        NVVSWNLMVDGYI VGDLDSAWMIFRKIPTPN VSWVTMFSG AHYGRI EAR+LFD+MPSKNLVSWNAMIGAYVRN+QIDDA+KLF  M E+DSVSWTT
Subjt:  NVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTT

Query:  MINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQ
        MI+G VRVGKLSQA EIL LMPYKN+AAQTAMINGYVQS RMDEANEIFSQISVRD+VCWNT+I GY HCGRM+EALCLFREM  KDMVSWNTMI+GY Q
Subjt:  MINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQ

Query:  AGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYT
        AGQ DKALG+FNEM+ERN+VSWNSLITGYVQNGLY EAL  FI MKQQGEKPDQSTF+CCLRASANLA L+VG+QLH+LTIKSGFGN+LFVKN IITMY 
Subjt:  AGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYT

Query:  KSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDL
        KSGRVL AENVF+EINNKDVVSWNS+IAGYALNGYGKEAVELFEEMSIRGV PDEVTFTGLLSACNHGGFV+QGL LFKCMT+TYLIKP SEHYACVVDL
Subjt:  KSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDL

Query:  LGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIG
        LGRVGRLEEAME+VEG + VSSAKIWGALLWACR+H N +LAKY  ERLL LEPQNASNYVLLS+MHAE  RWDMVE +RV M++NKAEKQPGCSWIEI 
Subjt:  LGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIG

Query:  NQVHCFLSEAPAELRPEICNILQTVTAQIRNT
        +QVHCFLSEAP +LRPEICNIL+TVTAQIRNT
Subjt:  NQVHCFLSEAPAELRPEICNILQTVTAQIRNT

XP_038897172.1 pentatricopeptide repeat-containing protein At1g09410, mitochondrial-like [Benincasa hispida]0.0e+0087.21Show/hide
Query:  MKSSLRSIGEKGSHVFNQNLKISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLF
        MK +++SIGEKGS+VF QNL+ISQLGKSGRIEEAVAVFS MTEKN VTYNSMISAYAKNGRIENARELF+LMP++NLVSWNSMIAGYLHN+LVEDAAKLF
Subjt:  MKSSLRSIGEKGSHVFNQNLKISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLF

Query:  DKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER
        DKM+KRDLYSWTLMITCYTRIGELEKA+ELFNLLPDKQDTVC NALIAGYAKKRRFDEAKKLFDEMLVKN+VSWNS+L+GYTKNGEM  GLQFFEAMGER
Subjt:  DKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER

Query:  NVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTT
        NVVSWNLMVDGYI VGDLDSAWMIF+KIPTPN VSWVTMFSG AHYGRITEAR LFD++PSKNLVSWNAMIGAYVR++QIDDA+KLF EM E+DSVS+TT
Subjt:  NVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTT

Query:  MINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQ
        MING VRVGKLSQAREIL+LMPYKNIAAQTAMINGYVQSSR+DEANEIFSQISVRD+VCWNT+I GY HCGRMDEALCLF++M  KDMVSWNTMIAGY Q
Subjt:  MINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQ

Query:  AGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYT
        +GQ  KALGIFNEM+ERN+VSWNSLITGYVQNGLY EAL  FI MKQQGEKPDQSTFVCCLRASANLAAL +G+QLHHL IK+GFG DLFVKN I+TMY 
Subjt:  AGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYT

Query:  KSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDL
        KSGRVL+AENVFAE NNKDVVSWNSLIAGYALNG GKEA+ LFEEMSIRG+IPDEVTFTGLLSACNHGG V+QGL LFKCMTETY IKP SEHYACVVDL
Subjt:  KSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDL

Query:  LGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIG
        LGRVGRLEEAMEIVEG KT+SSAKIWGALLW CR HQN ELAKY  ERLL LEPQNASNYVLLS+MHAEAGRWDMVERVRV M+ENKAEKQPGCSWIEI 
Subjt:  LGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIG

Query:  NQVHCFLSEAPAELRPEICNILQTVTAQIRNTECM
         QVHCFLSEAP  LR EICNIL+TVTAQIRNTE M
Subjt:  NQVHCFLSEAPAELRPEICNILQTVTAQIRNTECM

TrEMBL top hitse value%identityAlignment
A0A0A0KZ81 Uncharacterized protein0.0e+0084.06Show/hide
Query:  KISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLFDKMYKRDLYSWTLMITCYTR
        +  +LG+SGRIEEAVAVF  MTE+N VTYNSMISAYAKNGRI NARELF+LMP++NLVSWNSMIAGYLHN+LVEDAA+LFD+M+KRD+YSWTLMITCYTR
Subjt:  KISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLFDKMYKRDLYSWTLMITCYTR

Query:  IGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGERNVVSWNLMVDGYIEVGDLDS
        IGELEKARELFNLLPDKQDTVCRNALIAGYAKKR F EAKKLFDEMLVKNVVSWNS+L+GYTKNG+M  GLQFFEAMGERNVVSWNLMVDGY+ VGDLDS
Subjt:  IGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGERNVVSWNLMVDGYIEVGDLDS

Query:  AWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCVRVGKLSQAREILDL
        AWM F+KIPTPN VSWVTM SG AHYGR+TEAR+LF++MP+KNLVSWNAMIGAYVR +QIDDA+KLF EM E+DSVSWT MING VRVGKL QAREIL+L
Subjt:  AWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCVRVGKLSQAREILDL

Query:  MPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDKALGIFNEMRERNVV
        MPYKNIAAQTAMINGY+QS RMDEANEIFSQISVRD+VCWN++I GYAHCGR DEAL LF+EM  KDMVSWNTMIA Y QAGQ DKAL +FNEM+ERNVV
Subjt:  MPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDKALGIFNEMRERNVV

Query:  SWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVLDAENVFAEINNKDV
        SWNSLITGYVQNGLY EAL  FI MKQQGEKPDQ+T VCCLRASANLAAL VG+QLHHLTIK+GFGNDLFVKN I+TMY KSGRV +AENVFAEI NKDV
Subjt:  SWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVLDAENVFAEINNKDV

Query:  VSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGRLEEAMEIVEGRKTV
        VSWNSLIAGYALNG GKEAVELFE M +RG+IPDEVTFTGLLSACNHGGFV+QGL LFK MTETY IKP SEHYACV++LLGRVGRLEEA+EIV+G KTV
Subjt:  VSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGRLEEAMEIVEGRKTV

Query:  SSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIGNQVHCFLSEAPAELRPEICN
        SSAKIWGALLWACR+H N ELAKY  ERLL LEPQNASNYVLLS+MHAEAGRWDMVERVRV M+ENKAEKQPGCSWIEI NQ+HCFLS+AP +LRPEICN
Subjt:  SSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIGNQVHCFLSEAPAELRPEICN

Query:  ILQTVTAQIRNTECM
        IL+TV    RNTE M
Subjt:  ILQTVTAQIRNTECM

A0A1S3CM06 pentatricopeptide repeat-containing protein At4g02750-like0.0e+0084.63Show/hide
Query:  MKSSLRSIGEKGSHVFNQNLKISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLF
        MK ++RSIGEKGS+V  QNL+ISQLGKSGR+EEAVA+F  MTEKN VTYNSMISAYAKNGRI NARELF+LMP++NLVSWNSMIAGYLHN+LVEDAAKLF
Subjt:  MKSSLRSIGEKGSHVFNQNLKISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLF

Query:  DKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER
        D+M+KRD+YSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKN+VSWNS+L+GYTKNGEM  GLQFFEAMGER
Subjt:  DKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER

Query:  NVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTT
        NVVSWNLM+DGYI VGDLDSAWM F+KIPTP+ VSWVTM SG AH GR+TEAR+LF++MP+KNLVSWNAMIGAYVR++QIDDA+KLF EM E+DSVSWT 
Subjt:  NVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTT

Query:  MINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQ
        MING VRVGKL QAREIL+LMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRD+VCWNT+I GYAHCGRMDEAL LF+EM  KDMVSWNTMIAGY Q
Subjt:  MINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQ

Query:  AGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYT
        AGQ DKAL +FN M+ERNVVSWNSLITGYVQNGLY EAL  FI MKQQGEKPDQ+T VCCLRASANLAAL+VG+QLHHLTIK+GF NDLFVKN I+TMY 
Subjt:  AGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYT

Query:  KSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDL
        KSGRV +AENVFAEINNKDVVSWNSLI+GYALNGYGKEAVELFE M +RGVIPDEVTFTGLLSACNHGGFV+QGL LFK MTETY IKP  EHYACV++L
Subjt:  KSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDL

Query:  LGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIG
        LGRVGRLEEA+EIVEG K VSSAKIWGALLWACR+H N ELAKY   RLL LEPQNASNYVLLS+MHAEAGRWDMVERVRV M+ENKAEKQPGCSWIEI 
Subjt:  LGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIG

Query:  NQVHCFLSEAPAELRPEICNILQTVTAQIRNTECM
        NQ+HCFLS+AP +LRPEICNIL+TV    RNTE M
Subjt:  NQVHCFLSEAPAELRPEICNILQTVTAQIRNTECM

A0A6J1C0K6 pentatricopeptide repeat-containing protein At2g35030, mitochondrial-like0.0e+0087.8Show/hide
Query:  MKSSLRSIGEKGSHVFNQNLKISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLF
        MKSSLRSIGEKGSHVFN NLKISQLGKSGRIEEAVAVFS MTEKNTV+YNSMISAYAKNGRIE+AR+LF++MP++NLVSWNSMIAGYLHND+V++AAKLF
Subjt:  MKSSLRSIGEKGSHVFNQNLKISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLF

Query:  DKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER
        DKMYKRDLYSWTLMITCYTRIGEL KARELF+LLPDKQDTVC NALIAGYAKKRRFDEAKKLFD MLVKN+VSWNS+LAGYTKNGE+ SGLQFFEAMGER
Subjt:  DKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER

Query:  NVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTT
        NVVSWNLMVDGY+EVGDLDSAWMIFRKIPTPNAVS+VTMFSGLAHYG   EAR+LFD+MPSKNLVSWNAMIGAYVRN+QIDDAFKLF EM+ERDSVSWTT
Subjt:  NVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTT

Query:  MINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQ
        MING VRVGKLSQAREILD+MPYKN+AAQTAMINGYVQSSRMDEANEIF+QIS+RD+VCWNT+I GY HCGRMDEALCLFREM  KDMVSWNTMIAGY Q
Subjt:  MINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQ

Query:  AGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQG-EKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMY
        A Q DKALGIFNEMR RN+VSWNSLITGYVQN  Y EALK F+ +K QG EKPDQSTFVCCLRASANLAAL+VGMQLHHLTIK+GFG++LFVKN IITMY
Subjt:  AGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQG-EKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMY

Query:  TKSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVD
        TKSGRVLDAE+VFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIR VIPDEVTFTGLLSACNHGGFV+QGLK+FKCMTETYLIKP SEHYACVVD
Subjt:  TKSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVD

Query:  LLGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEI
        LLGRVGRL EAMEIVEG KTVSSAKIWGALLWACR HQN ELAKY  ERLLELEP+NASNYVLLS++HAEA RWDMVERVRV M+ENKAEKQPGCSWIE+
Subjt:  LLGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEI

Query:  GNQVHCFLSEAPAEL-RPEICNILQTVTAQIRNTECMY
        GN+VH F+SE P+E  RPEIC IL+T+TAQIRNTE M+
Subjt:  GNQVHCFLSEAPAEL-RPEICNILQTVTAQIRNTECMY

A0A6J1F6F3 pentatricopeptide repeat-containing protein At4g02750-like0.0e+0086.34Show/hide
Query:  MKSSLRSIGEKGSHVFNQNLKISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLF
        MKS++RSIGEKGS VFNQNL+I+QLGKSGRIEEAVAVFS M EKNTVTYNSMIS YAKNGRI NAR+LF+ MP++NLVSWNSMIAGYLHN+LVE+AAKLF
Subjt:  MKSSLRSIGEKGSHVFNQNLKISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLF

Query:  DKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER
        DKM KRDLYSWTLMITCYTRIGELEKAR+LFNLLPDKQD VC NALIAGYAKKRRFDEAK+LFD MLVKN+VSWNS+L+GYTKNGEM SGLQFFEAMGER
Subjt:  DKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER

Query:  NVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTT
        NVVSWNLMVDGYI VGDLDSAWMIFRKIP PN VSWVTMFSG AHYGRI EAR+LFD+MPSKNLVSWNAMIGAYVRN+QIDDA+KLF  M E+DSVSWTT
Subjt:  NVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTT

Query:  MINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQ
        MI+G VRV KLSQA EIL LMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRD+VCWNT+I GY HCGRM+EALCLFREM  KDMVSWNTMIAGY Q
Subjt:  MINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQ

Query:  AGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYT
        AGQ DKALG+FNEM+ERN+VSWNSLITGYVQNGLY EAL  FI MKQQGEKPDQSTF+CCLRASANLA L+VG+QLH+LTIKSGFGN+LFVKN IITMY 
Subjt:  AGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYT

Query:  KSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDL
        KSGRVL AENVF+EINNKDVVSWNS+IAGYALNGYGKEAVELFEEMSIRGV PDEVTFTGLLSACNHGGFV+QGL LFKCMT+TYLIKP SEHYACVVDL
Subjt:  KSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDL

Query:  LGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIG
        LGRVGRLEEAME+VEG + VSSAKIWGALLWACR+H N +LAKY  ERLL LEPQNASNYVLLS+MHAE GRWDMVE +RV M++NKAEKQPGCSWIE+ 
Subjt:  LGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIG

Query:  NQVHCFLSEAPAELRPEICNILQTVTAQIRNT
        NQVHCFL EAP +LRPEICNIL+TVTAQIRNT
Subjt:  NQVHCFLSEAPAELRPEICNILQTVTAQIRNT

A0A6J1IK90 pentatricopeptide repeat-containing protein At4g02750-like0.0e+0086.2Show/hide
Query:  MKSSLRSIGEKGSHVFNQNLKISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLF
        MKS++RSIG KGS VFNQNL+I+QLGK GRIEEAVAVFS MTEKNTVTYNSMIS YAKNGRI NAR+LF+ MP++NLVSWNSMIAGYLHN+LVE+AAKLF
Subjt:  MKSSLRSIGEKGSHVFNQNLKISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLF

Query:  DKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER
        DKM KRDLYSWTLMITCYTRIGELEKARELFNLLPDKQD VC NALIAGYAKKRRFD+AK+LFD MLVKN+VSWNS+L+GYTKNGEM SGLQFFEAMGER
Subjt:  DKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER

Query:  NVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTT
        NVVSWNLMVDGYI VGDLDSAWMIFR+IPTPN VSWVTM SG AHYGRITEAR+LFD+MPSKNLVSWNAMIGAYVRN+QIDDA+KLF EM E+DSVSWTT
Subjt:  NVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTT

Query:  MINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQ
        MI+G VRVGKLSQA EIL LMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRD+VCWNT+I GY HCGRM+EALCLFREM  KDMVSWNTMI+GY Q
Subjt:  MINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQ

Query:  AGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYT
        AGQ DKALG+FNEM+ERN+VSWNSLITGYVQNGLY EAL  FI MKQQGEKPDQSTF+CCLRASANLA L+VG+QLH+LTIKSGFGN+LFVKN IITMY 
Subjt:  AGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYT

Query:  KSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDL
        KSGRVL AE++F+EINNKDVVSWNS+IAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFV+QGL LFKCMT+TYLIKP SEHYACVVDL
Subjt:  KSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDL

Query:  LGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIG
        LGRVGRLEEA+E+VEG + VSSAKIWGALLWACR+H N +LAKY  ERLL LEPQNASNYVLLS+MHA  GRWDMVE +RV M++NKAEKQPGCSWIEI 
Subjt:  LGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIG

Query:  NQVHCFLSEAPAELRPEICNILQTVTAQIRNT
        NQVHCFLSEAP +LRPEICNIL+TVTAQIRNT
Subjt:  NQVHCFLSEAPAELRPEICNILQTVTAQIRNT

SwissProt top hitse value%identityAlignment
O04590 Pentatricopeptide repeat-containing protein At1g62260, mitochondrial8.2e-12134.76Show/hide
Query:  NSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLFDKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAG
        N  ++   ++G I  AR++F  +  +N V+WN+MI+GY+    +  A KLFD M KRD+ +W  MI+ Y   G +                         
Subjt:  NSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLFDKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAG

Query:  YAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGERNVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRI
            R  +EA+KLFDEM  ++  SWN++++GY KN  +   L  FE M ERN VSW+ M+ G+ + G++DSA ++FRK+P  ++     + +GL    R+
Subjt:  YAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGERNVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRI

Query:  TEARDLFDQMPS-----KNLV-SWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMD
        +EA  +  Q  S     ++LV ++N +I  Y +  Q++ A  LF ++ +         + G    G+  +          KN+ +  +MI  Y++   + 
Subjt:  TEARDLFDQMPS-----KNLV-SWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMD

Query:  EANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFI
         A  +F Q+  RDT+ WNT+I GY H  RM++A  LF EM  +D  SWN M++GY   G  + A   F +  E++ VSWNS+I  Y +N  Y EA+  FI
Subjt:  EANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFI

Query:  SMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVLDAENVFAEIN-NKDVVSWNSLIAGYALNGYGKEAVEL
         M  +GEKPD  T    L AS  L  LR+GMQ+H + +K+    D+ V N +ITMY++ G ++++  +F E+   ++V++WN++I GYA +G   EA+ L
Subjt:  SMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVLDAENVFAEIN-NKDVVSWNSLIAGYALNGYGKEAVEL

Query:  FEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELA
        F  M   G+ P  +TF  +L+AC H G V++    F  M   Y I+P  EHY+ +V++    G+ EEAM I+          +WGALL ACR++ N  LA
Subjt:  FEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELA

Query:  KYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIE
            E +  LEP++++ YVLL +M+A+ G WD   +VR++M   + +K+ G SW++
Subjt:  KYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIE

O64766 Pentatricopeptide repeat-containing protein At2g35030, mitochondrial4.2e-12537.14Show/hide
Query:  EKARELFNLLPDKQDTVCR------NALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER-NVVSWNLMVDGYIEVGD
        +++ +LFNL+     +  R        LI    K  +  EA+KLFD +  ++VV+W  ++ GY K G+M    + F+ +  R NVV+W  MV GY+    
Subjt:  EKARELFNLLPDKQDTVCR------NALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER-NVVSWNLMVDGYIEVGD

Query:  LDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCVRVGKLSQAREI
        L  A M+F+++P  N VSW TM  G A  GRI +A +LFD+MP +N+VSWN+M+ A V+  +ID+A  LF+ M  RD VSWT M++G  + GK+ +AR +
Subjt:  LDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCVRVGKLSQAREI

Query:  LDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDKALGIFNEMRER
         D MP +NI +  AMI GY Q++R+DEA++                               LF+ M  +D  SWNTMI G+++  + +KA G+F+ M E+
Subjt:  LDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDKALGIFNEMRER

Query:  NVVSWNSLITGYVQNGLYVEALKRFISMKQQGE-KPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVLDAENVFAE--
        NV+SW ++ITGYV+N    EAL  F  M + G  KP+  T+V  L A ++LA L  G Q+H L  KS    +  V + ++ MY+KSG ++ A  +F    
Subjt:  NVVSWNSLITGYVQNGLYVEALKRFISMKQQGE-KPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVLDAENVFAE--

Query:  INNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGRLEEAMEIV
        +  +D++SWNS+IA YA +G+GKEA+E++ +M   G  P  VT+  LL AC+H G V +G++ FK +     +    EHY C+VDL GR GRL++    +
Subjt:  INNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGRLEEAMEIV

Query:  EGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIGNQVHCFLSEAPAEL
               S   +GA+L AC VH    +AK  V+++LE    +A  YVL+S+++A  G+ +    +R+ M+E   +KQPGCSW+++G Q H F+     + 
Subjt:  EGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIGNQVHCFLSEAPAEL

Query:  RPEICNILQTVTAQIRNTECMYIHTSLDAE
         P+    L ++ + +RN      + + DAE
Subjt:  RPEICNILQTVTAQIRNTECMYIHTSLDAE

Q56XI1 Pentatricopeptide repeat-containing protein At1g09410, mitochondrial4.8e-13738.26Show/hide
Query:  NALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGERNVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGL
        N  I   ++  +  EA+KLFD    K++ SWNS++AGY  N       + F+ M +RN++SWN +V GY++ G++D A  +F  +P  N VSW  +  G 
Subjt:  NALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGERNVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGL

Query:  AHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMD
         H G++  A  LF +MP KN VSW  M+  ++++ +IDDA KL++ + ++D+++ T+MI+G  + G++ +AREI D M  +++   T M+ GY Q++R+D
Subjt:  AHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMD

Query:  EANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFI
        +A +IF  +  +  V W +++ GY   GR+++A  LF  M  K +++ N MI+G  Q G+  KA  +F+ M+ERN  SW ++I  + +NG  +EAL  FI
Subjt:  EANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFI

Query:  SMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELF
         M++QG +P   T +  L   A+LA+L  G Q+H   ++  F  D++V +V++TMY K G ++ ++ +F    +KD++ WNS+I+GYA +G G+EA+++F
Subjt:  SMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELF

Query:  EEMSIRG-VIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELA
         EM + G   P+EVTF   LSAC++ G V +GLK+++ M   + +KP++ HYAC+VD+LGR GR  EAME+++       A +WG+LL ACR H   ++A
Subjt:  EEMSIRG-VIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELA

Query:  KYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIGNQVHCFLSEAPAELRPE---ICNILQTVTAQIR----NTECM
        ++  ++L+E+EP+N+  Y+LLS+M+A  GRW  V  +R  M+     K PGCSW E+ N+VH F +       PE   I  IL  +   +R    N +C 
Subjt:  KYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIGNQVHCFLSEAPAELRPE---ICNILQTVTAQIR----NTECM

Query:  YIHTSLDAE
        Y    +D E
Subjt:  YIHTSLDAE

Q9FXB9 Pentatricopeptide repeat-containing protein At1g56690, mitochondrial5.1e-13139.67Show/hide
Query:  RFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGERNVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARD
        + +EA+K FD +  K + SWNS+++GY  NG      Q F+ M ERNVVSWN +V GYI+   +  A  +F  +P  N VSW  M  G    G + EA  
Subjt:  RFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGERNVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARD

Query:  LFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISV
        LF +MP +N VSW  M G  + + +ID A KL+  M  +D V+ T MI G  R G++ +AR I D M  +N+   T MI GY Q++R+D A ++F  +  
Subjt:  LFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISV

Query:  RDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQ
        +  V W +++ GY   GR+++A   F  M  K +++ N MI G+ + G+  KA  +F+ M +R+  +W  +I  Y + G  +EAL  F  M++QG +P  
Subjt:  RDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQ

Query:  STFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPD
         + +  L   A LA+L+ G Q+H   ++  F +D++V +V++TMY K G ++ A+ VF   ++KD++ WNS+I+GYA +G G+EA+++F EM   G +P+
Subjt:  STFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPD

Query:  EVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEP
        +VT   +L+AC++ G + +GL++F+ M   + + P  EHY+C VD+LGR G++++AME++E       A +WGALL AC+ H   +LA+   ++L E EP
Subjt:  EVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEP

Query:  QNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIGNQVHCF
         NA  YVLLSS++A   +W  V  VR +MR N   K PGCSWIE+G +VH F
Subjt:  QNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIGNQVHCF

Q9SY02 Pentatricopeptide repeat-containing protein At4g027503.6e-14540.33Show/hide
Query:  DLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGERNVVSWN
        D+  W + I+ Y R G   +A  +F  +P +  +V  N +I+GY +   F+ A+KLFDEM  +++VSWN ++ GY +N  +    + FE M ER+V SWN
Subjt:  DLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGERNVVSWN

Query:  LMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCV
         M+ GY + G +D A  +F ++P  N VSW  + S      ++ EA  LF    +  LVSWN ++G +V+  +I +A + F  M  RD VSW T+I G  
Subjt:  LMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCV

Query:  RVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDK
        + GK+ +AR++ D  P +++   TAM++GY+Q+  ++EA E+F ++  R+ V WN ++AGY    RM+ A  LF  M  +++ +WNTMI GY Q G+  +
Subjt:  RVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDK

Query:  ALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVL
        A  +F++M +R+ VSW ++I GY Q+G   EAL+ F+ M+++G + ++S+F   L   A++ AL +G QLH   +K G+    FV N ++ MY K G + 
Subjt:  ALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVL

Query:  DAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGR
        +A ++F E+  KD+VSWN++IAGY+ +G+G+ A+  FE M   G+ PD+ T   +LSAC+H G V++G + F  MT+ Y + P S+HYAC+VDLLGR G 
Subjt:  DAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGR

Query:  LEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIGNQVHCF
        LE+A  +++       A IWG LL A RVH N ELA+   +++  +EP+N+  YVLLS+++A +GRW  V ++RV MR+   +K PG SWIEI N+ H F
Subjt:  LEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIGNQVHCF

Arabidopsis top hitse value%identityAlignment
AT1G09410.1 pentatricopeptide (PPR) repeat-containing protein3.4e-13838.26Show/hide
Query:  NALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGERNVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGL
        N  I   ++  +  EA+KLFD    K++ SWNS++AGY  N       + F+ M +RN++SWN +V GY++ G++D A  +F  +P  N VSW  +  G 
Subjt:  NALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGERNVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGL

Query:  AHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMD
         H G++  A  LF +MP KN VSW  M+  ++++ +IDDA KL++ + ++D+++ T+MI+G  + G++ +AREI D M  +++   T M+ GY Q++R+D
Subjt:  AHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMD

Query:  EANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFI
        +A +IF  +  +  V W +++ GY   GR+++A  LF  M  K +++ N MI+G  Q G+  KA  +F+ M+ERN  SW ++I  + +NG  +EAL  FI
Subjt:  EANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFI

Query:  SMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELF
         M++QG +P   T +  L   A+LA+L  G Q+H   ++  F  D++V +V++TMY K G ++ ++ +F    +KD++ WNS+I+GYA +G G+EA+++F
Subjt:  SMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELF

Query:  EEMSIRG-VIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELA
         EM + G   P+EVTF   LSAC++ G V +GLK+++ M   + +KP++ HYAC+VD+LGR GR  EAME+++       A +WG+LL ACR H   ++A
Subjt:  EEMSIRG-VIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELA

Query:  KYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIGNQVHCFLSEAPAELRPE---ICNILQTVTAQIR----NTECM
        ++  ++L+E+EP+N+  Y+LLS+M+A  GRW  V  +R  M+     K PGCSW E+ N+VH F +       PE   I  IL  +   +R    N +C 
Subjt:  KYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIGNQVHCFLSEAPAELRPE---ICNILQTVTAQIR----NTECM

Query:  YIHTSLDAE
        Y    +D E
Subjt:  YIHTSLDAE

AT1G56690.1 Pentatricopeptide repeat (PPR) superfamily protein3.6e-13239.67Show/hide
Query:  RFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGERNVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARD
        + +EA+K FD +  K + SWNS+++GY  NG      Q F+ M ERNVVSWN +V GYI+   +  A  +F  +P  N VSW  M  G    G + EA  
Subjt:  RFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGERNVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARD

Query:  LFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISV
        LF +MP +N VSW  M G  + + +ID A KL+  M  +D V+ T MI G  R G++ +AR I D M  +N+   T MI GY Q++R+D A ++F  +  
Subjt:  LFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISV

Query:  RDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQ
        +  V W +++ GY   GR+++A   F  M  K +++ N MI G+ + G+  KA  +F+ M +R+  +W  +I  Y + G  +EAL  F  M++QG +P  
Subjt:  RDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQ

Query:  STFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPD
         + +  L   A LA+L+ G Q+H   ++  F +D++V +V++TMY K G ++ A+ VF   ++KD++ WNS+I+GYA +G G+EA+++F EM   G +P+
Subjt:  STFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPD

Query:  EVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEP
        +VT   +L+AC++ G + +GL++F+ M   + + P  EHY+C VD+LGR G++++AME++E       A +WGALL AC+ H   +LA+   ++L E EP
Subjt:  EVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEP

Query:  QNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIGNQVHCF
         NA  YVLLSS++A   +W  V  VR +MR N   K PGCSWIE+G +VH F
Subjt:  QNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIGNQVHCF

AT1G62260.1 mitochondrial editing factor 95.8e-12234.76Show/hide
Query:  NSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLFDKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAG
        N  ++   ++G I  AR++F  +  +N V+WN+MI+GY+    +  A KLFD M KRD+ +W  MI+ Y   G +                         
Subjt:  NSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLFDKMYKRDLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAG

Query:  YAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGERNVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRI
            R  +EA+KLFDEM  ++  SWN++++GY KN  +   L  FE M ERN VSW+ M+ G+ + G++DSA ++FRK+P  ++     + +GL    R+
Subjt:  YAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGERNVVSWNLMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRI

Query:  TEARDLFDQMPS-----KNLV-SWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMD
        +EA  +  Q  S     ++LV ++N +I  Y +  Q++ A  LF ++ +         + G    G+  +          KN+ +  +MI  Y++   + 
Subjt:  TEARDLFDQMPS-----KNLV-SWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCVRVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMD

Query:  EANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFI
         A  +F Q+  RDT+ WNT+I GY H  RM++A  LF EM  +D  SWN M++GY   G  + A   F +  E++ VSWNS+I  Y +N  Y EA+  FI
Subjt:  EANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFI

Query:  SMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVLDAENVFAEIN-NKDVVSWNSLIAGYALNGYGKEAVEL
         M  +GEKPD  T    L AS  L  LR+GMQ+H + +K+    D+ V N +ITMY++ G ++++  +F E+   ++V++WN++I GYA +G   EA+ L
Subjt:  SMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVLDAENVFAEIN-NKDVVSWNSLIAGYALNGYGKEAVEL

Query:  FEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELA
        F  M   G+ P  +TF  +L+AC H G V++    F  M   Y I+P  EHY+ +V++    G+ EEAM I+          +WGALL ACR++ N  LA
Subjt:  FEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELA

Query:  KYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIE
            E +  LEP++++ YVLL +M+A+ G WD   +VR++M   + +K+ G SW++
Subjt:  KYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIE

AT2G35030.1 Pentatricopeptide repeat (PPR) superfamily protein3.0e-12637.14Show/hide
Query:  EKARELFNLLPDKQDTVCR------NALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER-NVVSWNLMVDGYIEVGD
        +++ +LFNL+     +  R        LI    K  +  EA+KLFD +  ++VV+W  ++ GY K G+M    + F+ +  R NVV+W  MV GY+    
Subjt:  EKARELFNLLPDKQDTVCR------NALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGER-NVVSWNLMVDGYIEVGD

Query:  LDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCVRVGKLSQAREI
        L  A M+F+++P  N VSW TM  G A  GRI +A +LFD+MP +N+VSWN+M+ A V+  +ID+A  LF+ M  RD VSWT M++G  + GK+ +AR +
Subjt:  LDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCVRVGKLSQAREI

Query:  LDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDKALGIFNEMRER
         D MP +NI +  AMI GY Q++R+DEA++                               LF+ M  +D  SWNTMI G+++  + +KA G+F+ M E+
Subjt:  LDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDKALGIFNEMRER

Query:  NVVSWNSLITGYVQNGLYVEALKRFISMKQQGE-KPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVLDAENVFAE--
        NV+SW ++ITGYV+N    EAL  F  M + G  KP+  T+V  L A ++LA L  G Q+H L  KS    +  V + ++ MY+KSG ++ A  +F    
Subjt:  NVVSWNSLITGYVQNGLYVEALKRFISMKQQGE-KPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVLDAENVFAE--

Query:  INNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGRLEEAMEIV
        +  +D++SWNS+IA YA +G+GKEA+E++ +M   G  P  VT+  LL AC+H G V +G++ FK +     +    EHY C+VDL GR GRL++    +
Subjt:  INNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGRLEEAMEIV

Query:  EGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIGNQVHCFLSEAPAEL
               S   +GA+L AC VH    +AK  V+++LE    +A  YVL+S+++A  G+ +    +R+ M+E   +KQPGCSW+++G Q H F+     + 
Subjt:  EGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIGNQVHCFLSEAPAEL

Query:  RPEICNILQTVTAQIRNTECMYIHTSLDAE
         P+    L ++ + +RN      + + DAE
Subjt:  RPEICNILQTVTAQIRNTECMYIHTSLDAE

AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.6e-14640.33Show/hide
Query:  DLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGERNVVSWN
        D+  W + I+ Y R G   +A  +F  +P +  +V  N +I+GY +   F+ A+KLFDEM  +++VSWN ++ GY +N  +    + FE M ER+V SWN
Subjt:  DLYSWTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGERNVVSWN

Query:  LMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCV
         M+ GY + G +D A  +F ++P  N VSW  + S      ++ EA  LF    +  LVSWN ++G +V+  +I +A + F  M  RD VSW T+I G  
Subjt:  LMVDGYIEVGDLDSAWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCV

Query:  RVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDK
        + GK+ +AR++ D  P +++   TAM++GY+Q+  ++EA E+F ++  R+ V WN ++AGY    RM+ A  LF  M  +++ +WNTMI GY Q G+  +
Subjt:  RVGKLSQAREILDLMPYKNIAAQTAMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDK

Query:  ALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVL
        A  +F++M +R+ VSW ++I GY Q+G   EAL+ F+ M+++G + ++S+F   L   A++ AL +G QLH   +K G+    FV N ++ MY K G + 
Subjt:  ALGIFNEMRERNVVSWNSLITGYVQNGLYVEALKRFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVL

Query:  DAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGR
        +A ++F E+  KD+VSWN++IAGY+ +G+G+ A+  FE M   G+ PD+ T   +LSAC+H G V++G + F  MT+ Y + P S+HYAC+VDLLGR G 
Subjt:  DAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRGVIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGR

Query:  LEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIGNQVHCF
        LE+A  +++       A IWG LL A RVH N ELA+   +++  +EP+N+  YVLLS+++A +GRW  V ++RV MR+   +K PG SWIEI N+ H F
Subjt:  LEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNYVLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIGNQVHCF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATCCAGCCTCAGGTCTATTGGGGAGAAGGGAAGCCATGTCTTTAACCAGAATTTGAAGATTTCCCAGTTGGGAAAATCAGGTCGCATTGAAGAAGCTGTTGCAGT
TTTCTCGCATATGACTGAGAAGAACACTGTGACGTACAATTCGATGATCTCCGCCTATGCCAAAAATGGAAGGATAGAAAATGCTCGTGAGTTGTTTAATCTAATGCCTC
GGAAAAACTTGGTTTCATGGAACTCCATGATCGCGGGGTACTTGCACAATGACTTGGTTGAGGATGCCGCCAAACTGTTTGATAAAATGTATAAAAGGGACCTCTACTCG
TGGACTTTGATGATAACTTGTTACACACGTATTGGGGAGCTTGAAAAGGCAAGGGAGCTTTTCAATTTGCTTCCTGACAAGCAAGATACAGTTTGTCGTAATGCACTTAT
TGCAGGTTATGCCAAGAAGAGACGGTTCGATGAGGCGAAGAAACTATTTGATGAAATGTTGGTAAAGAATGTAGTTTCATGGAATTCTCTGCTGGCAGGCTACACCAAGA
ACGGTGAAATGCACTCAGGACTGCAGTTCTTTGAGGCCATGGGAGAAAGGAATGTGGTTTCATGGAATTTGATGGTAGATGGGTATATTGAGGTTGGTGATTTGGATTCT
GCCTGGATGATTTTCAGGAAAATTCCAACTCCAAATGCAGTGTCCTGGGTGACAATGTTCTCTGGTCTTGCACATTATGGCAGGATCACTGAGGCTCGGGATCTCTTCGA
CCAGATGCCGAGTAAGAATTTGGTTTCTTGGAACGCTATGATTGGAGCTTATGTACGAAATGACCAAATTGACGACGCTTTTAAGCTATTTAAGGAAATGACAGAAAGGG
ACTCTGTATCATGGACTACCATGATCAATGGGTGTGTTCGAGTTGGTAAGCTTTCCCAGGCAAGGGAAATTCTTGATCTGATGCCTTATAAGAATATTGCGGCTCAAACA
GCAATGATCAATGGATATGTACAAAGTAGCAGAATGGATGAAGCCAATGAAATTTTTAGTCAAATTTCTGTACGTGATACTGTTTGTTGGAACACCGTGATAGCTGGTTA
TGCTCATTGTGGAAGAATGGATGAAGCTCTCTGTCTGTTTCGAGAAATGAAACGCAAAGATATGGTTTCATGGAATACCATGATTGCTGGTTATGTTCAAGCAGGACAGA
CAGATAAAGCACTCGGGATATTTAATGAGATGCGAGAGAGGAATGTAGTATCTTGGAATTCTCTGATTACAGGATACGTGCAAAATGGGTTATACGTTGAGGCACTGAAG
CGTTTCATATCGATGAAACAGCAAGGAGAGAAGCCTGATCAGTCAACTTTTGTGTGTTGCTTACGAGCATCTGCCAATCTTGCAGCTTTGAGAGTTGGAATGCAACTTCA
CCATCTCACTATCAAGAGTGGTTTTGGAAACGATTTATTTGTTAAAAACGTGATAATAACCATGTATACAAAAAGTGGAAGAGTCCTTGATGCTGAAAATGTCTTTGCTG
AGATCAACAATAAAGATGTGGTCTCATGGAATTCTTTGATAGCTGGATATGCACTAAATGGGTATGGAAAAGAAGCTGTTGAGCTTTTTGAAGAGATGTCAATAAGAGGG
GTCATTCCGGATGAAGTTACCTTCACTGGCTTGTTATCTGCATGTAATCATGGAGGTTTTGTGAATCAGGGTTTGAAATTGTTTAAGTGCATGACTGAAACATACTTAAT
AAAACCTTTATCAGAACATTATGCTTGTGTGGTTGATTTGCTTGGTCGGGTGGGTAGGTTAGAAGAAGCCATGGAAATAGTGGAGGGGAGGAAAACTGTGTCAAGTGCAA
AAATATGGGGTGCATTGCTATGGGCTTGCAGAGTACACCAGAATTGGGAGCTAGCCAAGTATCCAGTTGAGAGGCTTTTAGAACTTGAACCGCAAAATGCTTCGAATTAT
GTACTACTTTCAAGCATGCATGCTGAGGCAGGGAGATGGGATATGGTTGAGAGAGTCAGGGTCTCAATGAGAGAAAATAAAGCTGAGAAGCAACCTGGCTGCAGTTGGAT
TGAAATCGGTAATCAAGTGCACTGTTTCTTATCTGAAGCTCCAGCAGAATTGAGGCCAGAAATTTGCAATATATTGCAAACTGTGACTGCACAGATAAGAAACACAGAGT
GCATGTATATCCACACCAGCCTGGATGCAGAGCTCTACAACAGCTGGAATTCTATCATATTCAAATCGTGTTTCATGCTTAATTCGCCGACCGAAAGGATCGTAGAGATG
GACCAAGACAGTCTTGAATATGGTGGCCTGACTGGGCTACATGAACTTCTGAACATTCCCTACACAGAATCAAAAGACAAATTGATATTAAGGCGCAACTATAAATTGCA
AAAGAAATCAAATTTCTACCCATTTGAGATGGATCCCAAAAACAAGTGGAAACCGCGGGTGGCTTCTGGAGCATCAATCAAGTCATCGATTCTCAAGCTGGCTCGTCATC
TTGAAAAAGAAACCAAACGTGATATCAATGCCAACAGAACAAAAAAGTTGATGTCACTCCACAAAAAACAGACCATTAGAACTCAAGCGCTATGCCACACAGTCGCCGGA
GGGGAGTCCAAAGTTAAAGAGGGTGTTATAGCTACCGCCGGCGCCGGAGAAGAGGGAGAAAGTGGGTCGTCTGTGATGGAAGAGGTGGAAGAAGTACACTTCCGCAGCCA
AACTGAAACGCCTCGCTTCTCTTTGAATCTGCGTGGCTCTCGGTTTTTCTATCACAGAGAGAAAAAATCTCAGGCGGTTAAACCGGAACAGTGGGAGCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAATCCAGCCTCAGGTCTATTGGGGAGAAGGGAAGCCATGTCTTTAACCAGAATTTGAAGATTTCCCAGTTGGGAAAATCAGGTCGCATTGAAGAAGCTGTTGCAGT
TTTCTCGCATATGACTGAGAAGAACACTGTGACGTACAATTCGATGATCTCCGCCTATGCCAAAAATGGAAGGATAGAAAATGCTCGTGAGTTGTTTAATCTAATGCCTC
GGAAAAACTTGGTTTCATGGAACTCCATGATCGCGGGGTACTTGCACAATGACTTGGTTGAGGATGCCGCCAAACTGTTTGATAAAATGTATAAAAGGGACCTCTACTCG
TGGACTTTGATGATAACTTGTTACACACGTATTGGGGAGCTTGAAAAGGCAAGGGAGCTTTTCAATTTGCTTCCTGACAAGCAAGATACAGTTTGTCGTAATGCACTTAT
TGCAGGTTATGCCAAGAAGAGACGGTTCGATGAGGCGAAGAAACTATTTGATGAAATGTTGGTAAAGAATGTAGTTTCATGGAATTCTCTGCTGGCAGGCTACACCAAGA
ACGGTGAAATGCACTCAGGACTGCAGTTCTTTGAGGCCATGGGAGAAAGGAATGTGGTTTCATGGAATTTGATGGTAGATGGGTATATTGAGGTTGGTGATTTGGATTCT
GCCTGGATGATTTTCAGGAAAATTCCAACTCCAAATGCAGTGTCCTGGGTGACAATGTTCTCTGGTCTTGCACATTATGGCAGGATCACTGAGGCTCGGGATCTCTTCGA
CCAGATGCCGAGTAAGAATTTGGTTTCTTGGAACGCTATGATTGGAGCTTATGTACGAAATGACCAAATTGACGACGCTTTTAAGCTATTTAAGGAAATGACAGAAAGGG
ACTCTGTATCATGGACTACCATGATCAATGGGTGTGTTCGAGTTGGTAAGCTTTCCCAGGCAAGGGAAATTCTTGATCTGATGCCTTATAAGAATATTGCGGCTCAAACA
GCAATGATCAATGGATATGTACAAAGTAGCAGAATGGATGAAGCCAATGAAATTTTTAGTCAAATTTCTGTACGTGATACTGTTTGTTGGAACACCGTGATAGCTGGTTA
TGCTCATTGTGGAAGAATGGATGAAGCTCTCTGTCTGTTTCGAGAAATGAAACGCAAAGATATGGTTTCATGGAATACCATGATTGCTGGTTATGTTCAAGCAGGACAGA
CAGATAAAGCACTCGGGATATTTAATGAGATGCGAGAGAGGAATGTAGTATCTTGGAATTCTCTGATTACAGGATACGTGCAAAATGGGTTATACGTTGAGGCACTGAAG
CGTTTCATATCGATGAAACAGCAAGGAGAGAAGCCTGATCAGTCAACTTTTGTGTGTTGCTTACGAGCATCTGCCAATCTTGCAGCTTTGAGAGTTGGAATGCAACTTCA
CCATCTCACTATCAAGAGTGGTTTTGGAAACGATTTATTTGTTAAAAACGTGATAATAACCATGTATACAAAAAGTGGAAGAGTCCTTGATGCTGAAAATGTCTTTGCTG
AGATCAACAATAAAGATGTGGTCTCATGGAATTCTTTGATAGCTGGATATGCACTAAATGGGTATGGAAAAGAAGCTGTTGAGCTTTTTGAAGAGATGTCAATAAGAGGG
GTCATTCCGGATGAAGTTACCTTCACTGGCTTGTTATCTGCATGTAATCATGGAGGTTTTGTGAATCAGGGTTTGAAATTGTTTAAGTGCATGACTGAAACATACTTAAT
AAAACCTTTATCAGAACATTATGCTTGTGTGGTTGATTTGCTTGGTCGGGTGGGTAGGTTAGAAGAAGCCATGGAAATAGTGGAGGGGAGGAAAACTGTGTCAAGTGCAA
AAATATGGGGTGCATTGCTATGGGCTTGCAGAGTACACCAGAATTGGGAGCTAGCCAAGTATCCAGTTGAGAGGCTTTTAGAACTTGAACCGCAAAATGCTTCGAATTAT
GTACTACTTTCAAGCATGCATGCTGAGGCAGGGAGATGGGATATGGTTGAGAGAGTCAGGGTCTCAATGAGAGAAAATAAAGCTGAGAAGCAACCTGGCTGCAGTTGGAT
TGAAATCGGTAATCAAGTGCACTGTTTCTTATCTGAAGCTCCAGCAGAATTGAGGCCAGAAATTTGCAATATATTGCAAACTGTGACTGCACAGATAAGAAACACAGAGT
GCATGTATATCCACACCAGCCTGGATGCAGAGCTCTACAACAGCTGGAATTCTATCATATTCAAATCGTGTTTCATGCTTAATTCGCCGACCGAAAGGATCGTAGAGATG
GACCAAGACAGTCTTGAATATGGTGGCCTGACTGGGCTACATGAACTTCTGAACATTCCCTACACAGAATCAAAAGACAAATTGATATTAAGGCGCAACTATAAATTGCA
AAAGAAATCAAATTTCTACCCATTTGAGATGGATCCCAAAAACAAGTGGAAACCGCGGGTGGCTTCTGGAGCATCAATCAAGTCATCGATTCTCAAGCTGGCTCGTCATC
TTGAAAAAGAAACCAAACGTGATATCAATGCCAACAGAACAAAAAAGTTGATGTCACTCCACAAAAAACAGACCATTAGAACTCAAGCGCTATGCCACACAGTCGCCGGA
GGGGAGTCCAAAGTTAAAGAGGGTGTTATAGCTACCGCCGGCGCCGGAGAAGAGGGAGAAAGTGGGTCGTCTGTGATGGAAGAGGTGGAAGAAGTACACTTCCGCAGCCA
AACTGAAACGCCTCGCTTCTCTTTGAATCTGCGTGGCTCTCGGTTTTTCTATCACAGAGAGAAAAAATCTCAGGCGGTTAAACCGGAACAGTGGGAGCTTTAA
Protein sequenceShow/hide protein sequence
MKSSLRSIGEKGSHVFNQNLKISQLGKSGRIEEAVAVFSHMTEKNTVTYNSMISAYAKNGRIENARELFNLMPRKNLVSWNSMIAGYLHNDLVEDAAKLFDKMYKRDLYS
WTLMITCYTRIGELEKARELFNLLPDKQDTVCRNALIAGYAKKRRFDEAKKLFDEMLVKNVVSWNSLLAGYTKNGEMHSGLQFFEAMGERNVVSWNLMVDGYIEVGDLDS
AWMIFRKIPTPNAVSWVTMFSGLAHYGRITEARDLFDQMPSKNLVSWNAMIGAYVRNDQIDDAFKLFKEMTERDSVSWTTMINGCVRVGKLSQAREILDLMPYKNIAAQT
AMINGYVQSSRMDEANEIFSQISVRDTVCWNTVIAGYAHCGRMDEALCLFREMKRKDMVSWNTMIAGYVQAGQTDKALGIFNEMRERNVVSWNSLITGYVQNGLYVEALK
RFISMKQQGEKPDQSTFVCCLRASANLAALRVGMQLHHLTIKSGFGNDLFVKNVIITMYTKSGRVLDAENVFAEINNKDVVSWNSLIAGYALNGYGKEAVELFEEMSIRG
VIPDEVTFTGLLSACNHGGFVNQGLKLFKCMTETYLIKPLSEHYACVVDLLGRVGRLEEAMEIVEGRKTVSSAKIWGALLWACRVHQNWELAKYPVERLLELEPQNASNY
VLLSSMHAEAGRWDMVERVRVSMRENKAEKQPGCSWIEIGNQVHCFLSEAPAELRPEICNILQTVTAQIRNTECMYIHTSLDAELYNSWNSIIFKSCFMLNSPTERIVEM
DQDSLEYGGLTGLHELLNIPYTESKDKLILRRNYKLQKKSNFYPFEMDPKNKWKPRVASGASIKSSILKLARHLEKETKRDINANRTKKLMSLHKKQTIRTQALCHTVAG
GESKVKEGVIATAGAGEEGESGSSVMEEVEEVHFRSQTETPRFSLNLRGSRFFYHREKKSQAVKPEQWEL