; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg09827 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg09827
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCarg_Chr10:3020208..3022295
RNA-Seq ExpressionCarg09827
SyntenyCarg09827
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589945.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0099.86Show/hide
Query:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA
        MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA
Subjt:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA

Query:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY
        KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY
Subjt:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY

Query:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS
        AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLS+VIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS
Subjt:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS

Query:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ
        GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ
Subjt:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ

Query:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV
        NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV
Subjt:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV

Query:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE
        SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE
Subjt:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE

Query:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
        QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
Subjt:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW

KAG7023609.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA
        MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA
Subjt:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA

Query:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY
        KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY
Subjt:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY

Query:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS
        AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS
Subjt:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS

Query:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ
        GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ
Subjt:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ

Query:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV
        NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV
Subjt:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV

Query:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE
        SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE
Subjt:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE

Query:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
        QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
Subjt:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW

XP_022960689.1 pentatricopeptide repeat-containing protein At4g02750-like isoform X1 [Cucurbita moschata]0.0e+0099.57Show/hide
Query:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA
        MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA
Subjt:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA

Query:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY
        KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY
Subjt:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY

Query:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS
        AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLS+VIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS
Subjt:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS

Query:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ
        GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ
Subjt:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ

Query:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV
        NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV
Subjt:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV

Query:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE
        SATKGDVASAEM GRHLF LDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE
Subjt:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE

Query:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
        QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
Subjt:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW

XP_022987632.1 pentatricopeptide repeat-containing protein At4g02750-like isoform X1 [Cucurbita maxima]0.0e+0098.13Show/hide
Query:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA
        MKAKSKLRQAV LLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA
Subjt:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA

Query:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY
        KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGL PTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY
Subjt:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY

Query:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS
        AKCGEIE ARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLH+MRLSGHMPDQVTLS+VIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS
Subjt:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS

Query:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ
        GREEDALLLFNEMLLEHFEPDSYT SSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLI+DARSVFDVMPTRNVITWNAMIVGYAQ
Subjt:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ

Query:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV
        NGRDKD LELFENMLQEKFKPDNVTFVGVLSACLHSN IEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAV+LIKSMPHEPDFLIWSTLLSV
Subjt:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV

Query:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE
        SATKGDVA AEMAGRHLF LD TNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE
Subjt:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE

Query:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
        QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
Subjt:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW

XP_023516538.1 pentatricopeptide repeat-containing protein At4g02750-like [Cucurbita pepo subsp. pepo]0.0e+0098.13Show/hide
Query:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA
        MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA
Subjt:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA

Query:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY
        KSGSI DLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY
Subjt:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY

Query:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS
        AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLH+MRLSGHMPDQVTLS+VIAAYCQCGR DEARRVFNEFKDKDIVCWTAMLVGYAKS
Subjt:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS

Query:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ
        GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASL HGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ
Subjt:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ

Query:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV
        NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSN IEQGQ+FFDSISNQHGLTPS DHYACMVNLLGRSGRIDQAV+LIK+MPHEPDFLIWSTLLSV
Subjt:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV

Query:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE
        SATKGDVA AEMAGRHLF LDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKL+E
Subjt:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE

Query:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
        QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
Subjt:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW

TrEMBL top hitse value%identityAlignment
A0A0A0LUY3 DYW_deaminase domain-containing protein0.0e+0085.18Show/hide
Query:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA
        MKAKS LRQ+VDLLCSRSTATSEAYTQLVLECVR NEI+QAKRLQSHMEHHLFQP D FLHNQLLHLYAKFGKLRDAQNLFDKML+RD+FSWNALLSAYA
Subjt:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA

Query:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY
        KSGSIQ+L+ATFDRMP+RDSVSYNT IAG SGNS P+ESLELF+RMQREG  PTEYT VS LNASAQL DLR GKQIHGS+IV N+LGNVFI NALTDMY
Subjt:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY

Query:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS
        AKCGEIEQARWLFD LT KNLVSWNLMISGY KNGQPEKCIGLLH MRLSGHMPDQVT+S++IAAYCQCGR DEARRVF+EFK+KDIVCWTAM+VGYAK+
Subjt:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS

Query:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ
        GREEDALLLFNEMLLEH EPDSYTLSSVVSSCAKLASL+HGQA+HGKSILAGL+NNLLVSSALIDMYSKCG I+DARSVF++MPTRNV++WNAMIVG AQ
Subjt:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ

Query:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV
        NG DKD LELFENMLQ+KFKPDNVTF+G+LSACLH N+IEQGQ +FDSI+NQHG+TP+LDHYACMVNLLGR+GRI+QAV LIK+M H+PDFLIWSTLLS+
Subjt:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV

Query:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE
         +TKGD+ +AE+A RHLF LDPT AVPYIMLSNMYASMGRWKDVA+VR++MK+KNVKKFAG+SWIEIDNEVH+FTSEDRTHPE+E IYE+L +LI KL+E
Subjt:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE

Query:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
        +GF PNTNLVLHDVGE+EK KSICFHSEKLAL FGLIKK NG+SPIRIIKNIRIC+DCHEFMKFAS  I RQIILRDSNRFHHFS GKCSC DNW
Subjt:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW

A0A5A7UC76 Pentatricopeptide repeat-containing protein0.0e+0085.61Show/hide
Query:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA
        MKAKS LRQ+VDLLCSRSTATSEAYTQLVLECVR NEI+QAKRLQSHMEHHLFQP DPFLHNQLLHLYAKFGKLRDAQNLFDKML+RD FSWNALLSAYA
Subjt:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA

Query:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY
        KSGSIQ+L+ATFDRMP+RDSVSYNT IAG SGNS P+ESL+LF+RMQREG  PTEYT VS LNASAQLLDLR GKQIHGS+IV N+LGNVFI N LTDMY
Subjt:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY

Query:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS
        AKCGEIEQARWLFD LT KNLVSWNLMISGY KNGQPEKCIGLLH MRLSGHMP+QVT+S++IAAYCQCGR DEARRVF+EFK+KDIVCWTAMLVGYAK+
Subjt:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS

Query:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ
        GREEDALLLFNEMLLEH EPDSYTLSSVVSSCAKLASL+HGQA+HGKSILAGL+NNLLVSSALIDMYSKCG I+DARSVF++MPTRNV++WNAMIVG AQ
Subjt:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ

Query:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV
        NG DKD LELFENMLQ+KFKPDNVTF+G+LSACLH N+IEQGQ +FDSISNQHGLTP+LDHYACMVNLLGR+GRI+QAV LIK+M HEPDFLIWSTLLS+
Subjt:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV

Query:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE
         +TKGD+ +AEMA RHLF LDPT+AVPYIMLSNMYASMGRWK VA+VR++MK+KNVKKFAG+SWIEID EVH+FTSEDRTHPE+E IYEEL ILI KL+E
Subjt:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE

Query:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
        +GF PNT LVLHDVGE+EK KSICFHSEKLAL FGLIKK NG+SPIRIIKNIRIC+DCHEFMKFAS  I RQIILRDSNRFHHFS GKCSC DNW
Subjt:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW

A0A6J1CU81 pentatricopeptide repeat-containing protein At4g02750-like0.0e+0085.47Show/hide
Query:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA
        M+AK KLRQA+DLLCSR +A+SEAYT L+LECVR NE+DQAKRLQSHMEHHLFQPPDPFL NQLLHLYAKFGK+RDAQNLFDKMLERDVFSWNALLSAYA
Subjt:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA

Query:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY
        KSGSIQ+L+ATFDRMP+RDSVSYNT IAG +GN  PKESLELFRRMQ EG  PTEYTNVSALNA+AQLLDLRRGK+IHGSVIVH +LGN FI NALTDMY
Subjt:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY

Query:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS
        AKCGEIEQARWLFD L NKNL+SWNLMISGYVKNGQPEKCIGLLH+M++SGHMPDQVT+S++IAAYCQC   DEAR+VF+EFK+KDIVCWTAMLVGYAK+
Subjt:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS

Query:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ
        GREEDALLLFNEMLLEH +PDSYTLSSVVSSCAKLASLYHGQA+HGKSILAGLDNNLLVSSALIDMYSKCG +++ARSVF++MPTRNVI+WNAMIVGYAQ
Subjt:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ

Query:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV
        NG DKD L  FENMLQ+KFKPDNVTF+GVLSACLHSN+IE+GQ +FDSISNQHGL P++DHYACMVNLLGR GRIDQAVDLIKSMPHEPD LIWSTLLSV
Subjt:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV

Query:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE
        SA KGD+A+AEMA R+LF LDP NAVPY+MLSNMYA MGRWKDVA+VR++MK+KNVKKFAGYSWIEIDN+VHKFTSEDRTHPETE+IYEEL +LIRK +E
Subjt:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE

Query:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
         GF PNTNLVLHDVGE+EK KSICFHSEKLAL FGL+KK NGV+PIRIIKNIRICSDCHEFMKFAS  IRRQIILRDSNRFHHF+ GKCSCKDNW
Subjt:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW

A0A6J1HBT0 pentatricopeptide repeat-containing protein At4g02750-like isoform X10.0e+0099.57Show/hide
Query:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA
        MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA
Subjt:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA

Query:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY
        KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY
Subjt:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY

Query:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS
        AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLS+VIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS
Subjt:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS

Query:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ
        GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ
Subjt:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ

Query:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV
        NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV
Subjt:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV

Query:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE
        SATKGDVASAEM GRHLF LDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE
Subjt:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE

Query:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
        QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
Subjt:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW

A0A6J1JAW4 pentatricopeptide repeat-containing protein At4g02750-like isoform X10.0e+0098.13Show/hide
Query:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA
        MKAKSKLRQAV LLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA
Subjt:  MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYA

Query:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY
        KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGL PTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY
Subjt:  KSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMY

Query:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS
        AKCGEIE ARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLH+MRLSGHMPDQVTLS+VIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS
Subjt:  AKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKS

Query:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ
        GREEDALLLFNEMLLEHFEPDSYT SSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLI+DARSVFDVMPTRNVITWNAMIVGYAQ
Subjt:  GREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQ

Query:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV
        NGRDKD LELFENMLQEKFKPDNVTFVGVLSACLHSN IEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAV+LIKSMPHEPDFLIWSTLLSV
Subjt:  NGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSV

Query:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE
        SATKGDVA AEMAGRHLF LD TNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE
Subjt:  SATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEE

Query:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
        QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
Subjt:  QGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW

SwissProt top hitse value%identityAlignment
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic4.3e-13536.94Show/hide
Query:  YTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHN--QLLHLYAKFGKLRDAQNLFDKMLER----DVFSWNALLSAYAKSGSIQDLRATFDRMPYR
        Y  ++    + +++D+A +    M    +   +P ++N   LL +     +LR  + +   +++     D+F+   L + YAK   + + R  FDRMP R
Subjt:  YTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHN--QLLHLYAKFGKLRDAQNLFDKMLER----DVFSWNALLSAYAKSGSIQDLRATFDRMPYR

Query:  DSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMYAKCGEIEQARWLFDRLTN
        D VS+NTI+AG S N   + +LE+ + M  E L P+  T VS L A + L  +  GK+IHG  +   +   V I  AL DMYAKCG +E AR LFD +  
Subjt:  DSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMYAKCGEIEQARWLFDRLTN

Query:  KNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVT-----------------------------------LSSVIAAYCQCGRADEARRVFNEFK
        +N+VSWN MI  YV+N  P++ + +   M   G  P  V+                                   ++S+I+ YC+C   D A  +F + +
Subjt:  KNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVT-----------------------------------LSSVIAAYCQCGRADEARRVFNEFK

Query:  DKDIVCWTAMLVGYAKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVM
         + +V W AM++G+A++GR  DAL  F++M     +PD++T  SV+++ A+L+  +H + IHG  + + LD N+ V++AL+DMY+KCG I  AR +FD+M
Subjt:  DKDIVCWTAMLVGYAKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVM

Query:  PTRNVITWNAMIVGYAQNGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIK
          R+V TWNAMI GY  +G  K  LELFE M +   KP+ VTF+ V+SAC HS  +E G   F  +   + +  S+DHY  MV+LLGR+GR+++A D I 
Subjt:  PTRNVITWNAMIVGYAQNGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIK

Query:  SMPHEPDFLIWSTLLSVSATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPE
         MP +P   ++  +L       +V  AE A   LF L+P +   +++L+N+Y +   W+ V  VR  M  + ++K  G S +EI NEVH F S    HP+
Subjt:  SMPHEPDFLIWSTLLSVSATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPE

Query:  TEQIYEELKILIRKLEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHH
        +++IY  L+ LI  ++E G+ P+TNLVL  V  + K + +  HSEKLA+ FGL+    G + I + KN+R+C+DCH   K+ S+   R+I++RD  RFHH
Subjt:  TEQIYEELKILIRKLEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHH

Query:  FSNGKCSCKDNW
        F NG CSC D W
Subjt:  FSNGKCSCKDNW

Q9CAA8 Putative pentatricopeptide repeat-containing protein At1g689301.5e-14036.78Show/hide
Query:  PDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAP-T
        P+ FL+N ++H YA       A+ +FD++ + ++FSWN LL AY+K+G I ++ +TF+++P RD V++N +I G S +     +++ +  M R+  A  T
Subjt:  PDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAP-T

Query:  EYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMYAK-------------------------------CGEIEQARWLFDRLTNKNLVS
          T ++ L  S+    +  GKQIHG VI   +   + + + L  MYA                                CG IE A  LF R   K+ VS
Subjt:  EYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMYAK-------------------------------CGEIEQARWLFDRLTNKNLVS

Query:  WNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAA-----------------------------------YCQCGRADEARRVFNEFKDKDIV
        W  MI G  +NG  ++ I    +M++ G   DQ    SV+ A                                   YC+C     A+ VF+  K K++V
Subjt:  WNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAA-----------------------------------YCQCGRADEARRVFNEFKDKDIV

Query:  CWTAMLVGYAKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNV
         WTAM+VGY ++GR E+A+ +F +M     +PD YTL   +S+CA ++SL  G   HGK+I +GL + + VS++L+ +Y KCG I+D+  +F+ M  R+ 
Subjt:  CWTAMLVGYAKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNV

Query:  ITWNAMIVGYAQNGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHE
        ++W AM+  YAQ GR  +T++LF+ M+Q   KPD VT  GV+SAC  +  +E+GQ +F  +++++G+ PS+ HY+CM++L  RSGR+++A+  I  MP  
Subjt:  ITWNAMIVGYAQNGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHE

Query:  PDFLIWSTLLSVSATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIY
        PD + W+TLLS    KG++   + A   L  LDP +   Y +LS++YAS G+W  VA +R  M+ KNVKK  G SWI+   ++H F+++D + P  +QIY
Subjt:  PDFLIWSTLLSVSATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIY

Query:  EELKILIRKLEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGK
         +L+ L  K+ + G++P+T+ V HDV E  K+K + +HSE+LA+ FGLI   +G  PIR+ KN+R+C DCH   K  S    R+I++RD+ RFH F +G 
Subjt:  EELKILIRKLEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGK

Query:  CSCKDNW
        CSC D W
Subjt:  CSCKDNW

Q9FIB2 Putative pentatricopeptide repeat-containing protein At5g099501.8e-13338.82Show/hide
Query:  NALLSAYAKSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFI
        N L++ YAK GSI D R  F  M  +DSVS+N++I GL  N    E++E ++ M+R  + P  +T +S+L++ A L   + G+QIHG  +      NV +
Subjt:  NALLSAYAKSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFI

Query:  CNALTDMYAKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQ--PEKCIGLL---------------------------------HDMRLSGHMPDQV
         NAL  +YA+ G + + R +F  +   + VSWN +I    ++ +  PE  +  L                                 H + L  ++ D+ 
Subjt:  CNALTDMYAKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQ--PEKCIGLL---------------------------------HDMRLSGHMPDQV

Query:  TL-SSVIAAYCQCGRADEARRVFNEFKD-KDIVCWTAMLVGYAKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDN
        T  +++IA Y +CG  D   ++F+   + +D V W +M+ GY  +     AL L   ML      DS+  ++V+S+ A +A+L  G  +H  S+ A L++
Subjt:  TL-SSVIAAYCQCGRADEARRVFNEFKD-KDIVCWTAMLVGYAKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDN

Query:  NLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQNGRDKDTLELFENM-LQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHG
        +++V SAL+DMYSKCG ++ A   F+ MP RN  +WN+MI GYA++G+ ++ L+LFE M L  +  PD+VTFVGVLSAC H+  +E+G   F+S+S+ +G
Subjt:  NLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQNGRDKDTLELFENM-LQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHG

Query:  LTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSVSATKGDVASAEM---AGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVM
        L P ++H++CM ++LGR+G +D+  D I+ MP +P+ LIW T+L  +  + +   AE+   A   LF L+P NAV Y++L NMYA+ GRW+D+   R  M
Subjt:  LTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSVSATKGDVASAEM---AGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVM

Query:  KNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKN
        K+ +VKK AGYSW+ + + VH F + D++HP+ + IY++LK L RK+ + G+ P T   L+D+ +E K + + +HSEKLA+ F L  + +   PIRI+KN
Subjt:  KNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKN

Query:  IRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
        +R+C DCH   K+ S    RQIILRDSNRFHHF +G CSC D W
Subjt:  IRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g220703.5e-13738.21Show/hide
Query:  NQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREG-LAPTEYTNVS
        N LL++YAK G    A+ +FD+M+ RD+ SWNA+++ + + G +    A F++M  RD V++N++I+G +   +   +L++F +M R+  L+P  +T  S
Subjt:  NQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREG-LAPTEYTNVS

Query:  ALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMYAKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLS
         L+A A L  L  GKQIH  ++   +  +  + NAL  MY++CG +E AR L ++   K                          D+++ G        +
Subjt:  ALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMYAKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLS

Query:  SVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVS
        +++  Y + G  ++A+ +F   KD+D+V WTAM+VGY + G   +A+ LF  M+     P+SYTL++++S  + LASL HG+ IHG ++ +G   ++ VS
Subjt:  SVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVS

Query:  SALIDMYSKCGLIEDARSVFDVMP-TRNVITWNAMIVGYAQNGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSL
        +ALI MY+K G I  A   FD++   R+ ++W +MI+  AQ+G  ++ LELFE ML E  +PD++T+VGV SAC H+  + QG+ +FD + +   + P+L
Subjt:  SALIDMYSKCGLIEDARSVFDVMP-TRNVITWNAMIVGYAQNGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSL

Query:  DHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSVSATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKF
         HYACMV+L GR+G + +A + I+ MP EPD + W +LLS      ++   ++A   L +L+P N+  Y  L+N+Y++ G+W++ A +R  MK+  VKK 
Subjt:  DHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSVSATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKF

Query:  AGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCH
         G+SWIE+ ++VH F  ED THPE  +IY  +K +  ++++ G+ P+T  VLHD+ EE K + +  HSEKLA+ FGLI   +  + +RI+KN+R+C+DCH
Subjt:  AGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCH

Query:  EFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
          +KF S  + R+II+RD+ RFHHF +G CSC+D W
Subjt:  EFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW

Q9SY02 Pentatricopeptide repeat-containing protein At4g027508.9e-14136.96Show/hide
Query:  TSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQDLRATFDRMPYRDS
        +S +Y  ++   +R  E + A++L   M        D    N ++  Y +   L  A+ LF+ M ERDV SWN +LS YA++G + D R+ FDRMP ++ 
Subjt:  TSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQDLRATFDRMPYRDS

Query:  VSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTE-----YTNVSALNASAQLLDLRRGKQ-IHGSVIVHNY-----------------LGNVFICNALT
        VS+N +++    NS  +E+  LF+  +   L         +     +  + Q  D    +  +  + I+  Y                 + +VF   A+ 
Subjt:  VSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTE-----YTNVSALNASAQLLDLRRGKQ-IHGSVIVHNY-----------------LGNVFICNALT

Query:  DMYAKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGY
          Y +   +E+AR LFD++  +N VSWN M++GYV+  + E    L   M       +  T +++I  Y QCG+  EA+ +F++   +D V W AM+ GY
Subjt:  DMYAKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGY

Query:  AKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVG
        ++SG   +AL LF +M  E    +  + SS +S+CA + +L  G+ +HG+ +  G +    V +AL+ MY KCG IE+A  +F  M  +++++WN MI G
Subjt:  AKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVG

Query:  YAQNGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTL
        Y+++G  +  L  FE+M +E  KPD+ T V VLSAC H+  +++G+ +F +++  +G+ P+  HYACMV+LLGR+G ++ A +L+K+MP EPD  IW TL
Subjt:  YAQNGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTL

Query:  LSVSATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRK
        L  S   G+   AE A   +F ++P N+  Y++LSN+YAS GRW DV  +R  M++K VKK  GYSWIEI N+ H F+  D  HPE ++I+  L+ L  +
Subjt:  LSVSATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRK

Query:  LEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
        +++ G+   T++VLHDV EEEK + + +HSE+LA+ +G+++  +G  PIR+IKN+R+C DCH  +K+ +    R IILRD+NRFHHF +G CSC D W
Subjt:  LEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein3.0e-13636.94Show/hide
Query:  YTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHN--QLLHLYAKFGKLRDAQNLFDKMLER----DVFSWNALLSAYAKSGSIQDLRATFDRMPYR
        Y  ++    + +++D+A +    M    +   +P ++N   LL +     +LR  + +   +++     D+F+   L + YAK   + + R  FDRMP R
Subjt:  YTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHN--QLLHLYAKFGKLRDAQNLFDKMLER----DVFSWNALLSAYAKSGSIQDLRATFDRMPYR

Query:  DSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMYAKCGEIEQARWLFDRLTN
        D VS+NTI+AG S N   + +LE+ + M  E L P+  T VS L A + L  +  GK+IHG  +   +   V I  AL DMYAKCG +E AR LFD +  
Subjt:  DSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMYAKCGEIEQARWLFDRLTN

Query:  KNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVT-----------------------------------LSSVIAAYCQCGRADEARRVFNEFK
        +N+VSWN MI  YV+N  P++ + +   M   G  P  V+                                   ++S+I+ YC+C   D A  +F + +
Subjt:  KNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVT-----------------------------------LSSVIAAYCQCGRADEARRVFNEFK

Query:  DKDIVCWTAMLVGYAKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVM
         + +V W AM++G+A++GR  DAL  F++M     +PD++T  SV+++ A+L+  +H + IHG  + + LD N+ V++AL+DMY+KCG I  AR +FD+M
Subjt:  DKDIVCWTAMLVGYAKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVM

Query:  PTRNVITWNAMIVGYAQNGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIK
          R+V TWNAMI GY  +G  K  LELFE M +   KP+ VTF+ V+SAC HS  +E G   F  +   + +  S+DHY  MV+LLGR+GR+++A D I 
Subjt:  PTRNVITWNAMIVGYAQNGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIK

Query:  SMPHEPDFLIWSTLLSVSATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPE
         MP +P   ++  +L       +V  AE A   LF L+P +   +++L+N+Y +   W+ V  VR  M  + ++K  G S +EI NEVH F S    HP+
Subjt:  SMPHEPDFLIWSTLLSVSATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPE

Query:  TEQIYEELKILIRKLEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHH
        +++IY  L+ LI  ++E G+ P+TNLVL  V  + K + +  HSEKLA+ FGL+    G + I + KN+R+C+DCH   K+ S+   R+I++RD  RFHH
Subjt:  TEQIYEELKILIRKLEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHH

Query:  FSNGKCSCKDNW
        F NG CSC D W
Subjt:  FSNGKCSCKDNW

AT1G68930.1 pentatricopeptide (PPR) repeat-containing protein1.1e-14136.78Show/hide
Query:  PDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAP-T
        P+ FL+N ++H YA       A+ +FD++ + ++FSWN LL AY+K+G I ++ +TF+++P RD V++N +I G S +     +++ +  M R+  A  T
Subjt:  PDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAP-T

Query:  EYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMYAK-------------------------------CGEIEQARWLFDRLTNKNLVS
          T ++ L  S+    +  GKQIHG VI   +   + + + L  MYA                                CG IE A  LF R   K+ VS
Subjt:  EYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMYAK-------------------------------CGEIEQARWLFDRLTNKNLVS

Query:  WNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAA-----------------------------------YCQCGRADEARRVFNEFKDKDIV
        W  MI G  +NG  ++ I    +M++ G   DQ    SV+ A                                   YC+C     A+ VF+  K K++V
Subjt:  WNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAA-----------------------------------YCQCGRADEARRVFNEFKDKDIV

Query:  CWTAMLVGYAKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNV
         WTAM+VGY ++GR E+A+ +F +M     +PD YTL   +S+CA ++SL  G   HGK+I +GL + + VS++L+ +Y KCG I+D+  +F+ M  R+ 
Subjt:  CWTAMLVGYAKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNV

Query:  ITWNAMIVGYAQNGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHE
        ++W AM+  YAQ GR  +T++LF+ M+Q   KPD VT  GV+SAC  +  +E+GQ +F  +++++G+ PS+ HY+CM++L  RSGR+++A+  I  MP  
Subjt:  ITWNAMIVGYAQNGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHE

Query:  PDFLIWSTLLSVSATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIY
        PD + W+TLLS    KG++   + A   L  LDP +   Y +LS++YAS G+W  VA +R  M+ KNVKK  G SWI+   ++H F+++D + P  +QIY
Subjt:  PDFLIWSTLLSVSATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIY

Query:  EELKILIRKLEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGK
         +L+ L  K+ + G++P+T+ V HDV E  K+K + +HSE+LA+ FGLI   +G  PIR+ KN+R+C DCH   K  S    R+I++RD+ RFH F +G 
Subjt:  EELKILIRKLEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGK

Query:  CSCKDNW
        CSC D W
Subjt:  CSCKDNW

AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein2.5e-13838.21Show/hide
Query:  NQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREG-LAPTEYTNVS
        N LL++YAK G    A+ +FD+M+ RD+ SWNA+++ + + G +    A F++M  RD V++N++I+G +   +   +L++F +M R+  L+P  +T  S
Subjt:  NQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREG-LAPTEYTNVS

Query:  ALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMYAKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLS
         L+A A L  L  GKQIH  ++   +  +  + NAL  MY++CG +E AR L ++   K                          D+++ G        +
Subjt:  ALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMYAKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLS

Query:  SVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVS
        +++  Y + G  ++A+ +F   KD+D+V WTAM+VGY + G   +A+ LF  M+     P+SYTL++++S  + LASL HG+ IHG ++ +G   ++ VS
Subjt:  SVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVS

Query:  SALIDMYSKCGLIEDARSVFDVMP-TRNVITWNAMIVGYAQNGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSL
        +ALI MY+K G I  A   FD++   R+ ++W +MI+  AQ+G  ++ LELFE ML E  +PD++T+VGV SAC H+  + QG+ +FD + +   + P+L
Subjt:  SALIDMYSKCGLIEDARSVFDVMP-TRNVITWNAMIVGYAQNGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSL

Query:  DHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSVSATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKF
         HYACMV+L GR+G + +A + I+ MP EPD + W +LLS      ++   ++A   L +L+P N+  Y  L+N+Y++ G+W++ A +R  MK+  VKK 
Subjt:  DHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSVSATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKF

Query:  AGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCH
         G+SWIE+ ++VH F  ED THPE  +IY  +K +  ++++ G+ P+T  VLHD+ EE K + +  HSEKLA+ FGLI   +  + +RI+KN+R+C+DCH
Subjt:  AGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCH

Query:  EFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
          +KF S  + R+II+RD+ RFHHF +G CSC+D W
Subjt:  EFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW

AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.3e-14236.96Show/hide
Query:  TSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQDLRATFDRMPYRDS
        +S +Y  ++   +R  E + A++L   M        D    N ++  Y +   L  A+ LF+ M ERDV SWN +LS YA++G + D R+ FDRMP ++ 
Subjt:  TSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQDLRATFDRMPYRDS

Query:  VSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTE-----YTNVSALNASAQLLDLRRGKQ-IHGSVIVHNY-----------------LGNVFICNALT
        VS+N +++    NS  +E+  LF+  +   L         +     +  + Q  D    +  +  + I+  Y                 + +VF   A+ 
Subjt:  VSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTE-----YTNVSALNASAQLLDLRRGKQ-IHGSVIVHNY-----------------LGNVFICNALT

Query:  DMYAKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGY
          Y +   +E+AR LFD++  +N VSWN M++GYV+  + E    L   M       +  T +++I  Y QCG+  EA+ +F++   +D V W AM+ GY
Subjt:  DMYAKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGY

Query:  AKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVG
        ++SG   +AL LF +M  E    +  + SS +S+CA + +L  G+ +HG+ +  G +    V +AL+ MY KCG IE+A  +F  M  +++++WN MI G
Subjt:  AKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVG

Query:  YAQNGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTL
        Y+++G  +  L  FE+M +E  KPD+ T V VLSAC H+  +++G+ +F +++  +G+ P+  HYACMV+LLGR+G ++ A +L+K+MP EPD  IW TL
Subjt:  YAQNGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTL

Query:  LSVSATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRK
        L  S   G+   AE A   +F ++P N+  Y++LSN+YAS GRW DV  +R  M++K VKK  GYSWIEI N+ H F+  D  HPE ++I+  L+ L  +
Subjt:  LSVSATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVMKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRK

Query:  LEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
        +++ G+   T++VLHDV EEEK + + +HSE+LA+ +G+++  +G  PIR+IKN+R+C DCH  +K+ +    R IILRD+NRFHHF +G CSC D W
Subjt:  LEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW

AT5G09950.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-13438.82Show/hide
Query:  NALLSAYAKSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFI
        N L++ YAK GSI D R  F  M  +DSVS+N++I GL  N    E++E ++ M+R  + P  +T +S+L++ A L   + G+QIHG  +      NV +
Subjt:  NALLSAYAKSGSIQDLRATFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFI

Query:  CNALTDMYAKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQ--PEKCIGLL---------------------------------HDMRLSGHMPDQV
         NAL  +YA+ G + + R +F  +   + VSWN +I    ++ +  PE  +  L                                 H + L  ++ D+ 
Subjt:  CNALTDMYAKCGEIEQARWLFDRLTNKNLVSWNLMISGYVKNGQ--PEKCIGLL---------------------------------HDMRLSGHMPDQV

Query:  TL-SSVIAAYCQCGRADEARRVFNEFKD-KDIVCWTAMLVGYAKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDN
        T  +++IA Y +CG  D   ++F+   + +D V W +M+ GY  +     AL L   ML      DS+  ++V+S+ A +A+L  G  +H  S+ A L++
Subjt:  TL-SSVIAAYCQCGRADEARRVFNEFKD-KDIVCWTAMLVGYAKSGREEDALLLFNEMLLEHFEPDSYTLSSVVSSCAKLASLYHGQAIHGKSILAGLDN

Query:  NLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQNGRDKDTLELFENM-LQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHG
        +++V SAL+DMYSKCG ++ A   F+ MP RN  +WN+MI GYA++G+ ++ L+LFE M L  +  PD+VTFVGVLSAC H+  +E+G   F+S+S+ +G
Subjt:  NLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQNGRDKDTLELFENM-LQEKFKPDNVTFVGVLSACLHSNFIEQGQVFFDSISNQHG

Query:  LTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSVSATKGDVASAEM---AGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVM
        L P ++H++CM ++LGR+G +D+  D I+ MP +P+ LIW T+L  +  + +   AE+   A   LF L+P NAV Y++L NMYA+ GRW+D+   R  M
Subjt:  LTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSVSATKGDVASAEM---AGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSVM

Query:  KNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKN
        K+ +VKK AGYSW+ + + VH F + D++HP+ + IY++LK L RK+ + G+ P T   L+D+ +E K + + +HSEKLA+ F L  + +   PIRI+KN
Subjt:  KNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKN

Query:  IRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW
        +R+C DCH   K+ S    RQIILRDSNRFHHF +G CSC D W
Subjt:  IRICSDCHEFMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGCAAAATCCAAGCTGCGGCAAGCAGTAGACTTGCTTTGCTCTCGAAGCACCGCAACCTCTGAGGCCTACACTCAATTGGTTCTCGAATGTGTCCGTGCAAATGA
AATCGACCAAGCTAAGAGACTGCAGTCCCACATGGAACACCATCTTTTCCAACCCCCTGATCCCTTTCTCCACAATCAGCTACTTCATTTGTACGCAAAATTCGGGAAGC
TTCGGGATGCCCAAAACCTGTTTGATAAAATGCTTGAAAGGGATGTTTTCTCCTGGAATGCTCTGCTCTCTGCGTATGCTAAATCAGGGTCCATCCAGGATTTGCGGGCC
ACATTTGATCGAATGCCTTACCGCGATTCAGTTTCATACAATACGATCATAGCAGGTTTGTCTGGAAATAGTTTTCCAAAAGAGTCGCTTGAGCTTTTCAGAAGAATGCA
GAGGGAGGGTCTTGCACCTACGGAGTATACAAATGTAAGCGCATTGAATGCATCTGCACAATTGTTGGATTTGAGGCGTGGGAAACAGATTCATGGGAGTGTTATTGTGC
ATAACTATTTAGGGAATGTGTTCATTTGCAATGCTTTAACAGATATGTATGCCAAATGTGGTGAGATTGAACAGGCAAGGTGGTTGTTTGATCGTCTCACTAACAAGAAT
TTGGTTTCTTGGAACTTGATGATATCTGGGTATGTAAAGAATGGACAGCCTGAGAAGTGCATTGGTTTGTTACATGACATGCGCTTGTCTGGTCATATGCCTGATCAAGT
TACCTTGTCATCTGTGATCGCAGCTTACTGTCAATGTGGGCGTGCAGATGAAGCAAGAAGGGTATTTAATGAGTTTAAAGACAAAGATATTGTTTGCTGGACAGCCATGT
TGGTGGGTTATGCAAAAAGCGGCAGAGAAGAGGATGCACTGTTGTTGTTCAACGAAATGCTATTAGAACACTTTGAACCCGACAGCTACACATTATCCAGCGTTGTCAGT
TCATGTGCCAAATTAGCATCTCTATATCATGGTCAGGCAATCCATGGAAAATCAATTCTTGCTGGGCTTGATAACAATTTACTCGTCTCAAGCGCATTAATCGACATGTA
TTCCAAATGTGGTTTAATCGAGGACGCAAGGTCAGTCTTCGACGTGATGCCAACTAGGAATGTGATTACATGGAATGCTATGATTGTTGGCTATGCACAAAACGGACGTG
ATAAGGACACCCTTGAACTCTTTGAAAACATGTTGCAAGAGAAATTCAAACCTGATAATGTTACTTTTGTAGGCGTTTTATCTGCTTGTCTCCATTCTAATTTCATTGAA
CAAGGGCAGGTGTTCTTCGATTCTATAAGCAATCAACACGGATTGACGCCTAGTTTGGATCATTACGCATGTATGGTCAATCTCCTAGGACGTTCAGGTCGCATCGATCA
AGCAGTTGATCTAATAAAAAGTATGCCCCATGAACCAGATTTCCTGATATGGTCCACACTTCTATCTGTTAGTGCAACAAAGGGTGATGTTGCAAGTGCAGAAATGGCAG
GTAGGCATCTCTTCGTATTGGATCCTACCAATGCTGTGCCATATATTATGCTCTCAAATATGTATGCCTCTATGGGTAGATGGAAGGATGTAGCAGCAGTTAGGAGTGTG
ATGAAGAACAAGAATGTGAAGAAGTTTGCTGGGTACAGTTGGATTGAGATTGATAATGAGGTTCACAAATTCACATCCGAAGATCGAACACACCCAGAAACAGAACAAAT
CTATGAGGAATTGAAGATATTGATAAGGAAACTTGAAGAACAGGGATTTAGGCCTAATACAAATCTGGTTTTGCATGATGTTGGAGAGGAGGAGAAGTTGAAATCCATAT
GTTTCCACAGCGAGAAACTTGCGCTTGTCTTTGGTTTGATTAAGAAAGGTAATGGAGTTAGTCCAATAAGGATCATAAAGAACATTCGGATTTGCAGCGATTGCCATGAG
TTCATGAAGTTTGCATCTATGAGTATTAGAAGGCAAATAATCTTGAGAGATTCAAATAGATTTCATCATTTTTCAAATGGGAAATGCTCCTGCAAGGATAACTGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGCAAAATCCAAGCTGCGGCAAGCAGTAGACTTGCTTTGCTCTCGAAGCACCGCAACCTCTGAGGCCTACACTCAATTGGTTCTCGAATGTGTCCGTGCAAATGA
AATCGACCAAGCTAAGAGACTGCAGTCCCACATGGAACACCATCTTTTCCAACCCCCTGATCCCTTTCTCCACAATCAGCTACTTCATTTGTACGCAAAATTCGGGAAGC
TTCGGGATGCCCAAAACCTGTTTGATAAAATGCTTGAAAGGGATGTTTTCTCCTGGAATGCTCTGCTCTCTGCGTATGCTAAATCAGGGTCCATCCAGGATTTGCGGGCC
ACATTTGATCGAATGCCTTACCGCGATTCAGTTTCATACAATACGATCATAGCAGGTTTGTCTGGAAATAGTTTTCCAAAAGAGTCGCTTGAGCTTTTCAGAAGAATGCA
GAGGGAGGGTCTTGCACCTACGGAGTATACAAATGTAAGCGCATTGAATGCATCTGCACAATTGTTGGATTTGAGGCGTGGGAAACAGATTCATGGGAGTGTTATTGTGC
ATAACTATTTAGGGAATGTGTTCATTTGCAATGCTTTAACAGATATGTATGCCAAATGTGGTGAGATTGAACAGGCAAGGTGGTTGTTTGATCGTCTCACTAACAAGAAT
TTGGTTTCTTGGAACTTGATGATATCTGGGTATGTAAAGAATGGACAGCCTGAGAAGTGCATTGGTTTGTTACATGACATGCGCTTGTCTGGTCATATGCCTGATCAAGT
TACCTTGTCATCTGTGATCGCAGCTTACTGTCAATGTGGGCGTGCAGATGAAGCAAGAAGGGTATTTAATGAGTTTAAAGACAAAGATATTGTTTGCTGGACAGCCATGT
TGGTGGGTTATGCAAAAAGCGGCAGAGAAGAGGATGCACTGTTGTTGTTCAACGAAATGCTATTAGAACACTTTGAACCCGACAGCTACACATTATCCAGCGTTGTCAGT
TCATGTGCCAAATTAGCATCTCTATATCATGGTCAGGCAATCCATGGAAAATCAATTCTTGCTGGGCTTGATAACAATTTACTCGTCTCAAGCGCATTAATCGACATGTA
TTCCAAATGTGGTTTAATCGAGGACGCAAGGTCAGTCTTCGACGTGATGCCAACTAGGAATGTGATTACATGGAATGCTATGATTGTTGGCTATGCACAAAACGGACGTG
ATAAGGACACCCTTGAACTCTTTGAAAACATGTTGCAAGAGAAATTCAAACCTGATAATGTTACTTTTGTAGGCGTTTTATCTGCTTGTCTCCATTCTAATTTCATTGAA
CAAGGGCAGGTGTTCTTCGATTCTATAAGCAATCAACACGGATTGACGCCTAGTTTGGATCATTACGCATGTATGGTCAATCTCCTAGGACGTTCAGGTCGCATCGATCA
AGCAGTTGATCTAATAAAAAGTATGCCCCATGAACCAGATTTCCTGATATGGTCCACACTTCTATCTGTTAGTGCAACAAAGGGTGATGTTGCAAGTGCAGAAATGGCAG
GTAGGCATCTCTTCGTATTGGATCCTACCAATGCTGTGCCATATATTATGCTCTCAAATATGTATGCCTCTATGGGTAGATGGAAGGATGTAGCAGCAGTTAGGAGTGTG
ATGAAGAACAAGAATGTGAAGAAGTTTGCTGGGTACAGTTGGATTGAGATTGATAATGAGGTTCACAAATTCACATCCGAAGATCGAACACACCCAGAAACAGAACAAAT
CTATGAGGAATTGAAGATATTGATAAGGAAACTTGAAGAACAGGGATTTAGGCCTAATACAAATCTGGTTTTGCATGATGTTGGAGAGGAGGAGAAGTTGAAATCCATAT
GTTTCCACAGCGAGAAACTTGCGCTTGTCTTTGGTTTGATTAAGAAAGGTAATGGAGTTAGTCCAATAAGGATCATAAAGAACATTCGGATTTGCAGCGATTGCCATGAG
TTCATGAAGTTTGCATCTATGAGTATTAGAAGGCAAATAATCTTGAGAGATTCAAATAGATTTCATCATTTTTCAAATGGGAAATGCTCCTGCAAGGATAACTGGTAA
Protein sequenceShow/hide protein sequence
MKAKSKLRQAVDLLCSRSTATSEAYTQLVLECVRANEIDQAKRLQSHMEHHLFQPPDPFLHNQLLHLYAKFGKLRDAQNLFDKMLERDVFSWNALLSAYAKSGSIQDLRA
TFDRMPYRDSVSYNTIIAGLSGNSFPKESLELFRRMQREGLAPTEYTNVSALNASAQLLDLRRGKQIHGSVIVHNYLGNVFICNALTDMYAKCGEIEQARWLFDRLTNKN
LVSWNLMISGYVKNGQPEKCIGLLHDMRLSGHMPDQVTLSSVIAAYCQCGRADEARRVFNEFKDKDIVCWTAMLVGYAKSGREEDALLLFNEMLLEHFEPDSYTLSSVVS
SCAKLASLYHGQAIHGKSILAGLDNNLLVSSALIDMYSKCGLIEDARSVFDVMPTRNVITWNAMIVGYAQNGRDKDTLELFENMLQEKFKPDNVTFVGVLSACLHSNFIE
QGQVFFDSISNQHGLTPSLDHYACMVNLLGRSGRIDQAVDLIKSMPHEPDFLIWSTLLSVSATKGDVASAEMAGRHLFVLDPTNAVPYIMLSNMYASMGRWKDVAAVRSV
MKNKNVKKFAGYSWIEIDNEVHKFTSEDRTHPETEQIYEELKILIRKLEEQGFRPNTNLVLHDVGEEEKLKSICFHSEKLALVFGLIKKGNGVSPIRIIKNIRICSDCHE
FMKFASMSIRRQIILRDSNRFHHFSNGKCSCKDNW