CuGenDBv2

Sgr013451 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr013451
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: tig00153871:44055..65179
RNA-Seq Expression: Sgr013451
Synteny: Sgr013451
Gene Ontology terms:
GO:0003677 - DNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR003340 - B3 DNA binding domain
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR015300 - DNA-binding pseudobarrel domain superfamily
IPR032867 - DYW domain
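
The genome location in this record follows a `contig:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a convention this page does not state), the span covered by the gene model could be computed as below. The helper name `parse_location` is hypothetical and not part of any CuGenDB tooling.

```python
import re

def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)

contig, start, end = parse_location("tig00153871:44055..65179")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(contig, start, end, span)
```

Under a 0-based half-open convention the span would instead be `end - start`, so the convention should be checked before relying on the number.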


Homology
GenBank top hits | e value | % identity | Alignment
KAG6571981.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia] | e value: 0.0e+00 | % identity: 75.88
Query:  MVESTLTYEECRRQRLEENKKRMEELNLNKLADALKASSPKSSPTKQLKRPRQPLDITSVRVRRSSRFADKPPPNYKEVPIEPLPGVRRIYQRRDLLNRV
        MVES LTYEECRRQRLEENKKRMEELNLNKLADALK+SSPKSSPTKQLKRPRQPLDI+S+ VRRSSRFADKPPP+YKE PIEPL G+RR YQRRDLLNRV
Subjt:  MVESTLTYEECRRQRLEENKKRMEELNLNKLADALKASSPKSSPTKQLKRPRQPLDITSVRVRRSSRFADKPPPNYKEVPIEPLPGVRRIYQRRDLLNRV

Query:  YASHEERRYAIDRAEELQSSLESRYPSFVKPMLQSHVTGGFWLGLPVQFCKTHLPHHDEMVTLVDEDANEFQTKYLAEKTGLSGGWRGFSLDHQLVDGDA
        YAS  ER+YAIDRA +LQSSLESRYPSFVKPMLQSHVTGGFWLGLPV FCK HLP  DEM+TLVDED NEFQTKYLAEKTGLSGGWRGFS+DHQLVDGD 
Subjt:  YASHEERRYAIDRAEELQSSLESRYPSFVKPMLQSHVTGGFWLGLPVQFCKTHLPHHDEMVTLVDEDANEFQTKYLAEKTGLSGGWRGFSLDHQLVDGDA

Query:  LVFQLTKPTEFKVYIIRAYNSEDKADTNEDSDVSQRESSGKRITRSASKGLFRSEMASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKAL
        LVFQLTKPTEFKVYIIRAYN ED+ +T+EDSDV+Q ES+GKR T S   G    ++  LM    AR P+   S+ +V   L  +       Q E   K+ 
Subjt:  LVFQLTKPTEFKVYIIRAYNSEDKADTNEDSDVSQRESSGKRITRSASKGLFRSEMASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKAL

Query:  STSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFSEFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAATK
        S SA       F P                           +  + G  N  + S +              ++Y      S PNP      Q++     +
Subjt:  STSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFSEFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAATK

Query:  RPKPMEQSNQGYPQFGKPPQRNPQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNA
         P      NQG PQFGKP QRN Q ENS Q NNQ GIQ H AQN A NALVSPIDELRRFCGEGK+KEAVELLKEGVKADADCFH LFELCGKSKSF+NA
Subjt:  RPKPMEQSNQGYPQFGKPPQRNPQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNA

Query:  KIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEE
        K+VHDYFLQST RSDLQLNNKVLEMYGKCGSMSDA+RVFDHMPDR+I+SWHLMIKGYADNG GDEGLELFENMKKLGLQP+S+TFLF+MSACASASAVEE
Subjt:  KIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEE

Query:  GFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAI
        GFMYFESMKNDYHI P+MDHYL LLG+LGEPGHINEAFEYVEKLPMEPTVE+WETLKNYARIHGDVDLEDYAEELIV LDPTKAASNKI TPPPKKRSAI
Subjt:  GFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAI

Query:  SMLDGKNRIGEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIV
        SMLDGKNRI EFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIV
Subjt:  SMLDGKNRIGEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIV

Query:  GRELIVRDNKRFHHFKDGKCSCGDY
        GRELIVRDNKRFHHFKDGKCSCGDY
Subjt:  GRELIVRDNKRFHHFKDGKCSCGDY

XP_022136002.1 pentatricopeptide repeat-containing protein At2g15690 [Momordica charantia] | e value: 1.5e-289 | % identity: 83.58
Query:  MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFS
        MASLM VRRAR PIL SSFFKVR PLPS F+FSCGNQTET IKALSTSA+PNDYSNF P PQQ+P+SDPR LQGR TPGQWGTPSQVH   GNFNNQSFS
Subjt:  MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFS

Query:  EFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAAT---------------------------------KRPKPMEQSNQGYPQFGKPPQRN
        EFQNRDYVQQGS  NQ+NYQ+QN+ SHPNPGF++QGQ YTQA                                   + P      NQGYPQ G P QRN
Subjt:  EFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAAT---------------------------------KRPKPMEQSNQGYPQFGKPPQRN

Query:  PQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKV
        PQVEN NQ NNQ G+QGHGAQ QA NALV PIDELRR CG+GKIKEAVELLKEGVKADADCFHV+FELCGKSKSFDNAKIVHDYFLQST R DLQLNNKV
Subjt:  PQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKV

Query:  LEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYL
        LEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNG GDEGLELFENMKKLGLQPNS+TFL++MSACAS SAVEEGFMYFESMKNDYHI P+MDHYL
Subjt:  LEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYL

Query:  ALLGVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDE
         LLG+LGEPGHINEAFEYVEKLPMEPTVE+WETLKNYARIHG+VDLEDYAEELIVALDPTKA  NKI TPPPKKRSAISMLDGKNRI EFRNPTLYKDDE
Subjt:  ALLGVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDE

Query:  KLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSC
        KLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSC
Subjt:  KLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSC

Query:  GDY
        GDY
Subjt:  GDY

XP_022952757.1 pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like [Cucurbita moschata] | e value: 4.7e-283 | % identity: 83.17
Query:  MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFS
        MASLM VRR R+PI ISSF KVRSPLPS FTFSCGN+TETLIKALSTSA P+D+SNFP PPQQ  SSDPR LQ     GQWG+PSQVH   GNFNNQSFS
Subjt:  MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFS

Query:  EFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPMEQSNQGYPQFGKPPQRNPQV
        EFQNRDYVQQGSPSNQ+NY++QNQSS+PNPGF RQGQSYTQ                                 + P      NQG PQFGKP QRN Q 
Subjt:  EFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPMEQSNQGYPQFGKPPQRNPQV

Query:  ENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM
        ENS Q NNQ GIQGHGAQN   NALVSPIDELRRFCGEGK+KEAVELLKEGVKADADCFH  FELCGKSKSF+NAK+VHDYFLQST RSDLQLNNKVLEM
Subjt:  ENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM

Query:  YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALL
        YGKCGSMSDA+RVFDHM DR+I+SWHLMIKGYADNG GDEGLELFENMKKLGL PNS+TFLF+MSACASASAVEEGFMYFESMKNDYHI PDMDHYL LL
Subjt:  YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALL

Query:  GVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKLK
        G+LGEPGHINEAFEYVEKLPMEPTVE+WETLKNYARIHGDVDLEDYAEELIV LDPTKAASNKI TPPPKKR AISMLDGKNRI EFRNPTLYKDDEKLK
Subjt:  GVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKLK

Query:  ALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        ALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
Subjt:  ALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

XP_022972422.1 pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like [Cucurbita maxima] | e value: 3.0e-282 | % identity: 83.33
Query:  MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFS
        MASLM VRR R+PI ISSF KVRSPLPS FTFSCGNQTETLIKALSTSA P+D+SNFP PPQQ  SS PR LQ     GQ G+PSQVH   GNFNNQSFS
Subjt:  MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFS

Query:  EFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPMEQSNQGYPQFGKPPQRNPQV
        EFQNRDYVQ GSPSNQ+N ++QNQSS+PNPGF RQGQSYTQ                                 + P      NQG PQFGKP QRN Q 
Subjt:  EFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPMEQSNQGYPQFGKPPQRNPQV

Query:  ENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM
        ENS Q NNQ GIQGHGAQN A NALVSPIDELRRFCGEGK+KEAVELLKEGVKADADCFH LFELCGKSKSFDNAK+VHDYFLQST RSDLQLNNKVLEM
Subjt:  ENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM

Query:  YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALL
        YGKCGSMSDA+RVFDHMPDR+I+SWHLMIKGYADNG GDEGLELFENMKKLGLQPNS+TFLF+MSACASASAVEEGFMYFESMKNDYHI PDMDHYL LL
Subjt:  YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALL

Query:  GVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKLK
        G+LGEPGHINEAFEYVEKLP+EPTVE+WETLKNYA+IHGDVDLEDYAEELIV LDPTKAASNKI TPPPKKRSAISMLDGKNRI EFRNPTLYKDDEKLK
Subjt:  GVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKLK

Query:  ALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        ALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
Subjt:  ALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

XP_023530084.1 pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like [Cucurbita pepo subsp. pepo] | e value: 3.4e-281 | % identity: 82.2
Query:  MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFS
        MASLM VRRAR+PILIS FFKVRS LPSRFTFSCGNQTETLIKAL TSA PND+SNFPPPPQQ  SSDPR LQ     GQWG+P+QVH R GNFNNQSFS
Subjt:  MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFS

Query:  EFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPMEQSNQGYPQFGKPPQRNPQV
        EFQNRDYV QGS +NQ+NY+++NQSSHPNPGF+RQGQ Y+QA                                + P      NQGYPQFG+P Q NPQV
Subjt:  EFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPMEQSNQGYPQFGKPPQRNPQV

Query:  ENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM
        ENS Q NNQ G+QGHG+QNQA N  VSP DELRRFC EGKIKEAVELLKEGVKADADCF VLF+LCGKS SFDNAK+VHDYFLQSTYRSDLQLNNKVLEM
Subjt:  ENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM

Query:  YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALL
        YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYA+NG GDEGLELFE+MKKLGLQPNS+TFL++M ACASASA+EEGFMYFESMK+DY I PD+DHYL LL
Subjt:  YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALL

Query:  GVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASN-KITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKL
        GVLGEPGH+NEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIV L PTKAAS+ KI TPPPK+RSAISMLDGKNRI EFRNPTLYKDDEKL
Subjt:  GVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASN-KITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKL

Query:  KALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD
        KALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD
Subjt:  KALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD

Query:  Y
        Y
Subjt:  Y

TrEMBL top hits | e value | % identity | Alignment
A0A6J1C4C3 pentatricopeptide repeat-containing protein At2g15690 | e value: 7.3e-290 | % identity: 83.58
Query:  MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFS
        MASLM VRRAR PIL SSFFKVR PLPS F+FSCGNQTET IKALSTSA+PNDYSNF P PQQ+P+SDPR LQGR TPGQWGTPSQVH   GNFNNQSFS
Subjt:  MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFS

Query:  EFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAAT---------------------------------KRPKPMEQSNQGYPQFGKPPQRN
        EFQNRDYVQQGS  NQ+NYQ+QN+ SHPNPGF++QGQ YTQA                                   + P      NQGYPQ G P QRN
Subjt:  EFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAAT---------------------------------KRPKPMEQSNQGYPQFGKPPQRN

Query:  PQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKV
        PQVEN NQ NNQ G+QGHGAQ QA NALV PIDELRR CG+GKIKEAVELLKEGVKADADCFHV+FELCGKSKSFDNAKIVHDYFLQST R DLQLNNKV
Subjt:  PQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKV

Query:  LEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYL
        LEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNG GDEGLELFENMKKLGLQPNS+TFL++MSACAS SAVEEGFMYFESMKNDYHI P+MDHYL
Subjt:  LEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYL

Query:  ALLGVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDE
         LLG+LGEPGHINEAFEYVEKLPMEPTVE+WETLKNYARIHG+VDLEDYAEELIVALDPTKA  NKI TPPPKKRSAISMLDGKNRI EFRNPTLYKDDE
Subjt:  ALLGVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDE

Query:  KLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSC
        KLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSC
Subjt:  KLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSC

Query:  GDY
        GDY
Subjt:  GDY

A0A6J1EIR6 pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like | e value: 2.1e-273 | % identity: 80.87
Query:  MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFS
        MASLM VRRAR+PILIS F KVRS LPSRF FSCGNQTETLIKAL TSA PND+SNFPPPPQQH SSDPR LQ     GQWG+P+QVH R GNFNNQSFS
Subjt:  MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFS

Query:  EFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPMEQSNQGYPQFGKPPQRNPQV
        EFQNRDYVQQGS +NQ+NY+++NQSSHPNPGF+RQGQ Y+QA                                + P      NQGYPQFG+P Q NPQV
Subjt:  EFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPMEQSNQGYPQFGKPPQRNPQV

Query:  ENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM
        ENS Q N           NQA N  VSPIDELRRFC EGKIKEAVELLKEGVKADADCF VLF+LCGKS SFDNAK+VHDYFLQST RSDLQLNNKVLEM
Subjt:  ENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM

Query:  YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALL
        YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYA+NG GDEGLELFE+M+KLGLQPNS+TFLF+M ACASASA+EEGFMYFESMKNDY I PD+DHYL LL
Subjt:  YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALL

Query:  GVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASN-KITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKL
        GVLGEPGH+NEAFEY EKLPMEPTVE+WETLKNYARIHGDVDLEDYAEELIV LDPTKAAS+ KI TPPPK+RSAISMLDGKNRI EFRNPTLYKDDEKL
Subjt:  GVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASN-KITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKL

Query:  KALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD
        KALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD
Subjt:  KALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD

Query:  Y
        Y
Subjt:  Y

A0A6J1GL94 pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like | e value: 2.3e-283 | % identity: 83.17
Query:  MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFS
        MASLM VRR R+PI ISSF KVRSPLPS FTFSCGN+TETLIKALSTSA P+D+SNFP PPQQ  SSDPR LQ     GQWG+PSQVH   GNFNNQSFS
Subjt:  MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFS

Query:  EFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPMEQSNQGYPQFGKPPQRNPQV
        EFQNRDYVQQGSPSNQ+NY++QNQSS+PNPGF RQGQSYTQ                                 + P      NQG PQFGKP QRN Q 
Subjt:  EFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPMEQSNQGYPQFGKPPQRNPQV

Query:  ENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM
        ENS Q NNQ GIQGHGAQN   NALVSPIDELRRFCGEGK+KEAVELLKEGVKADADCFH  FELCGKSKSF+NAK+VHDYFLQST RSDLQLNNKVLEM
Subjt:  ENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM

Query:  YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALL
        YGKCGSMSDA+RVFDHM DR+I+SWHLMIKGYADNG GDEGLELFENMKKLGL PNS+TFLF+MSACASASAVEEGFMYFESMKNDYHI PDMDHYL LL
Subjt:  YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALL

Query:  GVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKLK
        G+LGEPGHINEAFEYVEKLPMEPTVE+WETLKNYARIHGDVDLEDYAEELIV LDPTKAASNKI TPPPKKR AISMLDGKNRI EFRNPTLYKDDEKLK
Subjt:  GVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKLK

Query:  ALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        ALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
Subjt:  ALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

A0A6J1I4R8 pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like | e value: 1.5e-282 | % identity: 83.33
Query:  MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFS
        MASLM VRR R+PI ISSF KVRSPLPS FTFSCGNQTETLIKALSTSA P+D+SNFP PPQQ  SS PR LQ     GQ G+PSQVH   GNFNNQSFS
Subjt:  MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFS

Query:  EFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPMEQSNQGYPQFGKPPQRNPQV
        EFQNRDYVQ GSPSNQ+N ++QNQSS+PNPGF RQGQSYTQ                                 + P      NQG PQFGKP QRN Q 
Subjt:  EFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPMEQSNQGYPQFGKPPQRNPQV

Query:  ENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM
        ENS Q NNQ GIQGHGAQN A NALVSPIDELRRFCGEGK+KEAVELLKEGVKADADCFH LFELCGKSKSFDNAK+VHDYFLQST RSDLQLNNKVLEM
Subjt:  ENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM

Query:  YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALL
        YGKCGSMSDA+RVFDHMPDR+I+SWHLMIKGYADNG GDEGLELFENMKKLGLQPNS+TFLF+MSACASASAVEEGFMYFESMKNDYHI PDMDHYL LL
Subjt:  YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALL

Query:  GVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKLK
        G+LGEPGHINEAFEYVEKLP+EPTVE+WETLKNYA+IHGDVDLEDYAEELIV LDPTKAASNKI TPPPKKRSAISMLDGKNRI EFRNPTLYKDDEKLK
Subjt:  GVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKLK

Query:  ALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        ALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
Subjt:  ALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

A0A6J1JDU2 pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like isoform X1 | e value: 8.4e-278 | % identity: 81.36
Query:  MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFS
        MASLM VRRAR+PILIS FFKVRS LPSRFTFSCGNQTETLIKAL TSA PND+SNFPPPPQQH SSDPR LQ      QWG+PSQVH R GNFNNQSFS
Subjt:  MASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFNNQSFS

Query:  EFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPMEQSNQGYPQFGKPPQRNPQV
        EF NRDYV QGS +NQ+NY+++NQSSHPNPG +RQGQ Y+ A                                + P      NQGYPQFG+P Q  PQV
Subjt:  EFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAAT------------------------------KRPKPMEQSNQGYPQFGKPPQRNPQV

Query:  ENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM
        ENS Q NNQ G+QGHG+QNQA N  VS  DELRRFC EGKIKEAVELLKEGVKADADCF VLF+LCGKS SFDNAK+VHDYFLQSTYRSDLQLNNKVLEM
Subjt:  ENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEM

Query:  YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALL
        YGKCGS+SDARRVFDHMPDRNIDSWHLMI+GYA+NG GDEGLELFE+MKKLGLQPNS+TFLF+M ACASASA+EEGFMYFESMKNDY I PD+DHYL LL
Subjt:  YGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALL

Query:  GVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASN-KITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKL
        GVLGEPGH+NEAFEYVEKLPMEPTVE+WETLKNYARIHGDVDLED AEELIV LDPTKAAS+ KI TPPPK+RSAISMLDGKNRI EFRNPTLYKDDEKL
Subjt:  GVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASN-KITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKL

Query:  KALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD
        KALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD
Subjt:  KALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGD

Query:  Y
        Y
Subjt:  Y

SwissProt top hits | e value | % identity | Alignment
A8MQA3 Pentatricopeptide repeat-containing protein At4g21065 | e value: 2.6e-74 | % identity: 37.19
Query:  FCGEGKIKEAVELLKE----GVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIK
        F   GK +EA+ L  E    G+K D      L   C K  +    K VH Y ++     +L  +N +L++Y +CG + +A+ +FD M D+N  SW  +I 
Subjt:  FCGEGKIKEAVELLKE----GVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIK

Query:  GYADNGFGDEGLELFENMKKL-GLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPTVEIWE
        G A NGFG E +ELF+ M+   GL P   TF+ I+ AC+    V+EGF YF  M+ +Y I P ++H+  ++ +L   G + +A+EY++ +PM+P V IW 
Subjt:  GYADNGFGDEGLELFENMKKL-GLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPTVEIWE

Query:  TLKNYARIHGDVDLEDYAEELIVALDPTKAAS---------------------NKITTPPPKKRSAISMLDGKNRIGEF-----RNPTLYKDDEKLKALK
        TL     +HGD DL ++A   I+ L+P  +                        ++     KK    S+++  NR+ EF      +P       KLK + 
Subjt:  TLKNYARIHGDVDLEDYAEELIVALDPTKAAS---------------------NKITTPPPKKRSAISMLDGKNRIGEF-----RNPTLYKDDEKLKALK

Query:  A-MKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
          ++ +GYVP    V  D+++E KE A++YHSE++AIA+ LISTP R+P+ ++KNLR+C DCH AIK++S++  RE++VRD  RFHHFK+G CSC DY
Subjt:  A-MKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

Q680H3 Pentatricopeptide repeat-containing protein At2g25580 | e value: 5.2e-83 | % identity: 41.12
Query:  IDELRRFCGEGKIKEAVE----LLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDS
        I+E   FC  GK+K+A+     L       D      L ++CG+++    AK VH     S    DL  N+ +LEMY  CG  ++A  VF+ M ++N+++
Subjt:  IDELRRFCGEGKIKEAVE----LLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDS

Query:  WHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPT
        W ++I+ +A NGFG++ +++F   K+ G  P+ + F  I  AC     V+EG ++FESM  DY I P ++ Y++L+ +   PG ++EA E+VE++PMEP 
Subjt:  WHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPT

Query:  VEIWETLKNYARIHGDVDLEDYAEELIVALDPTK-----------AASNKITTPPPKKRSAISMLDG-KNRIGEFR--NPTLYKDDEKLKALKAMK----
        V++WETL N +R+HG+++L DY  E++  LDPT+             ++ +     KKRS I  L G K+ + EFR  +  L ++DE  + L+ +K    
Subjt:  VEIWETLKNYARIHGDVDLEDYAEELIVALDPTK-----------AASNKITTPPPKKRSAISMLDG-KNRIGEFR--NPTLYKDDEKLKALKAMK----

Query:  EQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        E GYV +TR  LHDIDQE+KE  LL HSER+A A  ++++  R P  +IKNLR+C DCHNA+KIMS IVGRE+I RD KRFH  K+G C+C DY
Subjt:  EQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

Q9LIQ7 Pentatricopeptide repeat-containing protein At3g24000, mitochondrial | e value: 1.0e-75 | % identity: 36.54
Query:  NALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNI
        NAL++     RR   E  ++    +L++G +     +  LF  C  +   +  K VH Y ++S  +      N +L+MY K GS+ DAR++FD +  R++
Subjt:  NALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNI

Query:  DSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLPME
         SW+ ++  YA +GFG E +  FE M+++G++PN  +FL +++AC+ +  ++EG+ Y+E MK D  I P+  HY+ ++ +LG  G +N A  ++E++P+E
Subjt:  DSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLPME

Query:  PTVEIWETLKNYARIHGDVDLEDYAEELIVALDP---------------------TKAASNKITTPPPKKRSAISMLDGKNRIGEF-RNPTLYKDDEKL-
        PT  IW+ L N  R+H + +L  YA E +  LDP                           K+     KK  A S ++ +N I  F  N   +   E++ 
Subjt:  PTVEIWETLKNYARIHGDVDLEDYAEELIVALDP---------------------TKAASNKITTPPPKKRSAISMLDGKNRIGEF-RNPTLYKDDEKL-

Query:  ----KALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKC
            + L  +KE GYVPDT +V+  +DQ+ +E  L YHSE++A+A+ L++TP  + + I KN+R+CGDCH AIK+ S++VGRE+IVRD  RFHHFKDG C
Subjt:  ----KALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKC

Query:  SCGDY
        SC DY
Subjt:  SCGDY

Q9SUU7 Pentatricopeptide repeat-containing protein At4g32450, mitochondrial | e value: 9.4e-85 | % identity: 36.17
Query:  GRETPGQWGTPSQVHHRVG-----NFNNQSFSEFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAATKRPKPMEQSNQ------GYPQFGK
        G E P          H +G     N   QS   FQ   Y Q  +P +  N         P   F + G +  Q+  +  + + Q NQ      G   +G 
Subjt:  GRETPGQWGTPSQVHHRVG-----NFNNQSFSEFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAATKRPKPMEQSNQ------GYPQFGK

Query:  PPQRNPQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLK----EGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYR
             PQ  N+   + Q    GH           S +DEL   C EGK+K+AVE++K    EG   D      + +LCG +++   AK+VH++   S   
Subjt:  PPQRNPQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLK----EGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYR

Query:  SDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYH
        SD+   N ++EMY  CGS+ DA  VF+ MP+RN+++W  +I+ +A NG G++ ++ F   K+ G +P+   F  I  AC     + EG ++FESM  +Y 
Subjt:  SDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYH

Query:  ITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAI------SMLDGKN
        I P M+HY++L+ +L EPG+++EA  +VE   MEP V++WETL N +R+HGD+ L D  ++++  LD ++          P K S +       M  G N
Subjt:  ITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAI------SMLDGKN

Query:  ------RIGEFRNPTLYKDDEKLKALKAMKEQ----GYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMS
                G+   P   ++ E   ALK++KE     GYVP ++  LHD+DQE+K++ L  H+ER A     + TPAR+ +R++KNLR+C DCHNA+K+MS
Subjt:  ------RIGEFRNPTLYKDDEKLKALKAMKEQ----GYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMS

Query:  RIVGRELIVRDNKRFHHFKDGKCSCGDY
        +IVGRELI RD KRFHH KDG CSC +Y
Subjt:  RIVGRELIVRDNKRFHHFKDGKCSCGDY

Q9ZQE5 Pentatricopeptide repeat-containing protein At2g15690, mitochondrial | e value: 8.4e-142 | % identity: 48.99
Query:  MASLMTVRRARSP--ILISSFFKVRSPLP---SRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFN
        M+SLM +R AR+   + I S  ++RS  P   S+F FS G      IK LSTSA  NDY   P          P   Q  ++  Q  T  +V      ++
Subjt:  MASLMTVRRARSP--ILISSFFKVRSPLP---SRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFN

Query:  NQSFSEF-----QNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAATKRPKPMEQSNQGYPQFGKPPQ--RNPQVENSNQ-----PNNQVGIQ
         Q   +      QN  +  Q  P    N Q   Q S       + G    Q    RP    Q     PQ+G P    +N  V+ SNQ     P  Q   Q
Subjt:  NQSFSEF-----QNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAATKRPKPMEQSNQGYPQFGKPPQ--RNPQVENSNQ-----PNNQVGIQ

Query:  GHGAQNQASNAL--VSP---IDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMS
           + NQ+ N +  V+P   ++E+ R C     K+A+ELL +G   D +CF +LFE C   KS +++K VHD+FLQS +R D +LNN V+ M+G+C S++
Subjt:  GHGAQNQASNAL--VSP---IDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMS

Query:  DARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGH
        DA+RVFDHM D+++DSWHLM+  Y+DNG GD+ L LFE M K GL+PN  TFL +  ACA+   +EE F++F+SMKN++ I+P  +HYL +LGVLG+ GH
Subjt:  DARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGH

Query:  INEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKLKALKAMKEQ
        + EA +Y+  LP EPT + WE ++NYAR+HGD+DLEDY EEL+V +DP+KA  NKI TPPPK     +M+  K+RI EFRN T YKD+ K  A K  K  
Subjt:  INEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKLKALKAMKEQ

Query:  GYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
         YVPDTR+VLHDIDQEAKEQALLYHSERLAIAYG+I TP R  L IIKNLR+CGDCHN IKIMS+I+GR LIVRDNKRFHHFKDGKCSCGDY
Subjt:  GYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

Arabidopsis top hits | e value | % identity | Alignment
AT2G15690.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 6.0e-143 | % identity: 48.99
Query:  MASLMTVRRARSP--ILISSFFKVRSPLP---SRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFN
        M+SLM +R AR+   + I S  ++RS  P   S+F FS G      IK LSTSA  NDY   P          P   Q  ++  Q  T  +V      ++
Subjt:  MASLMTVRRARSP--ILISSFFKVRSPLP---SRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGRETPGQWGTPSQVHHRVGNFN

Query:  NQSFSEF-----QNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAATKRPKPMEQSNQGYPQFGKPPQ--RNPQVENSNQ-----PNNQVGIQ
         Q   +      QN  +  Q  P    N Q   Q S       + G    Q    RP    Q     PQ+G P    +N  V+ SNQ     P  Q   Q
Subjt:  NQSFSEF-----QNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAATKRPKPMEQSNQGYPQFGKPPQ--RNPQVENSNQ-----PNNQVGIQ

Query:  GHGAQNQASNAL--VSP---IDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMS
           + NQ+ N +  V+P   ++E+ R C     K+A+ELL +G   D +CF +LFE C   KS +++K VHD+FLQS +R D +LNN V+ M+G+C S++
Subjt:  GHGAQNQASNAL--VSP---IDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMS

Query:  DARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGH
        DA+RVFDHM D+++DSWHLM+  Y+DNG GD+ L LFE M K GL+PN  TFL +  ACA+   +EE F++F+SMKN++ I+P  +HYL +LGVLG+ GH
Subjt:  DARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGH

Query:  INEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKLKALKAMKEQ
        + EA +Y+  LP EPT + WE ++NYAR+HGD+DLEDY EEL+V +DP+KA  NKI TPPPK     +M+  K+RI EFRN T YKD+ K  A K  K  
Subjt:  INEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKLKALKAMKEQ

Query:  GYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
         YVPDTR+VLHDIDQEAKEQALLYHSERLAIAYG+I TP R  L IIKNLR+CGDCHN IKIMS+I+GR LIVRDNKRFHHFKDGKCSCGDY
Subjt:  GYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

AT2G25580.1 Tetratricopeptide repeat (TPR)-like superfamily protein    3.7e-84    41.12
Query:  IDELRRFCGEGKIKEAVE----LLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDS
        I+E   FC  GK+K+A+     L       D      L ++CG+++    AK VH     S    DL  N+ +LEMY  CG  ++A  VF+ M ++N+++
Subjt:  IDELRRFCGEGKIKEAVE----LLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDS

Query:  WHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPT
        W ++I+ +A NGFG++ +++F   K+ G  P+ + F  I  AC     V+EG ++FESM  DY I P ++ Y++L+ +   PG ++EA E+VE++PMEP 
Subjt:  WHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPT

Query:  VEIWETLKNYARIHGDVDLEDYAEELIVALDPTK-----------AASNKITTPPPKKRSAISMLDG-KNRIGEFR--NPTLYKDDEKLKALKAMK----
        V++WETL N +R+HG+++L DY  E++  LDPT+             ++ +     KKRS I  L G K+ + EFR  +  L ++DE  + L+ +K    
Subjt:  VEIWETLKNYARIHGDVDLEDYAEELIVALDPTK-----------AASNKITTPPPKKRSAISMLDG-KNRIGEFR--NPTLYKDDEKLKALKAMK----

Query:  EQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        E GYV +TR  LHDIDQE+KE  LL HSER+A A  ++++  R P  +IKNLR+C DCHNA+KIMS IVGRE+I RD KRFH  K+G C+C DY
Subjt:  EQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein    1.8e-75    37.19
Query:  FCGEGKIKEAVELLKE----GVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIK
        F   GK +EA+ L  E    G+K D      L   C K  +    K VH Y ++     +L  +N +L++Y +CG + +A+ +FD M D+N  SW  +I 
Subjt:  FCGEGKIKEAVELLKE----GVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIK

Query:  GYADNGFGDEGLELFENMKKL-GLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPTVEIWE
        G A NGFG E +ELF+ M+   GL P   TF+ I+ AC+    V+EGF YF  M+ +Y I P ++H+  ++ +L   G + +A+EY++ +PM+P V IW 
Subjt:  GYADNGFGDEGLELFENMKKL-GLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPTVEIWE

Query:  TLKNYARIHGDVDLEDYAEELIVALDPTKAAS---------------------NKITTPPPKKRSAISMLDGKNRIGEF-----RNPTLYKDDEKLKALK
        TL     +HGD DL ++A   I+ L+P  +                        ++     KK    S+++  NR+ EF      +P       KLK + 
Subjt:  TLKNYARIHGDVDLEDYAEELIVALDPTKAAS---------------------NKITTPPPKKRSAISMLDGKNRIGEF-----RNPTLYKDDEKLKALK

Query:  A-MKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
          ++ +GYVP    V  D+++E KE A++YHSE++AIA+ LISTP R+P+ ++KNLR+C DCH AIK++S++  RE++VRD  RFHHFK+G CSC DY
Subjt:  A-MKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

AT4G21065.2 Tetratricopeptide repeat (TPR)-like superfamily protein    1.8e-75    37.19
Query:  FCGEGKIKEAVELLKE----GVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIK
        F   GK +EA+ L  E    G+K D      L   C K  +    K VH Y ++     +L  +N +L++Y +CG + +A+ +FD M D+N  SW  +I 
Subjt:  FCGEGKIKEAVELLKE----GVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIK

Query:  GYADNGFGDEGLELFENMKKL-GLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPTVEIWE
        G A NGFG E +ELF+ M+   GL P   TF+ I+ AC+    V+EGF YF  M+ +Y I P ++H+  ++ +L   G + +A+EY++ +PM+P V IW 
Subjt:  GYADNGFGDEGLELFENMKKL-GLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPTVEIWE

Query:  TLKNYARIHGDVDLEDYAEELIVALDPTKAAS---------------------NKITTPPPKKRSAISMLDGKNRIGEF-----RNPTLYKDDEKLKALK
        TL     +HGD DL ++A   I+ L+P  +                        ++     KK    S+++  NR+ EF      +P       KLK + 
Subjt:  TLKNYARIHGDVDLEDYAEELIVALDPTKAAS---------------------NKITTPPPKKRSAISMLDGKNRIGEF-----RNPTLYKDDEKLKALK

Query:  A-MKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
          ++ +GYVP    V  D+++E KE A++YHSE++AIA+ LISTP R+P+ ++KNLR+C DCH AIK++S++  RE++VRD  RFHHFK+G CSC DY
Subjt:  A-MKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

AT4G32450.1 Pentatricopeptide repeat (PPR) superfamily protein    6.7e-86    36.17
Query:  GRETPGQWGTPSQVHHRVG-----NFNNQSFSEFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAATKRPKPMEQSNQ------GYPQFGK
        G E P          H +G     N   QS   FQ   Y Q  +P +  N         P   F + G +  Q+  +  + + Q NQ      G   +G 
Subjt:  GRETPGQWGTPSQVHHRVG-----NFNNQSFSEFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAATKRPKPMEQSNQ------GYPQFGK

Query:  PPQRNPQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLK----EGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYR
             PQ  N+   + Q    GH           S +DEL   C EGK+K+AVE++K    EG   D      + +LCG +++   AK+VH++   S   
Subjt:  PPQRNPQVENSNQPNNQVGIQGHGAQNQASNALVSPIDELRRFCGEGKIKEAVELLK----EGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYR

Query:  SDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYH
        SD+   N ++EMY  CGS+ DA  VF+ MP+RN+++W  +I+ +A NG G++ ++ F   K+ G +P+   F  I  AC     + EG ++FESM  +Y 
Subjt:  SDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSWHLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYH

Query:  ITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAI------SMLDGKN
        I P M+HY++L+ +L EPG+++EA  +VE   MEP V++WETL N +R+HGD+ L D  ++++  LD ++          P K S +       M  G N
Subjt:  ITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYARIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAI------SMLDGKN

Query:  ------RIGEFRNPTLYKDDEKLKALKAMKEQ----GYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMS
                G+   P   ++ E   ALK++KE     GYVP ++  LHD+DQE+K++ L  H+ER A     + TPAR+ +R++KNLR+C DCHNA+K+MS
Subjt:  ------RIGEFRNPTLYKDDEKLKALKAMKEQ----GYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLISTPARTPLRIIKNLRICGDCHNAIKIMS

Query:  RIVGRELIVRDNKRFHHFKDGKCSCGDY
        +IVGRELI RD KRFHH KDG CSC +Y
Subjt:  RIVGRELIVRDNKRFHHFKDGKCSCGDY
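
The %identity column can be checked against the alignment blocks by counting characters on the unlabeled middle line. The sketch below assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap); the function name `midline_stats` is hypothetical and not part of any tool referenced on this page. It uses the last block of the AT4G32450.1 alignment as input.

```python
def midline_stats(query, midline):
    """Count identities and positives for one BLAST-style alignment block.

    query   -- the residues on the 'Query:' line (gaps included)
    midline -- the unlabeled line between 'Query:' and 'Subjt:'
    Returns (identities, positives, aligned_length).
    """
    aligned_length = len(query)
    # Trailing spaces on the midline are often lost in copy/paste, so pad it.
    midline = midline.ljust(aligned_length)[:aligned_length]
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, aligned_length


# Last block of the AT4G32450.1 alignment
query = "RIVGRELIVRDNKRFHHFKDGKCSCGDY"
midline = "+IVGRELI RD KRFHH KDG CSC +Y"

ident, pos, length = midline_stats(query, midline)
print(f"identities {ident}/{length} ({100 * ident / length:.1f}%), "
      f"positives {pos}/{length}")
```

Summing the three counts over every block of a hit gives the figures for the whole alignment; a single block only reflects local conservation, so it will not generally match the table value.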


Sequences
CDS sequence
ATGGTGGAATCGACGCTCACCTATGAGGAATGCCGACGCCAGAGGTTGGAGGAGAATAAGAAGAGGATGGAAGAGCTTAATCTGAACAAGCTTGCCGATGCGCTAAAAGC
TTCCAGCCCTAAGTCCTCTCCGACGAAACAGCTAAAGCGTCCTCGCCAGCCACTCGATATCACGTCCGTCCGTGTGAGAAGGTCCAGCCGTTTTGCGGATAAGCCCCCTC
CGAACTATAAGGAGGTCCCCATTGAACCACTTCCAGGTGTCAGAAGGATCTATCAACGGAGAGATTTGTTGAATCGAGTTTATGCTTCACACGAAGAAAGACGATATGCT
ATTGACAGAGCAGAAGAACTTCAGTCTAGCCTGGAATCTAGGTACCCAAGTTTCGTGAAGCCCATGCTTCAGTCACATGTCACCGGGGGATTTTGGCTGGGTCTTCCAGT
TCAATTTTGCAAGACACACCTTCCCCATCATGATGAAATGGTTACTCTGGTGGATGAGGATGCGAATGAGTTCCAAACAAAATACCTTGCGGAGAAAACGGGTCTTAGCG
GTGGTTGGAGAGGGTTTTCACTTGATCACCAGTTAGTAGATGGGGATGCTTTGGTGTTTCAGTTAACTAAGCCAACTGAATTCAAGGTATATATCATCAGGGCATATAAT
TCCGAAGACAAAGCAGACACCAACGAGGATTCCGATGTCTCTCAACGGGAAAGTAGTGGCAAAAGAATTACTAGATCAGCAAGTAAAGGCCTTTTCAGATCAGAAATGGC
GTCTCTCATGACGGTCCGGCGTGCTCGAAGCCCCATACTCATCTCCTCATTCTTCAAGGTACGGTCTCCACTCCCTTCTCGTTTCACGTTCTCTTGTGGAAACCAGACAG
AGACTCTGATCAAAGCCCTAAGCACCTCTGCGGTCCCTAATGATTACTCAAATTTTCCTCCTCCACCACAACAACATCCTTCGTCTGATCCTAGAACTCTTCAGGGCCGG
GAAACGCCTGGCCAGTGGGGCACGCCAAGCCAGGTTCATCATCGAGTTGGAAACTTTAATAACCAGTCGTTCTCGGAGTTTCAGAATCGCGACTATGTTCAACAGGGAAG
CCCTAGTAATCAATTGAATTATCAGAATCAGAACCAAAGCTCTCATCCGAATCCTGGATTTGCCAGGCAGGGTCAGAGCTATACTCAAGCCGCAACAAAGAGGCCCAAAC
CAATGGAACAATCAAATCAGGGATACCCACAATTTGGAAAGCCTCCGCAGCGGAACCCACAAGTTGAGAATTCTAATCAGCCGAATAATCAGGTTGGGATTCAAGGACAC
GGTGCTCAAAATCAAGCATCAAATGCCCTTGTATCTCCTATCGATGAACTGCGGCGCTTTTGTGGAGAGGGAAAGATTAAAGAAGCTGTTGAATTGTTGAAAGAAGGTGT
TAAAGCTGATGCTGATTGTTTCCATGTGTTGTTTGAACTATGCGGGAAATCAAAGTCATTTGACAATGCAAAAATAGTTCATGATTACTTCTTACAGTCAACTTATAGAA
GTGATCTGCAATTGAATAATAAAGTGCTTGAGATGTATGGAAAATGTGGAAGCATGAGTGATGCACGGAGAGTGTTTGACCATATGCCTGATAGGAATATTGATTCTTGG
CATTTGATGATAAAAGGATATGCGGATAATGGATTTGGTGATGAGGGGCTAGAGTTATTTGAGAATATGAAGAAGCTGGGATTGCAACCCAATTCACGAACTTTCCTTTT
TATAATGTCAGCTTGTGCTAGTGCGAGTGCTGTAGAAGAAGGATTTATGTACTTTGAATCGATGAAAAATGATTATCATATCACCCCAGACATGGATCATTATTTAGCGC
TTTTAGGTGTTCTTGGAGAACCTGGACACATCAACGAGGCTTTCGAGTATGTTGAAAAACTGCCCATGGAACCCACAGTTGAAATCTGGGAGACTTTGAAAAACTATGCT
AGAATTCATGGAGATGTTGATCTTGAGGACTACGCCGAGGAGCTAATTGTTGCTCTGGACCCGACAAAAGCTGCTTCTAATAAGATAACCACACCACCTCCCAAAAAACG
GTCTGCCATTAGCATGCTTGATGGGAAGAACAGGATTGGTGAGTTCAGAAATCCGACTCTCTACAAAGATGATGAGAAGCTAAAGGCTTTGAAGGCAATGAAAGAACAAG
GTTATGTGCCAGATACTAGATATGTACTTCATGATATCGATCAGGAGGCCAAAGAGCAGGCATTGTTGTATCATAGTGAACGATTGGCTATTGCATATGGCCTGATCAGT
ACCCCAGCACGGACACCTCTTAGGATCATTAAGAACTTACGGATCTGTGGTGACTGTCACAATGCGATTAAGATCATGTCTAGGATTGTTGGGAGAGAGTTGATTGTAAG
GGACAACAAACGGTTCCATCATTTTAAGGATGGTAAATGTTCTTGTGGGGATTACTGTCCTCAAGGCTGCAGCTCTCCCCCAACCCCTCAACACCCTCATCACCTCCTCC
GCACTTATCTTCTTATCGCCGTTCAAATCGAACGTCCTGAAAGCAAACTCCACGTCCCTCGCCTGAACTCCCCCGCCGTTCCGGTAAACCTCCATGAACTCGCTCAGGTT
TATGTACCCATCGCCGTCGGCGTCCACCGCCCTGAATATCTTCTGCACCTCTTCCATAGAGTTCCCTCTTCCCAGCGCCTTCAGAATCCCCCTGTACTCGTGCTTCGAGA
TCCTCCCGTCTCTGTTCGAGTCGAACTTGTTGAAGATCTGCTTTATCTCCTCCGCGCTCGGCTGCAGCGCCCTTCTCAGGCCGGAGCTCTGCCGGTCCTTGAACGAAAAC
AGCCGGGAGGGCTCCCGGAGGAACTTCTTCTTGGAGACGTTGTATTGGAAGTCCAGGAGGTTCAAGTTCGTCGGCATTCTTTCTGTTGGGTTGCTCTGTGCTTCGAGAAC
TTTCGACTTCAGATCGTTATTCTGGAGAAATCTGTCGACACAAAATCATCAATCGAAACCATTAGATCATCTCTAAAAACTATAAAGAATGAAGATCATGTATCTGTAAT
AATCAGAAGTATGGTAGCTACCGTGACACCACTGAGTTCTTTCATCTACTCAGCTAATCCTTTTACGTCGTTATATCATGCCTTAATCAGCAAAACAGAGTTAGAAAACA
GAGCCCGAGAGTTTATTGTTACCGAGCCAACTGCAAAAGCAAATGCAGCCATTCAAGAAGTGGGTTTCAAACACACCATTAGCAGTGACATTCAGGTTTGTAAAGAAATT
CTTGCCGTTTACTGCAACATTAAAGACCCTCCAGCTAAACGGACTCGGAGAACGGTTGTCCTGAAACCTTCGTTGGCGGGAGGTTCCAGAAGTTTGAGGGAGTTATGTTG
GCATGGCATGTTACCAACGGGTTTTCGTCCACGAACGGGTGCCACAGCCGAAAATCTTACAAAAGAAGTTTGCCTACCCAATGACGCTTTCGTCGTGTCCAAAGCTGGTT
CTGGCAACCGTGCTCAGCGCATAGGTCTTGAAATCCGTCGAGTTATATACAGAATCCTCCAAAAATTCCAGCTCCAGAGCCGAAATGAAGGGGCTCGAAACCGTGTGCTG
GTTCCTCCCCAAGCACACGCTCATCATCTTCCCCATGGCGACCACCACAATCTCGTAATACGACGACAGCCCCTTGGCGAAATCCTCCGTCGTGTTGACGGTGCTCCATT
TGGTCCCCTCCACAATCTGGTCGAACACCGGCGGCTCCTTCCCGCCGTCGAACCCGCCATAGTAATACGTCGTTCTCACGAGGTATTTCCCGCCCTTCACTACGGGAATC
GCGTAGCAATATTTCCGGGCCGACTTGTCGGGAAAATAGCGCAATGTGGACAGTATCGGCACCAGGTTTGGATCGTCGAGCTTCGTGGTGTTCCCGACGGAGGTGAACCC
CTCGTCGGTGATGTACTGCAGACTGCCGACGGTTACCTTGGCGGTCGATTCGCTGGCCCCACAATTGAGAAGATAGCCTGCATTTTGGTGGCGTTATGGCGGCGGACGGC
GGCTAAGGGGAGAAATAGGAGAAGAAAAACACCAGCTGGCGGTGTCACAAGAAAAGAAAGAGGGGGCGTAACGGCGGTTGGCAGTGGCGGCGGTGGTTGCAGACGCCGCC
GCCAGAGAGGAACTTGGGTTGAATGTTGTCAACGAGTTGTCGGGGAACGGGAAATTGAGTTTGGCTCGTGGGCCGCGGAAATTGATGGCGGCTTCATCGTAGGCACGGGC
GGCGTCCTCCGCCGTGTTGAAAGTGCCGAGCCAGACCCGGGTCGCCCTCTTCGGGTCACGGATTTCGGCGGCCCATTTCCCCCAAGGGCGCTGCCTCACGCCTCTGTAAT
TCTTCTTCAACCGCTTATTCCTGCGGCCGCCGCCCTTCTTCTGGTCTTCCGGCATGAAGAAGTTGCAACCCAGGCAGCCTTTAATCCTGCACACTTGACATGTGTCGAAA
TCGGAAGGAGGGAAAACGGAGCCGTTGGTGGGAGTGGAGAAAGACGAAGCCGAAGCACATTCCGAGATGGGCGAGAGGAGGTGAAGAAAATGGTCATGGCGAAATTCGAG
TCCGGAGGAGGCGGCGCCGGAAACTACTTGGGTGAGGGCGTCGACGATGACGGAAACCTCCTGCTCCTCGGAGATGCGGCGGAAGGGGTGGCCGAGGAAGGCGGGGGGGT
CGGAGGACATTTGCATGCCGGTGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGAGAGAGAAGGGGAGGACAGCTGGAACACCGCCGAGACACGGGTTGAAGAGGGGGG
CACGTGACGAATAG
mRNA sequence
ATGGTGGAATCGACGCTCACCTATGAGGAATGCCGACGCCAGAGGTTGGAGGAGAATAAGAAGAGGATGGAAGAGCTTAATCTGAACAAGCTTGCCGATGCGCTAAAAGC
TTCCAGCCCTAAGTCCTCTCCGACGAAACAGCTAAAGCGTCCTCGCCAGCCACTCGATATCACGTCCGTCCGTGTGAGAAGGTCCAGCCGTTTTGCGGATAAGCCCCCTC
CGAACTATAAGGAGGTCCCCATTGAACCACTTCCAGGTGTCAGAAGGATCTATCAACGGAGAGATTTGTTGAATCGAGTTTATGCTTCACACGAAGAAAGACGATATGCT
ATTGACAGAGCAGAAGAACTTCAGTCTAGCCTGGAATCTAGGTACCCAAGTTTCGTGAAGCCCATGCTTCAGTCACATGTCACCGGGGGATTTTGGCTGGGTCTTCCAGT
TCAATTTTGCAAGACACACCTTCCCCATCATGATGAAATGGTTACTCTGGTGGATGAGGATGCGAATGAGTTCCAAACAAAATACCTTGCGGAGAAAACGGGTCTTAGCG
GTGGTTGGAGAGGGTTTTCACTTGATCACCAGTTAGTAGATGGGGATGCTTTGGTGTTTCAGTTAACTAAGCCAACTGAATTCAAGGTATATATCATCAGGGCATATAAT
TCCGAAGACAAAGCAGACACCAACGAGGATTCCGATGTCTCTCAACGGGAAAGTAGTGGCAAAAGAATTACTAGATCAGCAAGTAAAGGCCTTTTCAGATCAGAAATGGC
GTCTCTCATGACGGTCCGGCGTGCTCGAAGCCCCATACTCATCTCCTCATTCTTCAAGGTACGGTCTCCACTCCCTTCTCGTTTCACGTTCTCTTGTGGAAACCAGACAG
AGACTCTGATCAAAGCCCTAAGCACCTCTGCGGTCCCTAATGATTACTCAAATTTTCCTCCTCCACCACAACAACATCCTTCGTCTGATCCTAGAACTCTTCAGGGCCGG
GAAACGCCTGGCCAGTGGGGCACGCCAAGCCAGGTTCATCATCGAGTTGGAAACTTTAATAACCAGTCGTTCTCGGAGTTTCAGAATCGCGACTATGTTCAACAGGGAAG
CCCTAGTAATCAATTGAATTATCAGAATCAGAACCAAAGCTCTCATCCGAATCCTGGATTTGCCAGGCAGGGTCAGAGCTATACTCAAGCCGCAACAAAGAGGCCCAAAC
CAATGGAACAATCAAATCAGGGATACCCACAATTTGGAAAGCCTCCGCAGCGGAACCCACAAGTTGAGAATTCTAATCAGCCGAATAATCAGGTTGGGATTCAAGGACAC
GGTGCTCAAAATCAAGCATCAAATGCCCTTGTATCTCCTATCGATGAACTGCGGCGCTTTTGTGGAGAGGGAAAGATTAAAGAAGCTGTTGAATTGTTGAAAGAAGGTGT
TAAAGCTGATGCTGATTGTTTCCATGTGTTGTTTGAACTATGCGGGAAATCAAAGTCATTTGACAATGCAAAAATAGTTCATGATTACTTCTTACAGTCAACTTATAGAA
GTGATCTGCAATTGAATAATAAAGTGCTTGAGATGTATGGAAAATGTGGAAGCATGAGTGATGCACGGAGAGTGTTTGACCATATGCCTGATAGGAATATTGATTCTTGG
CATTTGATGATAAAAGGATATGCGGATAATGGATTTGGTGATGAGGGGCTAGAGTTATTTGAGAATATGAAGAAGCTGGGATTGCAACCCAATTCACGAACTTTCCTTTT
TATAATGTCAGCTTGTGCTAGTGCGAGTGCTGTAGAAGAAGGATTTATGTACTTTGAATCGATGAAAAATGATTATCATATCACCCCAGACATGGATCATTATTTAGCGC
TTTTAGGTGTTCTTGGAGAACCTGGACACATCAACGAGGCTTTCGAGTATGTTGAAAAACTGCCCATGGAACCCACAGTTGAAATCTGGGAGACTTTGAAAAACTATGCT
AGAATTCATGGAGATGTTGATCTTGAGGACTACGCCGAGGAGCTAATTGTTGCTCTGGACCCGACAAAAGCTGCTTCTAATAAGATAACCACACCACCTCCCAAAAAACG
GTCTGCCATTAGCATGCTTGATGGGAAGAACAGGATTGGTGAGTTCAGAAATCCGACTCTCTACAAAGATGATGAGAAGCTAAAGGCTTTGAAGGCAATGAAAGAACAAG
GTTATGTGCCAGATACTAGATATGTACTTCATGATATCGATCAGGAGGCCAAAGAGCAGGCATTGTTGTATCATAGTGAACGATTGGCTATTGCATATGGCCTGATCAGT
ACCCCAGCACGGACACCTCTTAGGATCATTAAGAACTTACGGATCTGTGGTGACTGTCACAATGCGATTAAGATCATGTCTAGGATTGTTGGGAGAGAGTTGATTGTAAG
GGACAACAAACGGTTCCATCATTTTAAGGATGGTAAATGTTCTTGTGGGGATTACTGTCCTCAAGGCTGCAGCTCTCCCCCAACCCCTCAACACCCTCATCACCTCCTCC
GCACTTATCTTCTTATCGCCGTTCAAATCGAACGTCCTGAAAGCAAACTCCACGTCCCTCGCCTGAACTCCCCCGCCGTTCCGGTAAACCTCCATGAACTCGCTCAGGTT
TATGTACCCATCGCCGTCGGCGTCCACCGCCCTGAATATCTTCTGCACCTCTTCCATAGAGTTCCCTCTTCCCAGCGCCTTCAGAATCCCCCTGTACTCGTGCTTCGAGA
TCCTCCCGTCTCTGTTCGAGTCGAACTTGTTGAAGATCTGCTTTATCTCCTCCGCGCTCGGCTGCAGCGCCCTTCTCAGGCCGGAGCTCTGCCGGTCCTTGAACGAAAAC
AGCCGGGAGGGCTCCCGGAGGAACTTCTTCTTGGAGACGTTGTATTGGAAGTCCAGGAGGTTCAAGTTCGTCGGCATTCTTTCTGTTGGGTTGCTCTGTGCTTCGAGAAC
TTTCGACTTCAGATCGTTATTCTGGAGAAATCTGTCGACACAAAATCATCAATCGAAACCATTAGATCATCTCTAAAAACTATAAAGAATGAAGATCATGTATCTGTAAT
AATCAGAAGTATGGTAGCTACCGTGACACCACTGAGTTCTTTCATCTACTCAGCTAATCCTTTTACGTCGTTATATCATGCCTTAATCAGCAAAACAGAGTTAGAAAACA
GAGCCCGAGAGTTTATTGTTACCGAGCCAACTGCAAAAGCAAATGCAGCCATTCAAGAAGTGGGTTTCAAACACACCATTAGCAGTGACATTCAGGTTTGTAAAGAAATT
CTTGCCGTTTACTGCAACATTAAAGACCCTCCAGCTAAACGGACTCGGAGAACGGTTGTCCTGAAACCTTCGTTGGCGGGAGGTTCCAGAAGTTTGAGGGAGTTATGTTG
GCATGGCATGTTACCAACGGGTTTTCGTCCACGAACGGGTGCCACAGCCGAAAATCTTACAAAAGAAGTTTGCCTACCCAATGACGCTTTCGTCGTGTCCAAAGCTGGTT
CTGGCAACCGTGCTCAGCGCATAGGTCTTGAAATCCGTCGAGTTATATACAGAATCCTCCAAAAATTCCAGCTCCAGAGCCGAAATGAAGGGGCTCGAAACCGTGTGCTG
GTTCCTCCCCAAGCACACGCTCATCATCTTCCCCATGGCGACCACCACAATCTCGTAATACGACGACAGCCCCTTGGCGAAATCCTCCGTCGTGTTGACGGTGCTCCATT
TGGTCCCCTCCACAATCTGGTCGAACACCGGCGGCTCCTTCCCGCCGTCGAACCCGCCATAGTAATACGTCGTTCTCACGAGGTATTTCCCGCCCTTCACTACGGGAATC
GCGTAGCAATATTTCCGGGCCGACTTGTCGGGAAAATAGCGCAATGTGGACAGTATCGGCACCAGGTTTGGATCGTCGAGCTTCGTGGTGTTCCCGACGGAGGTGAACCC
CTCGTCGGTGATGTACTGCAGACTGCCGACGGTTACCTTGGCGGTCGATTCGCTGGCCCCACAATTGAGAAGATAGCCTGCATTTTGGTGGCGTTATGGCGGCGGACGGC
GGCTAAGGGGAGAAATAGGAGAAGAAAAACACCAGCTGGCGGTGTCACAAGAAAAGAAAGAGGGGGCGTAACGGCGGTTGGCAGTGGCGGCGGTGGTTGCAGACGCCGCC
GCCAGAGAGGAACTTGGGTTGAATGTTGTCAACGAGTTGTCGGGGAACGGGAAATTGAGTTTGGCTCGTGGGCCGCGGAAATTGATGGCGGCTTCATCGTAGGCACGGGC
GGCGTCCTCCGCCGTGTTGAAAGTGCCGAGCCAGACCCGGGTCGCCCTCTTCGGGTCACGGATTTCGGCGGCCCATTTCCCCCAAGGGCGCTGCCTCACGCCTCTGTAAT
TCTTCTTCAACCGCTTATTCCTGCGGCCGCCGCCCTTCTTCTGGTCTTCCGGCATGAAGAAGTTGCAACCCAGGCAGCCTTTAATCCTGCACACTTGACATGTGTCGAAA
TCGGAAGGAGGGAAAACGGAGCCGTTGGTGGGAGTGGAGAAAGACGAAGCCGAAGCACATTCCGAGATGGGCGAGAGGAGGTGAAGAAAATGGTCATGGCGAAATTCGAG
TCCGGAGGAGGCGGCGCCGGAAACTACTTGGGTGAGGGCGTCGACGATGACGGAAACCTCCTGCTCCTCGGAGATGCGGCGGAAGGGGTGGCCGAGGAAGGCGGGGGGGT
CGGAGGACATTTGCATGCCGGTGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGAGAGAGAAGGGGAGGACAGCTGGAACACCGCCGAGACACGGGTTGAAGAGGGGGG
CACGTGACGAATAG
Protein sequence
MVESTLTYEECRRQRLEENKKRMEELNLNKLADALKASSPKSSPTKQLKRPRQPLDITSVRVRRSSRFADKPPPNYKEVPIEPLPGVRRIYQRRDLLNRVYASHEERRYA
IDRAEELQSSLESRYPSFVKPMLQSHVTGGFWLGLPVQFCKTHLPHHDEMVTLVDEDANEFQTKYLAEKTGLSGGWRGFSLDHQLVDGDALVFQLTKPTEFKVYIIRAYN
SEDKADTNEDSDVSQRESSGKRITRSASKGLFRSEMASLMTVRRARSPILISSFFKVRSPLPSRFTFSCGNQTETLIKALSTSAVPNDYSNFPPPPQQHPSSDPRTLQGR
ETPGQWGTPSQVHHRVGNFNNQSFSEFQNRDYVQQGSPSNQLNYQNQNQSSHPNPGFARQGQSYTQAATKRPKPMEQSNQGYPQFGKPPQRNPQVENSNQPNNQVGIQGH
GAQNQASNALVSPIDELRRFCGEGKIKEAVELLKEGVKADADCFHVLFELCGKSKSFDNAKIVHDYFLQSTYRSDLQLNNKVLEMYGKCGSMSDARRVFDHMPDRNIDSW
HLMIKGYADNGFGDEGLELFENMKKLGLQPNSRTFLFIMSACASASAVEEGFMYFESMKNDYHITPDMDHYLALLGVLGEPGHINEAFEYVEKLPMEPTVEIWETLKNYA
RIHGDVDLEDYAEELIVALDPTKAASNKITTPPPKKRSAISMLDGKNRIGEFRNPTLYKDDEKLKALKAMKEQGYVPDTRYVLHDIDQEAKEQALLYHSERLAIAYGLIS
TPARTPLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDYCPQGCSSPPTPQHPHHLLRTYLLIAVQIERPESKLHVPRLNSPAVPVNLHELAQV
YVPIAVGVHRPEYLLHLFHRVPSSQRLQNPPVLVLRDPPVSVRVELVEDLLYLLRARLQRPSQAGALPVLERKQPGGLPEELLLGDVVLEVQEVQVRRHSFCWVALCFEN
FRLQIVILEKSVDTKSSIETIRSSLKTIKNEDHVSVIIRSMVATVTPLSSFIYSANPFTSLYHALISKTELENRAREFIVTEPTAKANAAIQEVGFKHTISSDIQVCKEI
LAVYCNIKDPPAKRTRRTVVLKPSLAGGSRSLRELCWHGMLPTGFRPRTGATAENLTKEVCLPNDAFVVSKAGSGNRAQRIGLEIRRVIYRILQKFQLQSRNEGARNRVL
VPPQAHAHHLPHGDHHNLVIRRQPLGEILRRVDGAPFGPLHNLVEHRRLLPAVEPAIVIRRSHEVFPALHYGNRVAIFPGRLVGKIAQCGQYRHQVWIVELRGVPDGGEP
LVGDVLQTADGYLGGRFAGPTIEKIACILVALWRRTAAKGRNRRRKTPAGGVTRKERGGVTAVGSGGGGCRRRRQRGTWVECCQRVVGEREIEFGSWAAEIDGGFIVGTG
GVLRRVESAEPDPGRPLRVTDFGGPFPPRALPHASVILLQPLIPAAAALLLVFRHEEVATQAAFNPAHLTCVEIGRRENGAVGGSGERRSRSTFRDGREEVKKMVMAKFE
SGGGGAGNYLGEGVDDDGNLLLLGDAAEGVAEEGGGVGGHLHAGGERERERERERREKGRTAGTPPRHGLKRGARDE
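
The relationship between the CDS and protein records above can be checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame; the names `CODON_TABLE` and `translate` are illustrative and do not come from the database. Only the first 33 nucleotides of the CDS are used as input here.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon.

    Line breaks are removed first; codons containing characters other
    than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 33 nucleotides of the CDS record
cds_start = "ATGGTGGAATCGACGCTCACCTATGAGGAATGC"
print(translate(cds_start))
```

Passing the full CDS (all lines concatenated) to `translate` should reproduce the protein record if the two are consistent; a mismatch would point to a frame or annotation problem rather than to the sketch's logic, provided the standard-code assumption holds.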