; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G011310 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G011310
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr09:18185013..18206441
RNA-Seq ExpressionLsi09G011310
SyntenyLsi09G011310
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR009060 - UBA-like superfamily
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR013087 - Zinc finger C2H2-type
IPR015940 - Ubiquitin-associated domain
IPR018997 - PUB domain
IPR032867 - DYW domain
IPR036339 - PUB-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAG7909114.1 unnamed protein product [Brassica rapa]0.0e+0058.82Show/hide
Query:  MAGLSLKCGDCGALLRSVEEAQQHAELTSHSNFSESTEAVLNLVCTACGKPCRSKTESDLHTKRTGHTEFADKTLEAAKPISLEA----PKVDAESEDGG
        M G+SLKCGDCGALL+SVEEAQ+HAE+TSHSNF+ESTEAVLNLVCTAC KPCRSKTESDLHTKRTGHTEF DKT+E  KPISLEA    P ++ ++ DG 
Subjt:  MAGLSLKCGDCGALLRSVEEAQQHAELTSHSNFSESTEAVLNLVCTACGKPCRSKTESDLHTKRTGHTEFADKTLEAAKPISLEA----PKVDAESEDGG

Query:  DASASKSEEMVVPEVNKNILEELESMGFPTARATRALFYSGNASLEAAVNWVVEHENDPEIDQMPLVQVPKDTNVEAPKPSLTPEQLKAKQQELRERARK
          S   +EEMVVPEV++ ILEELE MGFP ARATRAL YSGNASLEAAVNWVVEHEND ++D++P  +VP ++    PKP+LTPE++K K QEL+ERARK
Subjt:  DASASKSEEMVVPEVNKNILEELESMGFPTARATRALFYSGNASLEAAVNWVVEHENDPEIDQMPLVQVPKDTNVEAPKPSLTPEQLKAKQQELRERARK

Query:  KKEEEEKIAEREREKERIRIGKELLEAKRIEEENERKRILALRKAEKEEEKRARDKIRQKLEEDKAERRRRLGLPPEDPSTAKPPAPVVEEKKISLPVRP
        KKEEEEK  EREREKERIRIGKELLEAKR+EE+NERKRI+ LRKAEKEEE+RAR+KIRQK+EEDKAERRR+LGLP EDP+ AKP  PVVEEKK SLP+RP
Subjt:  KKEEEEKIAEREREKERIRIGKELLEAKRIEEENERKRILALRKAEKEEEKRARDKIRQKLEEDKAERRRRLGLPPEDPSTAKPPAPVVEEKKISLPVRP

Query:  ASKAEQMRECLRSLKSNHKEDDAKVKRAFQTLLTYVGNVAKNPDEEKFRKIRLSNQTFQDRVGALRGGIEFLELCGFEKIEGGEFLFLPRNKVDRAVLNS
        A+K EQMRECLRSLK  HKEDDAKVKRAFQTLLTY+GNVAKNPDEEKFRKIRL+NQTFQDRVG+LRGGIEF+ELCGFEK+EGGEFLFLPR+K+D A++NS
Subjt:  ASKAEQMRECLRSLKSNHKEDDAKVKRAFQTLLTYVGNVAKNPDEEKFRKIRLSNQTFQDRVGALRGGIEFLELCGFEKIEGGEFLFLPRNKVDRAVLNS

Query:  AGSELDSAIKNPFFGPRLAFF--------SSTSSSSSP---QISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVS
        AG+EL+SAI NPFFG  +           SS +  S P   Q S  E+HFI LIH+   T +LR+++ Q+ R   F +SRV +Q +S  S L S DY +S
Subjt:  AGSELDSAIKNPFFGPRLAFF--------SSTSSSSSP---QISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVS

Query:  IFQRFELKNSFLFNALIRGLAENSRFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSAL
        IF+  + KN F FNALIRGL EN+RFE SI HFILML+  + PDRLTFPFVLKS + L    +G+ALH   +K  ++ DSFVRVSLVDMY K + L  A 
Subjt:  IFQRFELKNSFLFNALIRGLAENSRFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSAL

Query:  KVFDESPASVKTGSVLIWNVLINGYCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETF
        ++FDESP  +K  +VL+WNVLINGYCR  D+  A  LF SMP++++GSW++LI G++  G+L RAK+ FE M  KNVVSWTT++NGFSQNGD E A+  +
Subjt:  KVFDESPASVKTGSVLIWNVLINGYCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETF

Query:  FCMLEEGARPNDYTIVSALSACAKVGALD-----------------------------------------------------------------------
        F M+E+G +PN+YT+ + LSAC+K GAL+                                                                       
Subjt:  FCMLEEGARPNDYTIVSALSACAKVGALD-----------------------------------------------------------------------

Query:  ------AGTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMA
              +G KPD VVFLAVLTAC +SG+V+ G+ FFDSMR DY IEP++KHY +VVD+LGRAG+LNEA + I  MPIKPD   W AL+ AC+ H +    
Subjt:  ------AGTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMA

Query:  ELASKKLLQLEPKHPGSYVFLSNAYAAV-GRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIEC
        E+ S+ L++++P+  GSY+FL   YAA  G+ +D E+ R+ ++    +   GW++IEVD +L++FVAGD++H +A EI  KL EI + + E+GY    + 
Subjt:  ELASKKLLQLEPKHPGSYVFLSNAYAAV-GRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIEC

Query:  VLHNIEEEEKEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKY
         +H+IEEEEKE   G HSEKLALA GL+ T PGT ++IVKNLR+C DCHS MKY
Subjt:  VLHNIEEEEKEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKY

KAG2324168.1 hypothetical protein Bca52824_006896 [Brassica carinata]0.0e+0059.06Show/hide
Query:  MAGLSLKCGDCGALLRSVEEAQQHAELTSHSNFSESTEAVLNLVCTACGKPCRSKTESDLHTKRTGHTEFADKTLEAAKPISLEAPK--VDAESEDGGDA
        MAG+SLKCGDCGALL+SVEEAQ+HAE+TSHSNF+ESTEAVLNLVCTAC KPCRSKTESDLHTKRTGHTEF DKT+E  KPISLEAPK  ++ ++ DG   
Subjt:  MAGLSLKCGDCGALLRSVEEAQQHAELTSHSNFSESTEAVLNLVCTACGKPCRSKTESDLHTKRTGHTEFADKTLEAAKPISLEAPK--VDAESEDGGDA

Query:  SASKSEEMVVPEVNKNILEELESMGFPTARATRALFYSGNASLEAAVNWVVEHENDPEIDQMPLVQVPKDTNVEAPKPSLTPEQLKAKQQELRERARKKK
        S   +EEMVVPEV+KNILEELE MGFP ARATRAL YSGNASLEAAVNWVVEHENDP++D++P  +VP ++NV  PKP+LTPE++K K QELRERARKKK
Subjt:  SASKSEEMVVPEVNKNILEELESMGFPTARATRALFYSGNASLEAAVNWVVEHENDPEIDQMPLVQVPKDTNVEAPKPSLTPEQLKAKQQELRERARKKK

Query:  EEEEKIAEREREKERIRIGKELLEAKRIEEENERKRILALRKAEKEEEKRARDKIRQKLEEDKAERRRRLGLPPEDPSTAKPPAPVVEEKKISLPVRPAS
        EEEEK  EREREKERIRIGKELLEAKRIEE+NERKRI+ LRKAEKEEE+RAR+KIRQK+EEDKAERRR+LGLP EDP+ AKP  PVVEEKK SLP+RPA+
Subjt:  EEEEKIAEREREKERIRIGKELLEAKRIEEENERKRILALRKAEKEEEKRARDKIRQKLEEDKAERRRRLGLPPEDPSTAKPPAPVVEEKKISLPVRPAS

Query:  KAEQMRECLRSLKSNHKEDDAKVKRAFQTLLTYVGNVAKNPDEEKFRKIRLSNQTFQDRVGALRGGIEFLELCGFEKIEGGEFLFLPRNKVDRAVLNSAG
        K EQMRECLRSLK  HKEDDAKVKRAFQTLLTY+GNVAKNPDEEKFRKIRL+NQTFQDRVG+LRGGIEF+ELCGFEK+EGGEFLFLPR+K+D A++NSAG
Subjt:  KAEQMRECLRSLKSNHKEDDAKVKRAFQTLLTYVGNVAKNPDEEKFRKIRLSNQTFQDRVGALRGGIEFLELCGFEKIEGGEFLFLPRNKVDRAVLNSAG

Query:  SELDSAIKNPFFG---------PRLAF-----FSSTSSSSSPQISSL-----ETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNS
        +EL+SAI NPFFG          R AF     F +TS ++    + L     E+HFI LIH+   T  LR+++ Q+ R N+  SSRVV+Q +S  S L S
Subjt:  SELDSAIKNPFFG---------PRLAF-----FSSTSSSSSPQISSL-----ETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNS

Query:  VDYAVSIFQRFELKNSFLFNALIRGLAENSRFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVE
         DY++SIF+    KN F+FNALIRGL EN+RFE S+ HFILML++ + PDRLTFPFVLKS + L    +G+ALH   +K  ++ DSFVRVSL+DMY K +
Subjt:  VDYAVSIFQRFELKNSFLFNALIRGLAENSRFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVE

Query:  DLGSALKVFDESPASVKTGSVLIWNVLINGYCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPE
         L  A +VFDESP  +K  S+L+WNVLINGYCR  D+  A  LF SMP++++GSW++LI G++  G+L RAK+ FE MP KNVVSWTT++NGFSQNGD E
Subjt:  DLGSALKVFDESPASVKTGSVLIWNVLINGYCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPE

Query:  KALETFFCMLEEGARPNDYTIVSALSACAKVGALD-----------------------------------------------------------------
         A+  +F M+E+G +PN+YT+ + LSAC+K GAL+                                                                 
Subjt:  KALETFFCMLEEGARPNDYTIVSALSACAKVGALD-----------------------------------------------------------------

Query:  ------------AGTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTH
                    +G KPD VVFLAVLTAC +SG+V  G+ FFDSMR DY+IEP++KHY +VVD+LGRAG+LNEA + I  MPI PD   W AL+ AC+ H
Subjt:  ------------AGTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTH

Query:  KNIEMAELASKKLLQLEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGY
         +    E+ S+ LL+++P+  GSYVFL   +         E+ R+ ++    ++  GWS+IE+D +L++F+AGD++H +  EI  KL+EI + A E GY
Subjt:  KNIEMAELASKKLLQLEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGY

XP_004139010.1 pentatricopeptide repeat-containing protein At1g04840 [Cucumis sativus]7.2e-30679.56Show/hide
Query:  NSAGSELDSAIKNPFFGPRLAFFSSTSSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKN
        N +GS +   + +  F PR+AFFSS  SSSSP IS LETHFIDLIHASNST+ LRQIHGQLYRCN+FSSSRVVTQFISSCSSLNSVDYA+SIFQRFELKN
Subjt:  NSAGSELDSAIKNPFFGPRLAFFSSTSSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKN

Query:  SFLFNALIRGLAENSRFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPAS
        S+LFNALIRGLAENSRFESSIS F+LMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVE+LGSALKVFDESP S
Subjt:  SFLFNALIRGLAENSRFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPAS

Query:  VKTGSVLIWNVLINGYCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGAR
        VK GSVLIWNVLI+GYCR+GDL+KA +LF+SMPKKDTGSWNSLINGFM+ GD+GRAKELF KMP KNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGAR
Subjt:  VKTGSVLIWNVLINGYCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGAR

Query:  PNDYTIVSALSACAKVGALDA-----------------------------------------------------------------------------GT
        PNDYTIVSALSACAK+GALDA                                                                             GT
Subjt:  PNDYTIVSALSACAKVGALDA-----------------------------------------------------------------------------GT

Query:  KPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQ
        KPD VVFLAVL ACSHSGQVN+GL+FFD+MR  Y IEPSMKHYTLVVDMLGRAGRL+EALKFIR MPI PDFVVWGALFCACRTHKN+EMAELASKKLLQ
Subjt:  KPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQ

Query:  LEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEEK
        LEPKHPGSYVFLSNAYA+VGRW+DAERVRVSMRD GA KDPGWSFIEVD KLHRFVAGDNTHNRAVEIYSKLDEISA AREKGYTKEIECVLHNIEEEEK
Subjt:  LEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEEK

Query:  EEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW
        EEALGYHSEKLALAFG+VST PGTTVRIVKNLRVCVDCHSFMKYASKMS+REIILRDMKRFHHFNDGVCSCGDYW
Subjt:  EEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW

XP_022964045.1 pentatricopeptide repeat-containing protein At1g04840 [Cucurbita moschata]7.5e-30380.45Show/hide
Query:  FGPRLAFFSSTSSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENS
        F PR+AFF+STSSSSSPQISSLETHFIDLIHAS+ST+ LRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAV IFQRFELKNSFLFNALIRGLAENS
Subjt:  FGPRLAFFSSTSSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENS

Query:  RFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLING
        RFESSIS+F+ ML+WKISPDRLTFPFVLKSAAALSNGGVG ALH GILKFGLEFDSFVRVSLVDMYVKV+DLGSALKVFDESP  +K G+VLIWNVLI+G
Subjt:  RFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLING

Query:  YCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAK
        YCRVG+L+KA +LFE+MP+KDTGSWNSLINGFMRKG LG A ELFEKMP KNVVSWTTMVNGFSQNGDPEKAL+ FFCMLEEGARPNDYTIVSALSACAK
Subjt:  YCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAK

Query:  VGALD-----------------------------------------------------------------------------AGTKPDGVVFLAVLTACS
        +GALD                                                                             AGTKPDGVVFLAVLTACS
Subjt:  VGALD-----------------------------------------------------------------------------AGTKPDGVVFLAVLTACS

Query:  HSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGSYVFLSNA
        HSGQV+DGLEFFDSMR DY IEPSMKHYTL+VDMLGRAGRL+EALKFIRDMPI PDFVVWGALFCACR HKNI+MAELAS+KLL+LEPKHPGSYVFLSNA
Subjt:  HSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGSYVFLSNA

Query:  YAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEEKEEALGYHSEKLALAF
        YAAVGRWEDAERVRVSMRDRGAQKDPGWSF+EVDDKLHRFVAGDNTHNRA EIYSKLDEI+AGAREKGYTK IECVLHNIEEEEKEEALG+HSEKLALAF
Subjt:  YAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEEKEEALGYHSEKLALAF

Query:  GLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW
        GLVST P TT+RIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHF+DGVCSCGDYW
Subjt:  GLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW

XP_038876300.1 pentatricopeptide repeat-containing protein At1g04840 [Benincasa hispida]3.1e-30982.58Show/hide
Query:  FGPRLAFFSSTSSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENS
        + PRLAFFSS SSSSSP ISSLETHFIDLIHASNST+NL QIHGQLYRCNIFSSSRVVTQFISSCS LNSVDYAVSIFQRF+LKNSFLFNALIRGLAEN 
Subjt:  FGPRLAFFSSTSSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENS

Query:  RFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLING
        RFESSIS+F+LMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCG+LKFGL+FDSFVRVSLVDMYVKVE+LGSALKVFDESP SVK GSVLIWNVLING
Subjt:  RFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLING

Query:  YCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAK
        YCRVGDL+KA +LF+SMPKKDTGSWNSLINGFMRKGDLG+AKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFF MLEEG RPN YTIVSALSACAK
Subjt:  YCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAK

Query:  VGALDA-----------------------------------------------------------------------------GTKPDGVVFLAVLTACS
        VGALDA                                                                             GTKPDGVVFLAVLTACS
Subjt:  VGALDA-----------------------------------------------------------------------------GTKPDGVVFLAVLTACS

Query:  HSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGSYVFLSNA
        HSGQVNDGL+FFDSMRHDY IEPSMKHYTLVVDMLGRAGRLNEALKFI DMPI PDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGSYVFLSNA
Subjt:  HSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGSYVFLSNA

Query:  YAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEEKEEALGYHSEKLALAF
        YAAVGRWEDAERVRVSM++RGA+KDPGWSFIEVDDKLHRFV+GDNTHNRAVEIYSKLD+IS GAREKGYTKEIECVLHNIEEEEKEEALGYHSEKLALAF
Subjt:  YAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEEKEEALGYHSEKLALAF

Query:  GLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW
        GLVST PGTT+RIVKNLRVCVDCHSFMKYASKMS+REIILRDMKRFHHF DGVCSCGDYW
Subjt:  GLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW

TrEMBL top hitse value%identityAlignment
A0A0A0LI86 DYW_deaminase domain-containing protein3.5e-30679.56Show/hide
Query:  NSAGSELDSAIKNPFFGPRLAFFSSTSSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKN
        N +GS +   + +  F PR+AFFSS  SSSSP IS LETHFIDLIHASNST+ LRQIHGQLYRCN+FSSSRVVTQFISSCSSLNSVDYA+SIFQRFELKN
Subjt:  NSAGSELDSAIKNPFFGPRLAFFSSTSSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKN

Query:  SFLFNALIRGLAENSRFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPAS
        S+LFNALIRGLAENSRFESSIS F+LMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVE+LGSALKVFDESP S
Subjt:  SFLFNALIRGLAENSRFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPAS

Query:  VKTGSVLIWNVLINGYCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGAR
        VK GSVLIWNVLI+GYCR+GDL+KA +LF+SMPKKDTGSWNSLINGFM+ GD+GRAKELF KMP KNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGAR
Subjt:  VKTGSVLIWNVLINGYCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGAR

Query:  PNDYTIVSALSACAKVGALDA-----------------------------------------------------------------------------GT
        PNDYTIVSALSACAK+GALDA                                                                             GT
Subjt:  PNDYTIVSALSACAKVGALDA-----------------------------------------------------------------------------GT

Query:  KPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQ
        KPD VVFLAVL ACSHSGQVN+GL+FFD+MR  Y IEPSMKHYTLVVDMLGRAGRL+EALKFIR MPI PDFVVWGALFCACRTHKN+EMAELASKKLLQ
Subjt:  KPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQ

Query:  LEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEEK
        LEPKHPGSYVFLSNAYA+VGRW+DAERVRVSMRD GA KDPGWSFIEVD KLHRFVAGDNTHNRAVEIYSKLDEISA AREKGYTKEIECVLHNIEEEEK
Subjt:  LEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEEK

Query:  EEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW
        EEALGYHSEKLALAFG+VST PGTTVRIVKNLRVCVDCHSFMKYASKMS+REIILRDMKRFHHFNDGVCSCGDYW
Subjt:  EEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW

A0A3P6D288 UBA domain-containing protein0.0e+0058.82Show/hide
Query:  MAGLSLKCGDCGALLRSVEEAQQHAELTSHSNFSESTEAVLNLVCTACGKPCRSKTESDLHTKRTGHTEFADKTLEAAKPISLEA----PKVDAESEDGG
        M G+SLKCGDCGALL+SVEEAQ+HAE+TSHSNF+ESTEAVLNLVCTAC KPCRSKTESDLHTKRTGHTEF DKT+E  KPISLEA    P ++ ++ DG 
Subjt:  MAGLSLKCGDCGALLRSVEEAQQHAELTSHSNFSESTEAVLNLVCTACGKPCRSKTESDLHTKRTGHTEFADKTLEAAKPISLEA----PKVDAESEDGG

Query:  DASASKSEEMVVPEVNKNILEELESMGFPTARATRALFYSGNASLEAAVNWVVEHENDPEIDQMPLVQVPKDTNVEAPKPSLTPEQLKAKQQELRERARK
          S   +EEMVVPEV++ ILEELE MGFP ARATRAL YSGNASLEAAVNWVVEHEND ++D++P  +VP ++    PKP+LTPE++K K QEL+ERARK
Subjt:  DASASKSEEMVVPEVNKNILEELESMGFPTARATRALFYSGNASLEAAVNWVVEHENDPEIDQMPLVQVPKDTNVEAPKPSLTPEQLKAKQQELRERARK

Query:  KKEEEEKIAEREREKERIRIGKELLEAKRIEEENERKRILALRKAEKEEEKRARDKIRQKLEEDKAERRRRLGLPPEDPSTAKPPAPVVEEKKISLPVRP
        KKEEEEK  EREREKERIRIGKELLEAKR+EE+NERKRI+ LRKAEKEEE+RAR+KIRQK+EEDKAERRR+LGLP EDP+ AKP  PVVEEKK SLP+RP
Subjt:  KKEEEEKIAEREREKERIRIGKELLEAKRIEEENERKRILALRKAEKEEEKRARDKIRQKLEEDKAERRRRLGLPPEDPSTAKPPAPVVEEKKISLPVRP

Query:  ASKAEQMRECLRSLKSNHKEDDAKVKRAFQTLLTYVGNVAKNPDEEKFRKIRLSNQTFQDRVGALRGGIEFLELCGFEKIEGGEFLFLPRNKVDRAVLNS
        A+K EQMRECLRSLK  HKEDDAKVKRAFQTLLTY+GNVAKNPDEEKFRKIRL+NQTFQDRVG+LRGGIEF+ELCGFEK+EGGEFLFLPR+K+D A++NS
Subjt:  ASKAEQMRECLRSLKSNHKEDDAKVKRAFQTLLTYVGNVAKNPDEEKFRKIRLSNQTFQDRVGALRGGIEFLELCGFEKIEGGEFLFLPRNKVDRAVLNS

Query:  AGSELDSAIKNPFFGPRLAFF--------SSTSSSSSP---QISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVS
        AG+EL+SAI NPFFG  +           SS +  S P   Q S  E+HFI LIH+   T +LR+++ Q+ R   F +SRV +Q +S  S L S DY +S
Subjt:  AGSELDSAIKNPFFGPRLAFF--------SSTSSSSSP---QISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVS

Query:  IFQRFELKNSFLFNALIRGLAENSRFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSAL
        IF+  + KN F FNALIRGL EN+RFE SI HFILML+  + PDRLTFPFVLKS + L    +G+ALH   +K  ++ DSFVRVSLVDMY K + L  A 
Subjt:  IFQRFELKNSFLFNALIRGLAENSRFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSAL

Query:  KVFDESPASVKTGSVLIWNVLINGYCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETF
        ++FDESP  +K  +VL+WNVLINGYCR  D+  A  LF SMP++++GSW++LI G++  G+L RAK+ FE M  KNVVSWTT++NGFSQNGD E A+  +
Subjt:  KVFDESPASVKTGSVLIWNVLINGYCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETF

Query:  FCMLEEGARPNDYTIVSALSACAKVGALD-----------------------------------------------------------------------
        F M+E+G +PN+YT+ + LSAC+K GAL+                                                                       
Subjt:  FCMLEEGARPNDYTIVSALSACAKVGALD-----------------------------------------------------------------------

Query:  ------AGTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMA
              +G KPD VVFLAVLTAC +SG+V+ G+ FFDSMR DY IEP++KHY +VVD+LGRAG+LNEA + I  MPIKPD   W AL+ AC+ H +    
Subjt:  ------AGTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMA

Query:  ELASKKLLQLEPKHPGSYVFLSNAYAAV-GRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIEC
        E+ S+ L++++P+  GSY+FL   YAA  G+ +D E+ R+ ++    +   GW++IEVD +L++FVAGD++H +A EI  KL EI + + E+GY    + 
Subjt:  ELASKKLLQLEPKHPGSYVFLSNAYAAV-GRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIEC

Query:  VLHNIEEEEKEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKY
         +H+IEEEEKE   G HSEKLALA GL+ T PGT ++IVKNLR+C DCHS MKY
Subjt:  VLHNIEEEEKEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKY

A0A6J1HJP9 pentatricopeptide repeat-containing protein At1g048403.6e-30380.45Show/hide
Query:  FGPRLAFFSSTSSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENS
        F PR+AFF+STSSSSSPQISSLETHFIDLIHAS+ST+ LRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAV IFQRFELKNSFLFNALIRGLAENS
Subjt:  FGPRLAFFSSTSSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENS

Query:  RFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLING
        RFESSIS+F+ ML+WKISPDRLTFPFVLKSAAALSNGGVG ALH GILKFGLEFDSFVRVSLVDMYVKV+DLGSALKVFDESP  +K G+VLIWNVLI+G
Subjt:  RFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLING

Query:  YCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAK
        YCRVG+L+KA +LFE+MP+KDTGSWNSLINGFMRKG LG A ELFEKMP KNVVSWTTMVNGFSQNGDPEKAL+ FFCMLEEGARPNDYTIVSALSACAK
Subjt:  YCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAK

Query:  VGALD-----------------------------------------------------------------------------AGTKPDGVVFLAVLTACS
        +GALD                                                                             AGTKPDGVVFLAVLTACS
Subjt:  VGALD-----------------------------------------------------------------------------AGTKPDGVVFLAVLTACS

Query:  HSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGSYVFLSNA
        HSGQV+DGLEFFDSMR DY IEPSMKHYTL+VDMLGRAGRL+EALKFIRDMPI PDFVVWGALFCACR HKNI+MAELAS+KLL+LEPKHPGSYVFLSNA
Subjt:  HSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGSYVFLSNA

Query:  YAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEEKEEALGYHSEKLALAF
        YAAVGRWEDAERVRVSMRDRGAQKDPGWSF+EVDDKLHRFVAGDNTHNRA EIYSKLDEI+AGAREKGYTK IECVLHNIEEEEKEEALG+HSEKLALAF
Subjt:  YAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEEKEEALGYHSEKLALAF

Query:  GLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW
        GLVST P TT+RIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHF+DGVCSCGDYW
Subjt:  GLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW

A0A6J1KIT8 pentatricopeptide repeat-containing protein At1g048403.4e-30178.35Show/hide
Query:  VLNSAGSELDSAIKN--PFFGPRLAFFSSTSSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRF
        +LN  GS   + +KN    F PR+AFF+STSSSSSPQISSLET+FIDLIHAS+ST+ LRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAV IFQRF
Subjt:  VLNSAGSELDSAIKN--PFFGPRLAFFSSTSSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRF

Query:  ELKNSFLFNALIRGLAENSRFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDE
        ELKNSFLFNALIRGLAENSRFESSIS+F+ ML+WKISPDRLTFPFVLKSAAALSNGGVG ALH GI+KFGLEFDSFVRVSLVDMYVKV+DLGSALKVFDE
Subjt:  ELKNSFLFNALIRGLAENSRFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDE

Query:  SPASVKTGSVLIWNVLINGYCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLE
        SP  +K G+VLIWNVLI+GYCRVG+L+KA +LFE+MPKKDTGSWNSLINGFMRKG LG A ELFEKMP KNVVSWTTMVNGFSQNGDPEKAL+ FFCMLE
Subjt:  SPASVKTGSVLIWNVLINGYCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLE

Query:  EGARPNDYTIVSALSACAKVGALDA---------------------------------------------------------------------------
        EGA+PNDYTIVSALSACAK+GALDA                                                                           
Subjt:  EGARPNDYTIVSALSACAKVGALDA---------------------------------------------------------------------------

Query:  --GTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASK
          GTKPDGVVFLAVLTACSHSGQV+DGLEFFDSMR DY IEPSMKHYTL+VDMLGRAGRL+EALKFIRDMPI PDFVVWGALFCACR HKNI+MAELAS+
Subjt:  --GTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASK

Query:  KLLQLEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIE
        KLL+LEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSF+EVDDKLHRFVAGDNTHNRA EIYSKLDEI+A AREKGYTK IECVLHNIE
Subjt:  KLLQLEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIE

Query:  EEEKEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW
        EEEKEEALG+HSEKLALAFGL+ST P T +RIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHF+DGVCSCGDYW
Subjt:  EEEKEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW

M4DFT4 UBA domain-containing protein0.0e+0059.58Show/hide
Query:  MAGLSLKCGDCGALLRSVEEAQQHAELTSHSNFSESTEAVLNLVCTACGKPCRSKTESDLHTKRTGHTEFADKTLEAAKPISLEAPK---VDAESEDGGD
        MAG+SLKCGDCGALL+SVEEAQ+HAE+TSHSNF+ESTEAVLNLVC+AC KPCRSKTESDLHTKRTGHTEF DKT+E  KPISLEAPK   +++++ DG  
Subjt:  MAGLSLKCGDCGALLRSVEEAQQHAELTSHSNFSESTEAVLNLVCTACGKPCRSKTESDLHTKRTGHTEFADKTLEAAKPISLEAPK---VDAESEDGGD

Query:  ASASKSEEMVVPEVNKNILEELESMGFPTARATRALFYSGNASLEAAVNWVVEHENDPEIDQMPLVQVPKDTNVEAPKPSLTPEQLKAKQQELRERARKK
         S   +EEMVVPEV++ +LEELE MGFP ARATRAL YSGNASLEAAVNWVVEHENDP++D++P  +VP ++    PKP+LTPE++K K QEL+ERARKK
Subjt:  ASASKSEEMVVPEVNKNILEELESMGFPTARATRALFYSGNASLEAAVNWVVEHENDPEIDQMPLVQVPKDTNVEAPKPSLTPEQLKAKQQELRERARKK

Query:  KEEEEKIAEREREKERIRIGKELLEAKRIEEENERKRILALRKAEKEEEKRARDKIRQKLEEDKAERRRRLGLPPEDPSTAKPPAPVVEEKKISLPVRPA
        KEEEEK  EREREKERIRIGKELLEAKRIEE+NERKRI+ LRKAEKEEE+RAR+KIRQK+EEDKAERRR+LGLP EDP+ AKP  PVVEEKK SLP+RPA
Subjt:  KEEEEKIAEREREKERIRIGKELLEAKRIEEENERKRILALRKAEKEEEKRARDKIRQKLEEDKAERRRRLGLPPEDPSTAKPPAPVVEEKKISLPVRPA

Query:  SKAEQMRECLRSLKSNHKEDDAKVKRAFQTLLTYVGNVAKNPDEEKFRKIRLSNQTFQDRVGALRGGIEFLELCGFEKIEGGEFLFLPRNKVDRAVLNSA
        +K EQMRECLRSLK  HKEDDAKVKRAFQTLLTY+GNVAKNPDEEKFRKIRL+NQTFQDRVG+LRGGIEF+ELCGFEK+EGGEFLFLPR+K+D A++NSA
Subjt:  SKAEQMRECLRSLKSNHKEDDAKVKRAFQTLLTYVGNVAKNPDEEKFRKIRLSNQTFQDRVGALRGGIEFLELCGFEKIEGGEFLFLPRNKVDRAVLNSA

Query:  GSELDSAIKNPFFGPRL----AFFSSTSSS-------SSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSI
        G+EL+SAI NPFFG  +      F   SSS       ++ Q S  E+HFI LIH+   T +LR+++ Q+ R   F +SRV +Q +S  S L S DY +SI
Subjt:  GSELDSAIKNPFFGPRL----AFFSSTSSS-------SSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSI

Query:  FQRFELKNSFLFNALIRGLAENSRFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALK
        F+  + KN F FNALIRGL EN+RFE SI HFILML+  + PDRLTFPF LKS + L    +G+ALH   +K  ++ DSFVRVSLVDMY K + L  A +
Subjt:  FQRFELKNSFLFNALIRGLAENSRFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALK

Query:  VFDESPASVKTGSVLIWNVLINGYCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFF
        VFDESP  +K  SVL+WNVLINGYCR  DL  A  LF SMP++++GSW++LI G++  G+L RAK+ FE MP KNV+SWTT++NGFSQNG+ E A+  +F
Subjt:  VFDESPASVKTGSVLIWNVLINGYCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFF

Query:  CMLEEGA-RPNDYTIVSALSACAKVGALD-----------------------------------------------------------------------
         M+E+G  +PN+YT+ + LSAC+K GAL+                                                                       
Subjt:  CMLEEGA-RPNDYTIVSALSACAKVGALD-----------------------------------------------------------------------

Query:  ------AGTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMA
              +G KPD VVFLAVLTAC +SG+V+ G+ FFDSMR DY IEP++KHY +VVD+LGRAG+LNEA + I  MPI PD   W AL+ AC+ H +    
Subjt:  ------AGTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMA

Query:  ELASKKLLQLEPKHPGSYVFLSNAYAAV-GRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIEC
        E+ S+ L++++P+  GSY+FL   YAA  G+  D E+ R  +++RG     GWS+IEVD +L++FVAGDN+H +A EI  KL+EI + A E GY    + 
Subjt:  ELASKKLLQLEPKHPGSYVFLSNAYAAV-GRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIEC

Query:  VLHNIEEEEKEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKY
         +H+IEEEEKE   G HSEKLALA GL+ T PGT +RIVKNLR+C DCHS MKY
Subjt:  VLHNIEEEEKEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKY

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic9.7e-11233.29Show/hide
Query:  SSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQF--ISSCSSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENSRFESSISHF
        S+ + P  ++  +  I LI    S   L+Q HG + R   FS     ++   +++ SS  S++YA  +F      NSF +N LIR  A       SI  F
Subjt:  SSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQF--ISSCSSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENSRFESSISHF

Query:  ILML-KWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLING--------
        + M+ + +  P++ TFPF++K+AA +S+  +G++LH   +K  +  D FV  SL+  Y    DL SA KVF     ++K   V+ WN +ING        
Subjt:  ILML-KWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLING--------

Query:  --------------------------------------------------------------YCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDL
                                                                      Y + G +  A+ LF++M +KD  +W ++++G+    D 
Subjt:  --------------------------------------------------------------YCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDL

Query:  GRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFF-CMLEEGARPNDYTIVSALSACAKVGALDAG------------------------------
          A+E+   MP K++V+W  +++ + QNG P +AL  F    L++  + N  T+VS LSACA+VGAL+ G                              
Subjt:  GRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFF-CMLEEGARPNDYTIVSALSACAKVGALDAG------------------------------

Query:  -----------------------------------------------TKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGR
                                                        KP+GV F  V  ACSH+G V++    F  M  +Y I P  KHY  +VD+LGR
Subjt:  -----------------------------------------------TKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGR

Query:  AGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKL
        +G L +A+KFI  MPI P   VWGAL  AC+ H N+ +AE+A  +LL+LEP++ G++V LSN YA +G+WE+   +R  MR  G +K+PG S IE+D  +
Subjt:  AGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKL

Query:  HRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEE-KEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQR
        H F++GDN H  + ++Y KL E+    +  GY  EI  VL  IEEEE KE++L  HSEKLA+ +GL+ST+    +R++KNLRVC DCHS  K  S++  R
Subjt:  HRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEE-KEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQR

Query:  EIILRDMKRFHHFNDGVCSCGDYW
        EII+RD  RFHHF +G CSC D+W
Subjt:  EIILRDMKRFHHFNDGVCSCGDYW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic3.9e-11333.06Show/hide
Query:  FSSTSSSSSPQISSLETH-FIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSC---SSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENSRFE
        F    SSS P   S+  H  + L+H   +  +LR IH Q+ +  + +++  +++ I  C        + YA+S+F+  +  N  ++N + RG A +S   
Subjt:  FSSTSSSSSPQISSLETH-FIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSC---SSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENSRFE

Query:  SSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLINGYCR
        S++  ++ M+   + P+  TFPFVLKS A       G+ +H  +LK G + D +V  SL+ MYV+   L  A KVFD+SP       V+ +  LI GY  
Subjt:  SSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLINGYCR

Query:  VGDLIKARDLFESMPKKDTGSWNSLINGFMRKG-------------------------------------DLGR--------------------------
         G +  A+ LF+ +P KD  SWN++I+G+   G                                     +LGR                          
Subjt:  VGDLIKARDLFESMPKKDTGSWNSLINGFMRKG-------------------------------------DLGR--------------------------

Query:  -------AKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAKVGALD----------------------------
               A  LFE++P K+V+SW T++ G++     ++AL  F  ML  G  PND T++S L ACA +GA+D                            
Subjt:  -------AKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAKVGALD----------------------------

Query:  ---------------------------------------------------AGTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLV
                                                            G +PD + F+ +L+ACSHSG ++ G   F +M  DY + P ++HY  +
Subjt:  ---------------------------------------------------AGTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLV

Query:  VDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFI
        +D+LG +G   EA + I  M ++PD V+W +L  AC+ H N+E+ E  ++ L+++EP++PGSYV LSN YA+ GRW +  + R  + D+G +K PG S I
Subjt:  VDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFI

Query:  EVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEEKEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYAS
        E+D  +H F+ GD  H R  EIY  L+E+     + G+  +   VL  +EEE KE AL +HSEKLA+AFGL+ST PGT + IVKNLRVC +CH   K  S
Subjt:  EVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEEKEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYAS

Query:  KMSQREIILRDMKRFHHFNDGVCSCGDYW
        K+ +REII RD  RFHHF DGVCSC DYW
Subjt:  KMSQREIILRDMKRFHHFNDGVCSCGDYW

Q9LTV8 Pentatricopeptide repeat-containing protein At3g127709.7e-11232.35Show/hide
Query:  ETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENSRFESSISHFILMLKWKISPDRL
        ++ +  LI ++     L+QIH +L    +  S  ++T+ I + SS   + +A  +F        F +NA+IRG + N+ F+ ++  +  M   ++SPD  
Subjt:  ETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENSRFESSISHFILMLKWKISPDRL

Query:  TFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLINGYCRVGDLIKARDLFESMPKKDT
        TFP +LK+ + LS+  +GR +H  + + G + D FV+  L+ +Y K   LGSA  VF+  P   +T  ++ W  +++ Y + G+ ++A ++F  M K D 
Subjt:  TFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLINGYCRVGDLIKARDLFESMPKKDT

Query:  -GSWNSLIN--------------------------------------GFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEG
           W +L++                                       + + G +  AK LF+KM   N++ W  M++G+++NG   +A++ F  M+ + 
Subjt:  -GSWNSLIN--------------------------------------GFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEG

Query:  ARPNDYTIVSALSACAKVGALD-----------------------------------------------------------------------------A
         RP+  +I SA+SACA+VG+L+                                                                              
Subjt:  ARPNDYTIVSALSACAKVGALD-----------------------------------------------------------------------------A

Query:  GTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKL
        G  P+ V FL +L AC+HSG V +G  FF+ M  D+ I P  +HY  V+D+LGRAG L++A + I+ MP++P   VWGAL  AC+ H+++E+ E A+++L
Subjt:  GTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKL

Query:  LQLEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEE
          ++P + G YV LSN YAA   W+    VRV M+++G  KD G S++EV  +L  F  GD +H R  EI  +++ I +  +E G+    +  LH++ +E
Subjt:  LQLEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEE

Query:  EKEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW
        E EE L  HSE++A+A+GL+ST  GT +RI KNLR CV+CH+  K  SK+  REI++RD  RFHHF DGVCSCGDYW
Subjt:  EKEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW

Q9MAT2 Pentatricopeptide repeat-containing protein At1g048401.0e-17749.03Show/hide
Query:  SAIKNPFFGPRLAFFSSTSSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKNSFLFNALI
        S I  P   P   +F      +  Q S  E+HFI LIHA   T +LR +H Q+ R  +  SSRV  Q +S  S L S DY++SIF+  E +N F+ NALI
Subjt:  SAIKNPFFGPRLAFFSSTSSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKNSFLFNALI

Query:  RGLAENSRFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLI
        RGL EN+RFESS+ HFILML+  + PDRLTFPFVLKS + L    +GRALH   LK  ++ DSFVR+SLVDMY K   L  A +VF+ESP  +K  S+LI
Subjt:  RGLAENSRFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLI

Query:  WNVLINGYCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVS
        WNVLINGYCR  D+  A  LF SMP++++GSW++LI G++  G+L RAK+LFE MP KNVVSWTT++NGFSQ GD E A+ T+F MLE+G +PN+YTI +
Subjt:  WNVLINGYCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVS

Query:  ALSACAKVGALD-----------------------------------------------------------------------------AGTKPDGVVFL
         LSAC+K GAL                                                                              +G KPD VVFL
Subjt:  ALSACAKVGALD-----------------------------------------------------------------------------AGTKPDGVVFL

Query:  AVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGS
        AVLTAC +S +V+ GL FFDSMR DY+IEP++KHY LVVD+LGRAG+LNEA + + +MPI PD   W AL+ AC+ HK    AE  S+ LL+L+P+  GS
Subjt:  AVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGS

Query:  YVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEEKEEALGYHS
        Y+FL   +A+ G  +D E+ R+S++ R  ++  GWS+IE+D +L++F AGD +H    EI  KLDEI + A +KGY    +  +H+IEEEEKE   G HS
Subjt:  YVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEEKEEALGYHS

Query:  EKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW
        EKLAL  G + T PGTT+RI+KNLR+C DCHS MKY SK+SQR+I+LRD ++FHHF DG CSCGDYW
Subjt:  EKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW

Q9SR82 Putative pentatricopeptide repeat-containing protein At3g088201.2e-10932.8Show/hide
Query:  TSSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENSRFESSISHFI
        T  S++ ++  ++T    LI  + +  +L+QIH  L   ++   + +V   +          Y+  +F   +  N FL+N+LI G   N  F  ++  F+
Subjt:  TSSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENSRFESSISHFI

Query:  LMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLINGYCRVGDLIKA
         + K  +     TFP VLK+    S+  +G  LH  ++K G   D     SL+ +Y     L  A K+FDE P      SV+ W  L +GY   G   +A
Subjt:  LMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLINGYCRVGDLIKA

Query:  RDLFESMPKK----------------------DTGSW-----------------NSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEK
         DLF+ M +                       D+G W                  +L+N + + G + +A+ +F+ M  K++V+W+TM+ G++ N  P++
Subjt:  RDLFESMPKK----------------------DTGSW-----------------NSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEK

Query:  ALETFFCMLEEGARPNDYTIVSALSACAKVGALD------------------------------------------------------------------
         +E F  ML+E  +P+ ++IV  LS+CA +GALD                                                                  
Subjt:  ALETFFCMLEEGARPNDYTIVSALSACAKVGALD------------------------------------------------------------------

Query:  -----------AGTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHK
                    G  PDG  FL +L  C H+G + DGL FF+++   Y+++ +++HY  +VD+ GRAG L++A + I DMP++P+ +VWGAL   CR  K
Subjt:  -----------AGTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHK

Query:  NIEMAELASKKLLQLEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTK
        + ++AE   K+L+ LEP + G+YV LSN Y+  GRW++A  VR  M  +G +K PG+S+IE++ K+H F+A D +H  + +IY+KL+++    R  G+  
Subjt:  NIEMAELASKKLLQLEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTK

Query:  EIECVLHNIEEEEKEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW
          E V  ++EEEEKE  LGYHSEKLA+A GL+STD G  +R+VKNLRVC DCH  MK  SK+++REI++RD  RFH F +G CSC DYW
Subjt:  EIECVLHNIEEEEKEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW

Arabidopsis top hitse value%identityAlignment
AT1G04840.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.2e-17949.03Show/hide
Query:  SAIKNPFFGPRLAFFSSTSSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKNSFLFNALI
        S I  P   P   +F      +  Q S  E+HFI LIHA   T +LR +H Q+ R  +  SSRV  Q +S  S L S DY++SIF+  E +N F+ NALI
Subjt:  SAIKNPFFGPRLAFFSSTSSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKNSFLFNALI

Query:  RGLAENSRFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLI
        RGL EN+RFESS+ HFILML+  + PDRLTFPFVLKS + L    +GRALH   LK  ++ DSFVR+SLVDMY K   L  A +VF+ESP  +K  S+LI
Subjt:  RGLAENSRFESSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLI

Query:  WNVLINGYCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVS
        WNVLINGYCR  D+  A  LF SMP++++GSW++LI G++  G+L RAK+LFE MP KNVVSWTT++NGFSQ GD E A+ T+F MLE+G +PN+YTI +
Subjt:  WNVLINGYCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVS

Query:  ALSACAKVGALD-----------------------------------------------------------------------------AGTKPDGVVFL
         LSAC+K GAL                                                                              +G KPD VVFL
Subjt:  ALSACAKVGALD-----------------------------------------------------------------------------AGTKPDGVVFL

Query:  AVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGS
        AVLTAC +S +V+ GL FFDSMR DY+IEP++KHY LVVD+LGRAG+LNEA + + +MPI PD   W AL+ AC+ HK    AE  S+ LL+L+P+  GS
Subjt:  AVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGS

Query:  YVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEEKEEALGYHS
        Y+FL   +A+ G  +D E+ R+S++ R  ++  GWS+IE+D +L++F AGD +H    EI  KLDEI + A +KGY    +  +H+IEEEEKE   G HS
Subjt:  YVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEEKEEALGYHS

Query:  EKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW
        EKLAL  G + T PGTT+RI+KNLR+C DCHS MKY SK+SQR+I+LRD ++FHHF DG CSCGDYW
Subjt:  EKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW

AT1G04850.1 ubiquitin-associated (UBA)/TS-N domain-containing protein7.2e-17980.63Show/hide
Query:  MAGLSLKCGDCGALLRSVEEAQQHAELTSHSNFSESTEAVLNLVCTACGKPCRSKTESDLHTKRTGHTEFADKTLEAAKPISLEAPKVDAESEDGGDASA
        MAG+SLKCGDCG LL+SVEEAQ+HAELTSHSNF+ESTEAVLNLVCT C KPCRSK ESDLHTKRTGHTEF DKTLE  KPISLEAPKV  E +D    S 
Subjt:  MAGLSLKCGDCGALLRSVEEAQQHAELTSHSNFSESTEAVLNLVCTACGKPCRSKTESDLHTKRTGHTEFADKTLEAAKPISLEAPKVDAESEDGGDASA

Query:  SKSEEMVVPEVNKNILEELESMGFPTARATRALFYSGNASLEAAVNWVVEHENDPEIDQMPLVQVPKDTNVEAPKPSLTPEQLKAKQQELRERARKKKEE
          +EEMVVP+V+ NILEELE+MGFP ARATRAL YSGNASLEAAVNWVVEHENDP++D+MP  +VP ++NV   KP+LTPE++K K QELRERARKKKEE
Subjt:  SKSEEMVVPEVNKNILEELESMGFPTARATRALFYSGNASLEAAVNWVVEHENDPEIDQMPLVQVPKDTNVEAPKPSLTPEQLKAKQQELRERARKKKEE

Query:  EEKIAEREREKERIRIGKELLEAKRIEEENERKRILALRKAEKEEEKRARDKIRQKLEEDKAERRRRLGLPPEDPST--AKPPAPVVEEKKISLPVRPAS
        EEK  EREREKERIRIGKELLEAKR+EE NERKR++ LRKAEKEEEKRAR+KIRQKLEEDKAERRR+LGLPPEDP+T  AKP  PVVEEKK++LP+RPA+
Subjt:  EEKIAEREREKERIRIGKELLEAKRIEEENERKRILALRKAEKEEEKRARDKIRQKLEEDKAERRRRLGLPPEDPST--AKPPAPVVEEKKISLPVRPAS

Query:  KAEQMRECLRSLKSNHKEDDAKVKRAFQTLLTYVGNVAKNPDEEKFRKIRLSNQTFQDRVGALRGGIEFLELCGFEKIEGGEFLFLPRNKVDRAVLNSAG
        K EQMRECLRSLK  HKEDDAKVKRAFQTLLTY+GNVAKNPDEEKFRKIRL+NQTFQ+RVG+LRGGIEF+ELCGFEKIEGGEFLFLPR+K+D A++NSAG
Subjt:  KAEQMRECLRSLKSNHKEDDAKVKRAFQTLLTYVGNVAKNPDEEKFRKIRLSNQTFQDRVGALRGGIEFLELCGFEKIEGGEFLFLPRNKVDRAVLNSAG

Query:  SELDSAIKNPFFG
        +EL+SAI NPFFG
Subjt:  SELDSAIKNPFFG

AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.8e-11433.06Show/hide
Query:  FSSTSSSSSPQISSLETH-FIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSC---SSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENSRFE
        F    SSS P   S+  H  + L+H   +  +LR IH Q+ +  + +++  +++ I  C        + YA+S+F+  +  N  ++N + RG A +S   
Subjt:  FSSTSSSSSPQISSLETH-FIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSC---SSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENSRFE

Query:  SSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLINGYCR
        S++  ++ M+   + P+  TFPFVLKS A       G+ +H  +LK G + D +V  SL+ MYV+   L  A KVFD+SP       V+ +  LI GY  
Subjt:  SSISHFILMLKWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLINGYCR

Query:  VGDLIKARDLFESMPKKDTGSWNSLINGFMRKG-------------------------------------DLGR--------------------------
         G +  A+ LF+ +P KD  SWN++I+G+   G                                     +LGR                          
Subjt:  VGDLIKARDLFESMPKKDTGSWNSLINGFMRKG-------------------------------------DLGR--------------------------

Query:  -------AKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAKVGALD----------------------------
               A  LFE++P K+V+SW T++ G++     ++AL  F  ML  G  PND T++S L ACA +GA+D                            
Subjt:  -------AKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAKVGALD----------------------------

Query:  ---------------------------------------------------AGTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLV
                                                            G +PD + F+ +L+ACSHSG ++ G   F +M  DY + P ++HY  +
Subjt:  ---------------------------------------------------AGTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLV

Query:  VDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFI
        +D+LG +G   EA + I  M ++PD V+W +L  AC+ H N+E+ E  ++ L+++EP++PGSYV LSN YA+ GRW +  + R  + D+G +K PG S I
Subjt:  VDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFI

Query:  EVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEEKEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYAS
        E+D  +H F+ GD  H R  EIY  L+E+     + G+  +   VL  +EEE KE AL +HSEKLA+AFGL+ST PGT + IVKNLRVC +CH   K  S
Subjt:  EVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEEKEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYAS

Query:  KMSQREIILRDMKRFHHFNDGVCSCGDYW
        K+ +REII RD  RFHHF DGVCSC DYW
Subjt:  KMSQREIILRDMKRFHHFNDGVCSCGDYW

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.9e-11333.29Show/hide
Query:  SSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQF--ISSCSSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENSRFESSISHF
        S+ + P  ++  +  I LI    S   L+Q HG + R   FS     ++   +++ SS  S++YA  +F      NSF +N LIR  A       SI  F
Subjt:  SSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQF--ISSCSSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENSRFESSISHF

Query:  ILML-KWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLING--------
        + M+ + +  P++ TFPF++K+AA +S+  +G++LH   +K  +  D FV  SL+  Y    DL SA KVF     ++K   V+ WN +ING        
Subjt:  ILML-KWKISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLING--------

Query:  --------------------------------------------------------------YCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDL
                                                                      Y + G +  A+ LF++M +KD  +W ++++G+    D 
Subjt:  --------------------------------------------------------------YCRVGDLIKARDLFESMPKKDTGSWNSLINGFMRKGDL

Query:  GRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFF-CMLEEGARPNDYTIVSALSACAKVGALDAG------------------------------
          A+E+   MP K++V+W  +++ + QNG P +AL  F    L++  + N  T+VS LSACA+VGAL+ G                              
Subjt:  GRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFF-CMLEEGARPNDYTIVSALSACAKVGALDAG------------------------------

Query:  -----------------------------------------------TKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGR
                                                        KP+GV F  V  ACSH+G V++    F  M  +Y I P  KHY  +VD+LGR
Subjt:  -----------------------------------------------TKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGR

Query:  AGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKL
        +G L +A+KFI  MPI P   VWGAL  AC+ H N+ +AE+A  +LL+LEP++ G++V LSN YA +G+WE+   +R  MR  G +K+PG S IE+D  +
Subjt:  AGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKL

Query:  HRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEE-KEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQR
        H F++GDN H  + ++Y KL E+    +  GY  EI  VL  IEEEE KE++L  HSEKLA+ +GL+ST+    +R++KNLRVC DCHS  K  S++  R
Subjt:  HRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEE-KEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQR

Query:  EIILRDMKRFHHFNDGVCSCGDYW
        EII+RD  RFHHF +G CSC D+W
Subjt:  EIILRDMKRFHHFNDGVCSCGDYW

AT3G12770.1 mitochondrial editing factor 226.9e-11332.35Show/hide
Query:  ETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENSRFESSISHFILMLKWKISPDRL
        ++ +  LI ++     L+QIH +L    +  S  ++T+ I + SS   + +A  +F        F +NA+IRG + N+ F+ ++  +  M   ++SPD  
Subjt:  ETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENSRFESSISHFILMLKWKISPDRL

Query:  TFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLINGYCRVGDLIKARDLFESMPKKDT
        TFP +LK+ + LS+  +GR +H  + + G + D FV+  L+ +Y K   LGSA  VF+  P   +T  ++ W  +++ Y + G+ ++A ++F  M K D 
Subjt:  TFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLINGYCRVGDLIKARDLFESMPKKDT

Query:  -GSWNSLIN--------------------------------------GFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEG
           W +L++                                       + + G +  AK LF+KM   N++ W  M++G+++NG   +A++ F  M+ + 
Subjt:  -GSWNSLIN--------------------------------------GFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEG

Query:  ARPNDYTIVSALSACAKVGALD-----------------------------------------------------------------------------A
         RP+  +I SA+SACA+VG+L+                                                                              
Subjt:  ARPNDYTIVSALSACAKVGALD-----------------------------------------------------------------------------A

Query:  GTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKL
        G  P+ V FL +L AC+HSG V +G  FF+ M  D+ I P  +HY  V+D+LGRAG L++A + I+ MP++P   VWGAL  AC+ H+++E+ E A+++L
Subjt:  GTKPDGVVFLAVLTACSHSGQVNDGLEFFDSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKL

Query:  LQLEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEE
          ++P + G YV LSN YAA   W+    VRV M+++G  KD G S++EV  +L  F  GD +H R  EI  +++ I +  +E G+    +  LH++ +E
Subjt:  LQLEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGAQKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEE

Query:  EKEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW
        E EE L  HSE++A+A+GL+ST  GT +RI KNLR CV+CH+  K  SK+  REI++RD  RFHHF DGVCSCGDYW
Subjt:  EKEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASKMSQREIILRDMKRFHHFNDGVCSCGDYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGCGGAAGGAAGAATCCAATCCGCCAATTTGCTGCAAGGATTTCGATTCGGATCCAAGAATTTGGTTTCATCAATGGCGGGCCTATCGCTCAAGTGCGGGGATTG
TGGCGCTCTCTTGAGATCCGTAGAAGAAGCTCAACAACATGCCGAACTCACTTCTCACTCCAACTTCTCCGAGTCCACCGAAGCTGTCCTCAATCTCGTCTGCACTGCCT
GCGGCAAGCCCTGCCGATCCAAGACGGAAAGTGATTTGCACACGAAAAGGACCGGCCATACCGAGTTTGCTGATAAGACTTTGGAGGCTGCGAAACCAATAAGTTTGGAG
GCCCCAAAGGTAGATGCGGAATCAGAAGATGGTGGGGATGCAAGTGCTAGCAAGTCTGAAGAAATGGTTGTGCCAGAGGTGAACAAGAATATATTAGAGGAACTTGAATC
TATGGGCTTTCCAACAGCACGAGCAACCCGTGCACTCTTTTATTCTGGTAATGCCAGTCTTGAGGCTGCAGTCAATTGGGTAGTTGAACATGAGAATGATCCAGAGATAG
ATCAGATGCCTTTGGTACAAGTTCCTAAGGACACAAATGTTGAGGCTCCAAAGCCTTCTCTTACACCTGAGCAATTGAAAGCAAAGCAGCAGGAGCTAAGGGAACGGGCT
CGAAAGAAAAAAGAGGAGGAAGAGAAGATAGCGGAGAGAGAAAGGGAAAAGGAGAGAATTCGAATTGGCAAGGAGCTCTTAGAAGCAAAAAGGATCGAGGAAGAAAATGA
GAGAAAAAGAATATTAGCCTTGAGAAAAGCTGAAAAAGAAGAAGAGAAAAGAGCCAGAGACAAAATTCGTCAAAAACTTGAAGAGGACAAGGCAGAAAGAAGACGGAGGC
TTGGATTGCCACCAGAAGATCCTTCAACTGCAAAACCTCCGGCACCTGTTGTTGAGGAGAAAAAGATCTCATTACCGGTTAGACCTGCTTCAAAGGCAGAGCAAATGAGA
GAATGTTTGCGATCATTAAAGTCCAATCACAAGGAGGATGATGCTAAAGTGAAGAGAGCGTTTCAAACCCTTCTGACGTATGTGGGAAATGTGGCAAAAAATCCCGATGA
AGAGAAATTCAGAAAAATTAGACTTAGCAACCAAACTTTCCAGGATAGAGTGGGTGCCCTGAGAGGAGGAATTGAGTTTCTAGAGCTGTGTGGGTTCGAGAAAATTGAAG
GTGGCGAGTTCTTGTTTCTGCCCAGAAATAAGGTTGACAGGGCAGTGCTCAATTCAGCTGGCTCTGAGCTCGACTCTGCTATAAAGAATCCCTTCTTTGGCCCAAGGCTC
GCTTTTTTCAGTTCAACGTCTTCTTCATCATCACCTCAGATTTCATCTCTCGAAACCCATTTCATCGATCTAATTCATGCTTCCAATTCGACCTACAACCTCCGTCAGAT
CCATGGTCAACTCTACCGCTGCAACATCTTCTCCAGCAGCCGGGTTGTGACCCAGTTCATCTCTTCCTGTTCGTCGCTAAATTCTGTCGACTATGCCGTCTCGATCTTTC
AGCGGTTCGAGTTGAAGAACAGTTTCCTTTTTAATGCGTTGATTCGAGGACTTGCTGAAAATTCCAGGTTCGAGAGCTCAATTTCTCACTTTATTTTAATGTTGAAGTGG
AAAATTAGCCCTGATAGGCTTACTTTTCCGTTTGTGCTCAAATCAGCGGCGGCTCTTTCCAATGGTGGCGTTGGGAGGGCTTTGCATTGTGGGATTTTGAAGTTTGGTCT
TGAGTTTGATTCTTTTGTGAGGGTTTCGTTGGTGGATATGTACGTGAAAGTTGAGGATTTGGGTTCTGCCTTGAAGGTGTTTGATGAAAGTCCTGCGAGTGTTAAGACTG
GAAGTGTGTTGATTTGGAATGTTCTTATTAATGGGTATTGTAGAGTGGGGGATTTAATAAAAGCTAGGGACCTATTCGAGTCAATGCCGAAGAAGGATACGGGATCTTGG
AATAGTTTGATCAATGGTTTCATGAGAAAGGGGGACTTGGGTCGAGCAAAGGAACTGTTTGAGAAAATGCCCGGAAAAAATGTTGTTTCTTGGACTACAATGGTGAATGG
ATTTTCACAGAATGGAGACCCTGAAAAGGCACTGGAAACTTTCTTTTGTATGCTTGAAGAAGGCGCACGGCCGAATGATTACACAATTGTCTCTGCACTTTCAGCTTGTG
CAAAAGTTGGTGCCTTAGATGCTGGAACAAAGCCAGATGGTGTGGTCTTTCTTGCTGTTCTTACTGCATGCTCCCATTCTGGACAAGTAAACGATGGACTTGAGTTTTTT
GACAGTATGAGACACGACTACTCGATTGAGCCTTCTATGAAGCATTACACACTGGTTGTAGACATGCTAGGCAGGGCTGGTAGACTAAATGAAGCTCTAAAGTTCATACG
TGACATGCCCATAAAGCCTGATTTTGTGGTGTGGGGTGCTCTATTTTGTGCTTGTAGGACTCATAAGAACATTGAAATGGCAGAACTAGCATCCAAAAAGCTTCTTCAGC
TTGAACCCAAGCATCCGGGGAGTTATGTGTTTTTGTCGAACGCATATGCTGCCGTAGGGAGATGGGAAGATGCAGAGAGAGTGAGAGTTTCTATGCGTGATCGCGGTGCA
CAAAAAGATCCTGGATGGAGCTTTATTGAAGTGGATGATAAATTACATAGATTTGTGGCCGGTGATAACACTCATAACCGTGCTGTGGAGATATACTCGAAATTAGATGA
GATAAGTGCAGGTGCTAGGGAAAAAGGATACACAAAAGAAATTGAGTGTGTACTTCATAATATTGAAGAGGAAGAAAAGGAAGAAGCATTGGGATATCACAGCGAGAAGT
TGGCACTTGCTTTCGGGCTTGTTAGTACGGACCCCGGAACGACCGTTAGGATAGTGAAAAACCTTAGAGTTTGTGTGGATTGTCATTCTTTCATGAAATATGCCAGTAAA
ATGAGTCAGAGGGAGATCATTTTGAGGGATATGAAGCGATTCCATCATTTTAATGATGGTGTTTGTTCATGTGGAGATTATTGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGCGGAAGGAAGAATCCAATCCGCCAATTTGCTGCAAGGATTTCGATTCGGATCCAAGAATTTGGTTTCATCAATGGCGGGCCTATCGCTCAAGTGCGGGGATTG
TGGCGCTCTCTTGAGATCCGTAGAAGAAGCTCAACAACATGCCGAACTCACTTCTCACTCCAACTTCTCCGAGTCCACCGAAGCTGTCCTCAATCTCGTCTGCACTGCCT
GCGGCAAGCCCTGCCGATCCAAGACGGAAAGTGATTTGCACACGAAAAGGACCGGCCATACCGAGTTTGCTGATAAGACTTTGGAGGCTGCGAAACCAATAAGTTTGGAG
GCCCCAAAGGTAGATGCGGAATCAGAAGATGGTGGGGATGCAAGTGCTAGCAAGTCTGAAGAAATGGTTGTGCCAGAGGTGAACAAGAATATATTAGAGGAACTTGAATC
TATGGGCTTTCCAACAGCACGAGCAACCCGTGCACTCTTTTATTCTGGTAATGCCAGTCTTGAGGCTGCAGTCAATTGGGTAGTTGAACATGAGAATGATCCAGAGATAG
ATCAGATGCCTTTGGTACAAGTTCCTAAGGACACAAATGTTGAGGCTCCAAAGCCTTCTCTTACACCTGAGCAATTGAAAGCAAAGCAGCAGGAGCTAAGGGAACGGGCT
CGAAAGAAAAAAGAGGAGGAAGAGAAGATAGCGGAGAGAGAAAGGGAAAAGGAGAGAATTCGAATTGGCAAGGAGCTCTTAGAAGCAAAAAGGATCGAGGAAGAAAATGA
GAGAAAAAGAATATTAGCCTTGAGAAAAGCTGAAAAAGAAGAAGAGAAAAGAGCCAGAGACAAAATTCGTCAAAAACTTGAAGAGGACAAGGCAGAAAGAAGACGGAGGC
TTGGATTGCCACCAGAAGATCCTTCAACTGCAAAACCTCCGGCACCTGTTGTTGAGGAGAAAAAGATCTCATTACCGGTTAGACCTGCTTCAAAGGCAGAGCAAATGAGA
GAATGTTTGCGATCATTAAAGTCCAATCACAAGGAGGATGATGCTAAAGTGAAGAGAGCGTTTCAAACCCTTCTGACGTATGTGGGAAATGTGGCAAAAAATCCCGATGA
AGAGAAATTCAGAAAAATTAGACTTAGCAACCAAACTTTCCAGGATAGAGTGGGTGCCCTGAGAGGAGGAATTGAGTTTCTAGAGCTGTGTGGGTTCGAGAAAATTGAAG
GTGGCGAGTTCTTGTTTCTGCCCAGAAATAAGGTTGACAGGGCAGTGCTCAATTCAGCTGGCTCTGAGCTCGACTCTGCTATAAAGAATCCCTTCTTTGGCCCAAGGCTC
GCTTTTTTCAGTTCAACGTCTTCTTCATCATCACCTCAGATTTCATCTCTCGAAACCCATTTCATCGATCTAATTCATGCTTCCAATTCGACCTACAACCTCCGTCAGAT
CCATGGTCAACTCTACCGCTGCAACATCTTCTCCAGCAGCCGGGTTGTGACCCAGTTCATCTCTTCCTGTTCGTCGCTAAATTCTGTCGACTATGCCGTCTCGATCTTTC
AGCGGTTCGAGTTGAAGAACAGTTTCCTTTTTAATGCGTTGATTCGAGGACTTGCTGAAAATTCCAGGTTCGAGAGCTCAATTTCTCACTTTATTTTAATGTTGAAGTGG
AAAATTAGCCCTGATAGGCTTACTTTTCCGTTTGTGCTCAAATCAGCGGCGGCTCTTTCCAATGGTGGCGTTGGGAGGGCTTTGCATTGTGGGATTTTGAAGTTTGGTCT
TGAGTTTGATTCTTTTGTGAGGGTTTCGTTGGTGGATATGTACGTGAAAGTTGAGGATTTGGGTTCTGCCTTGAAGGTGTTTGATGAAAGTCCTGCGAGTGTTAAGACTG
GAAGTGTGTTGATTTGGAATGTTCTTATTAATGGGTATTGTAGAGTGGGGGATTTAATAAAAGCTAGGGACCTATTCGAGTCAATGCCGAAGAAGGATACGGGATCTTGG
AATAGTTTGATCAATGGTTTCATGAGAAAGGGGGACTTGGGTCGAGCAAAGGAACTGTTTGAGAAAATGCCCGGAAAAAATGTTGTTTCTTGGACTACAATGGTGAATGG
ATTTTCACAGAATGGAGACCCTGAAAAGGCACTGGAAACTTTCTTTTGTATGCTTGAAGAAGGCGCACGGCCGAATGATTACACAATTGTCTCTGCACTTTCAGCTTGTG
CAAAAGTTGGTGCCTTAGATGCTGGAACAAAGCCAGATGGTGTGGTCTTTCTTGCTGTTCTTACTGCATGCTCCCATTCTGGACAAGTAAACGATGGACTTGAGTTTTTT
GACAGTATGAGACACGACTACTCGATTGAGCCTTCTATGAAGCATTACACACTGGTTGTAGACATGCTAGGCAGGGCTGGTAGACTAAATGAAGCTCTAAAGTTCATACG
TGACATGCCCATAAAGCCTGATTTTGTGGTGTGGGGTGCTCTATTTTGTGCTTGTAGGACTCATAAGAACATTGAAATGGCAGAACTAGCATCCAAAAAGCTTCTTCAGC
TTGAACCCAAGCATCCGGGGAGTTATGTGTTTTTGTCGAACGCATATGCTGCCGTAGGGAGATGGGAAGATGCAGAGAGAGTGAGAGTTTCTATGCGTGATCGCGGTGCA
CAAAAAGATCCTGGATGGAGCTTTATTGAAGTGGATGATAAATTACATAGATTTGTGGCCGGTGATAACACTCATAACCGTGCTGTGGAGATATACTCGAAATTAGATGA
GATAAGTGCAGGTGCTAGGGAAAAAGGATACACAAAAGAAATTGAGTGTGTACTTCATAATATTGAAGAGGAAGAAAAGGAAGAAGCATTGGGATATCACAGCGAGAAGT
TGGCACTTGCTTTCGGGCTTGTTAGTACGGACCCCGGAACGACCGTTAGGATAGTGAAAAACCTTAGAGTTTGTGTGGATTGTCATTCTTTCATGAAATATGCCAGTAAA
ATGAGTCAGAGGGAGATCATTTTGAGGGATATGAAGCGATTCCATCATTTTAATGATGGTGTTTGTTCATGTGGAGATTATTGGTAA
Protein sequenceShow/hide protein sequence
MVAEGRIQSANLLQGFRFGSKNLVSSMAGLSLKCGDCGALLRSVEEAQQHAELTSHSNFSESTEAVLNLVCTACGKPCRSKTESDLHTKRTGHTEFADKTLEAAKPISLE
APKVDAESEDGGDASASKSEEMVVPEVNKNILEELESMGFPTARATRALFYSGNASLEAAVNWVVEHENDPEIDQMPLVQVPKDTNVEAPKPSLTPEQLKAKQQELRERA
RKKKEEEEKIAEREREKERIRIGKELLEAKRIEEENERKRILALRKAEKEEEKRARDKIRQKLEEDKAERRRRLGLPPEDPSTAKPPAPVVEEKKISLPVRPASKAEQMR
ECLRSLKSNHKEDDAKVKRAFQTLLTYVGNVAKNPDEEKFRKIRLSNQTFQDRVGALRGGIEFLELCGFEKIEGGEFLFLPRNKVDRAVLNSAGSELDSAIKNPFFGPRL
AFFSSTSSSSSPQISSLETHFIDLIHASNSTYNLRQIHGQLYRCNIFSSSRVVTQFISSCSSLNSVDYAVSIFQRFELKNSFLFNALIRGLAENSRFESSISHFILMLKW
KISPDRLTFPFVLKSAAALSNGGVGRALHCGILKFGLEFDSFVRVSLVDMYVKVEDLGSALKVFDESPASVKTGSVLIWNVLINGYCRVGDLIKARDLFESMPKKDTGSW
NSLINGFMRKGDLGRAKELFEKMPGKNVVSWTTMVNGFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAKVGALDAGTKPDGVVFLAVLTACSHSGQVNDGLEFF
DSMRHDYSIEPSMKHYTLVVDMLGRAGRLNEALKFIRDMPIKPDFVVWGALFCACRTHKNIEMAELASKKLLQLEPKHPGSYVFLSNAYAAVGRWEDAERVRVSMRDRGA
QKDPGWSFIEVDDKLHRFVAGDNTHNRAVEIYSKLDEISAGAREKGYTKEIECVLHNIEEEEKEEALGYHSEKLALAFGLVSTDPGTTVRIVKNLRVCVDCHSFMKYASK
MSQREIILRDMKRFHHFNDGVCSCGDYW