; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0025767 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0025767
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr09:6343872..6345936
RNA-Seq ExpressionPI0025767
SyntenyPI0025767
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0067569.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]3.8e-30290.27Show/hide
Query:  PLGTNSKSHFIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERA
        P  +  KSHFIPTSNF LLFSL TSNL SLHLNSSG PSPILEQ SIALPDIH NS+L DFQLP LSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERA
Subjt:  PLGTNSKSHFIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERA

Query:  GFRPEKSTLRHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQR
        GFRPEKSTLRHLI                RDFVDFGV PDRDTCS+LVSSCV+G            FERDSDVA+T FEAAMRGYNKLHMYKSTIMVFQR
Subjt:  GFRPEKSTLRHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQR

Query:  LKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVK
        LKSARIEADSGCY RVMEAYLKLGDSERVMELFNEVESRIS+ TPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASI+EVK
Subjt:  LKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVK

Query:  LAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCR
        LAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLI +GCEPGQVTYASAINAYCR
Subjt:  LAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCR

Query:  VGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII
        VGLYSKAEDIFGEMEEKGFDKCVVAYSSLI MYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII
Subjt:  VGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII

Query:  SAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV
        +AYVKASEFEKCEQYYREFRMNGGTIDKAI GIMVGVFSKTSRVDELVKLLRDMKLEGTRLD RLY+SALNALMDAGLQVQAKWLQDHYAGKSGFV
Subjt:  SAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV

XP_004151188.1 pentatricopeptide repeat-containing protein At5g13770, chloroplastic [Cucumis sativus]1.7e-30289.93Show/hide
Query:  PLGTNSKSHFIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERA
        P  +  KSHFI TSNF+LLFSLPTSNL SLHLNSSGCPSPILEQ SIALPDIH NS+L DFQLPSL NV+DLNDFLCGLSQNPGTEDLIYDYYVKAKE A
Subjt:  PLGTNSKSHFIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERA

Query:  GFRPEKSTLRHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQR
        GFRP+KSTLRHLI                RDFVDFGV PDRDTCS+LVSSCV+G            FERDS VAMTAFEAAMRGYNKLHM+KSTIMVFQR
Subjt:  GFRPEKSTLRHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQR

Query:  LKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVK
        LKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRIS  TPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVK
Subjt:  LKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVK

Query:  LAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCR
        LAEDLYNEAKAKKLLRDPAMFLKLILMY+QQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGY+AAVKVYEKLIE+GCEPGQVTYASAINAYCR
Subjt:  LAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCR

Query:  VGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII
        VGLYSKAEDIFGEMEEKGFDKCVVAYSSLI MYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII
Subjt:  VGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII

Query:  SAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV
        SAYVKASEFEKCEQYYREFRMNGGTIDKA  GIMVGVFSKTSRVDELVKLLRDMKLEGTRLD RLY++ALNALMDAGLQVQAKWLQDHYAGKSGFV
Subjt:  SAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV

XP_008465506.1 PREDICTED: pentatricopeptide repeat-containing protein At5g13770, chloroplastic [Cucumis melo]6.9e-30490.77Show/hide
Query:  PLGTNSKSHFIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERA
        P  +  KSHFIPTSNF LLFSLPTSNL SLHLNSSG PSPILEQ SIALPDIH NS+L DFQLP LSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERA
Subjt:  PLGTNSKSHFIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERA

Query:  GFRPEKSTLRHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQR
        GFRPEKSTLRHLI                RDFVDFGV PDRDTCS+LVSSCV+G            FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQR
Subjt:  GFRPEKSTLRHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQR

Query:  LKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVK
        LKSARIEADSGCY RVMEAYLKLGDSERVMELFNEVESRIS+ TPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASI+EVK
Subjt:  LKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVK

Query:  LAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCR
        LAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLI +GCEPGQVTYASAINAYCR
Subjt:  LAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCR

Query:  VGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII
        VGLYSKAEDIFGEMEEKGFDKCVVAYSSLI MYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII
Subjt:  VGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII

Query:  SAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV
        +AYVKASEFEKCEQYYREFRMNGGTIDKAI GIMVGVFSKTSRVDELVKLLRDMKLEGTRLD RLY+SALNALMDAGLQVQAKWLQDHYAGKSGFV
Subjt:  SAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV

XP_022938691.1 pentatricopeptide repeat-containing protein At5g13770, chloroplastic [Cucurbita moschata]2.1e-28486.03Show/hide
Query:  FIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERAGFRPEKSTL
        FIP SN ALLFSLP  NLRSLHLNSSGCPSPILE S  +LP+I  +S+LQDFQLPS S+VEDLNDFLCGL QNPG EDLIY+YYVKAKE  GFRPEKSTL
Subjt:  FIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERAGFRPEKSTL

Query:  RHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQRLKSARIEAD
        RHLI                RDFVD+ V PDRDTCSRLVSSCV+G            FERD DVA  AFEAAMRGYNKLHMYKSTI+VFQRLKSA+IEAD
Subjt:  RHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQRLKSARIEAD

Query:  SGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLYNEA
        SGCYCRVMEAYLKLGDSER+MELFNE+ESRISDFTPFSTKIYGILC+SLAKSGRVFESLEFFRDMRKKGI EDYTIYSALICTFASIQEVKLAEDLYNEA
Subjt:  SGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLYNEA

Query:  KAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGLYSKAED
        K KKLLRDPAMF KLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGY+AAV VYEKLI + CEPGQVTYA AINAYCRVGLYSKAED
Subjt:  KAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGLYSKAED

Query:  IFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEF
        +F EMEEKGFDKCVVAYSSLI MYGKTGRLKDAMRLLAKMKE+GCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII+AYVKA+EF
Subjt:  IFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEF

Query:  EKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV
        E CE+YYREFRMNGG IDKAIAGIMVGVFSKTSRVDELVKLLRDM LEG RLD RLY+SALNALMDAGLQVQAKWLQDHYAGKSGFV
Subjt:  EKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV

XP_038874313.1 pentatricopeptide repeat-containing protein At5g13770, chloroplastic isoform X1 [Benincasa hispida]1.8e-29688.7Show/hide
Query:  KSH---FIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERAGFR
        KSH   FIPTSN + LFSLPTSNLRSLHL SSGCPSPILEQSSIALPDIHL+S+LQD QLPSL  VEDLNDFLCGLSQNPG+EDLIY+YYVKAKE+AGFR
Subjt:  KSH---FIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERAGFR

Query:  PEKSTLRHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQRLKS
        PEKSTLRHLI                RDFVD+ V PDRDTCSRLVSSCV+G            FE+DSDVA  AFEAAMRGYNKLHMYKSTI+VFQRLKS
Subjt:  PEKSTLRHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQRLKS

Query:  ARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAE
        ARIEADSGC CRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGI EDYTIYSALI TFASIQEVKLAE
Subjt:  ARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAE

Query:  DLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGL
        DLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALE+V+VMKDFKIGVSDCIFCAIVNGYATRRGY+AAVKVYEKLIE+GCEPGQVTYASAINAYCRVGL
Subjt:  DLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGL

Query:  YSKAEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAY
        YSKAED+FGEMEEKGFDKCVVAYSSLI MYGKTGRLKDAMRLLAKMKE+GCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAY
Subjt:  YSKAEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAY

Query:  VKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV
         KASEFE CEQYY EFRMNGGTIDKA+AGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLY+SALNALMDAGLQVQAKWLQ HYAGKSGFV
Subjt:  VKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV

TrEMBL top hitse value%identityAlignment
A0A0A0KPV8 Uncharacterized protein8.2e-30389.93Show/hide
Query:  PLGTNSKSHFIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERA
        P  +  KSHFI TSNF+LLFSLPTSNL SLHLNSSGCPSPILEQ SIALPDIH NS+L DFQLPSL NV+DLNDFLCGLSQNPGTEDLIYDYYVKAKE A
Subjt:  PLGTNSKSHFIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERA

Query:  GFRPEKSTLRHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQR
        GFRP+KSTLRHLI                RDFVDFGV PDRDTCS+LVSSCV+G            FERDS VAMTAFEAAMRGYNKLHM+KSTIMVFQR
Subjt:  GFRPEKSTLRHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQR

Query:  LKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVK
        LKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRIS  TPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVK
Subjt:  LKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVK

Query:  LAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCR
        LAEDLYNEAKAKKLLRDPAMFLKLILMY+QQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGY+AAVKVYEKLIE+GCEPGQVTYASAINAYCR
Subjt:  LAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCR

Query:  VGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII
        VGLYSKAEDIFGEMEEKGFDKCVVAYSSLI MYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII
Subjt:  VGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII

Query:  SAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV
        SAYVKASEFEKCEQYYREFRMNGGTIDKA  GIMVGVFSKTSRVDELVKLLRDMKLEGTRLD RLY++ALNALMDAGLQVQAKWLQDHYAGKSGFV
Subjt:  SAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV

A0A1S3CPF0 pentatricopeptide repeat-containing protein At5g13770, chloroplastic3.3e-30490.77Show/hide
Query:  PLGTNSKSHFIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERA
        P  +  KSHFIPTSNF LLFSLPTSNL SLHLNSSG PSPILEQ SIALPDIH NS+L DFQLP LSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERA
Subjt:  PLGTNSKSHFIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERA

Query:  GFRPEKSTLRHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQR
        GFRPEKSTLRHLI                RDFVDFGV PDRDTCS+LVSSCV+G            FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQR
Subjt:  GFRPEKSTLRHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQR

Query:  LKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVK
        LKSARIEADSGCY RVMEAYLKLGDSERVMELFNEVESRIS+ TPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASI+EVK
Subjt:  LKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVK

Query:  LAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCR
        LAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLI +GCEPGQVTYASAINAYCR
Subjt:  LAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCR

Query:  VGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII
        VGLYSKAEDIFGEMEEKGFDKCVVAYSSLI MYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII
Subjt:  VGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII

Query:  SAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV
        +AYVKASEFEKCEQYYREFRMNGGTIDKAI GIMVGVFSKTSRVDELVKLLRDMKLEGTRLD RLY+SALNALMDAGLQVQAKWLQDHYAGKSGFV
Subjt:  SAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV

A0A5A7VR01 Pentatricopeptide repeat-containing protein1.8e-30290.27Show/hide
Query:  PLGTNSKSHFIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERA
        P  +  KSHFIPTSNF LLFSL TSNL SLHLNSSG PSPILEQ SIALPDIH NS+L DFQLP LSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERA
Subjt:  PLGTNSKSHFIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERA

Query:  GFRPEKSTLRHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQR
        GFRPEKSTLRHLI                RDFVDFGV PDRDTCS+LVSSCV+G            FERDSDVA+T FEAAMRGYNKLHMYKSTIMVFQR
Subjt:  GFRPEKSTLRHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQR

Query:  LKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVK
        LKSARIEADSGCY RVMEAYLKLGDSERVMELFNEVESRIS+ TPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASI+EVK
Subjt:  LKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVK

Query:  LAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCR
        LAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLI +GCEPGQVTYASAINAYCR
Subjt:  LAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCR

Query:  VGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII
        VGLYSKAEDIFGEMEEKGFDKCVVAYSSLI MYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII
Subjt:  VGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII

Query:  SAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV
        +AYVKASEFEKCEQYYREFRMNGGTIDKAI GIMVGVFSKTSRVDELVKLLRDMKLEGTRLD RLY+SALNALMDAGLQVQAKWLQDHYAGKSGFV
Subjt:  SAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV

A0A6J1FDV4 pentatricopeptide repeat-containing protein At5g13770, chloroplastic1.0e-28486.03Show/hide
Query:  FIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERAGFRPEKSTL
        FIP SN ALLFSLP  NLRSLHLNSSGCPSPILE S  +LP+I  +S+LQDFQLPS S+VEDLNDFLCGL QNPG EDLIY+YYVKAKE  GFRPEKSTL
Subjt:  FIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERAGFRPEKSTL

Query:  RHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQRLKSARIEAD
        RHLI                RDFVD+ V PDRDTCSRLVSSCV+G            FERD DVA  AFEAAMRGYNKLHMYKSTI+VFQRLKSA+IEAD
Subjt:  RHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQRLKSARIEAD

Query:  SGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLYNEA
        SGCYCRVMEAYLKLGDSER+MELFNE+ESRISDFTPFSTKIYGILC+SLAKSGRVFESLEFFRDMRKKGI EDYTIYSALICTFASIQEVKLAEDLYNEA
Subjt:  SGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLYNEA

Query:  KAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGLYSKAED
        K KKLLRDPAMF KLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGY+AAV VYEKLI + CEPGQVTYA AINAYCRVGLYSKAED
Subjt:  KAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGLYSKAED

Query:  IFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEF
        +F EMEEKGFDKCVVAYSSLI MYGKTGRLKDAMRLLAKMKE+GCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII+AYVKA+EF
Subjt:  IFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEF

Query:  EKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV
        E CE+YYREFRMNGG IDKAIAGIMVGVFSKTSRVDELVKLLRDM LEG RLD RLY+SALNALMDAGLQVQAKWLQDHYAGKSGFV
Subjt:  EKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV

A0A6J1JWB4 pentatricopeptide repeat-containing protein At5g13770, chloroplastic isoform X11.1e-28385.52Show/hide
Query:  FIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERAGFRPEKSTL
        FIP SN ALLFSLP  NLRSLHLNSSGCPSPILE S  +LP+I  +S+LQDFQLPS S+VEDLNDFLCGL QNPG EDLIY+YYVKAKE  GFRPEKSTL
Subjt:  FIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERAGFRPEKSTL

Query:  RHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQRLKSARIEAD
        RHLI                RDFVD+ V PDRDTCSRLVSSCV+G            FERD DVA  AFEAAMRGYNKLHMY+STI+VFQRLKSA+IEAD
Subjt:  RHLI----------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQRLKSARIEAD

Query:  SGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLYNEA
        SGCYCRVMEAYLKLGDSER+MELFNE+ESRISDFTPFSTKIYGILC+SLAKSGRVFESLEFFRDMRKKGI EDYTIYSALICTFASIQEVKLAEDLYNEA
Subjt:  SGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLYNEA

Query:  KAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGLYSKAED
        K KKLLRDPA F KLILMYIQQGSLEKALEIVEVMKDFKIG SDCIFCAIVNGYATRRGY+AAV +YEKLI + CEPGQVTYA AINAYCRVGLYSKAED
Subjt:  KAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGLYSKAED

Query:  IFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEF
        +F EMEEKGFDKCVVAYSSLI MYGKTGRLKDAMRLLAKMKE+GCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSII+AYVKA+EF
Subjt:  IFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEF

Query:  EKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV
        E CE+YYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDM LEG RLD RLY+SALNALMDAGLQVQAKWLQDHYAGKSGFV
Subjt:  EKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV

SwissProt top hitse value%identityAlignment
O82178 Pentatricopeptide repeat-containing protein At2g351301.8e-3323.67Show/hide
Query:  FEAAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAK-SGRVFESLEFFRDMRK
        F   +  Y +   YK    ++ +L  +R       Y  +++AY   G  ER   +  E+++           +Y    E L K  G   E+++ F+ M++
Subjt:  FEAAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAK-SGRVFESLEFFRDMRK

Query:  KGIAEDYTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYD-AAVKV
                 Y+ +I  +    +  ++  LY E ++ +   +   +  L+  + ++G  EKA EI E +++  +     ++ A++  Y +R GY   A ++
Subjt:  KGIAEDYTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYD-AAVKV

Query:  YEKLIEEGCEPGQVTYASAINAYCRVGLYSKAEDIFGEME-----------------------------------EKGFDKCVVAYSSLILMYGKTGRLK
        +  +   GCEP + +Y   ++AY R GL+S AE +F EM+                                   E G +      +S++ +YG+ G+  
Subjt:  YEKLIEEGCEPGQVTYASAINAYCRVGLYSKAEDIFGEME-----------------------------------EKGFDKCVVAYSSLILMYGKTGRLK

Query:  DAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSK
           ++LA+M+   C  ++  YNIL+ ++GKA  L+++E+L+ E+K K   PD V++TS I AY +   + KC + + E   +G   D   A +++   S 
Subjt:  DAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSK

Query:  TSRVDELVKLLRDM
          +V+++  +LR M
Subjt:  TSRVDELVKLLRDM

Q66GP4 Pentatricopeptide repeat-containing protein At5g13770, chloroplastic3.6e-15450.09Show/hide
Query:  NSKSHFIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERAGFRP
        N  S   PT     L   P     + H+ SS C S +LE+     P      D   F  P      DLN  L    ++P T  L  ++Y KAKE +  R 
Subjt:  NSKSHFIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERAGFRP

Query:  EKSTLRHLI------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQRLK-SARIE
         K  + +L+             D  +    PD  TCS L+ SC++             F  D  +A++A +AAM+G+NKL MY STI VF RLK S  +E
Subjt:  EKSTLRHLI------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQRLK-SARIE

Query:  ADSGCYCRVMEAYLKLGDSERVMELFNEVES-RISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLY
           GCYCR+MEA+ K+G++ +V+ELF E +S R+S     S  IY I+C SLAKSGR FE+LE   +M+ KGI E   +YS LI  FA  +EV + E L+
Subjt:  ADSGCYCRVMEAYLKLGDSERVMELFNEVES-RISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLY

Query:  NEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGLYSK
         EA  KKLL+DP M LK++LMY+++G++E  LE+V  M+  ++ V+DCI CAIVNG++ +RG+  AVKVYE  ++E CE GQVTYA AINAYCR+  Y+K
Subjt:  NEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGLYSK

Query:  AEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKA
        AE +F EM +KGFDKCVVAYS+++ MYGKT RL DA+RL+AKMK++GC+PN+WIYN L++MHG+A +L++ EK+WKEMKR K+ PDKVSYTS+ISAY ++
Subjt:  AEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKA

Query:  SEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHY
         E E+C + Y+EFRMN G ID+A+AGIMVGVFSKTSR+DEL++LL+DMK+EGTRLD RLY SALNAL DAGL  Q +WLQ+ +
Subjt:  SEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHY

Q8L844 Pentatricopeptide repeat-containing protein At5g42310, chloroplastic3.7e-3424.04Show/hide
Query:  DFVDFGVSPDRDTCSRLVSSCV-----KGFERDS-DVAMTAFEAAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVME---LF
        DFV++ +     T S  + S +     K  ERD  ++ +      + G+ K       + +    ++  + A +     ++ A   L DS R +E   LF
Subjt:  DFVDFGVSPDRDTCSRLVSSCV-----KGFERDS-DVAMTAFEAAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVME---LF

Query:  NEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGS
         E+  R S   P  T+ Y  L +   K+G + ++     +M K+G++ D   YS LI  + +    + A  +  E +A  +  +  +F +L+  +  +G 
Subjt:  NEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGS

Query:  LEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMY
         +K  ++++ MK   +      +  +++ +      D A+  +++++ EG EP +VT+ + I+ +C+ G +  AE++F  ME +G   C   Y+ +I  Y
Subjt:  LEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMY

Query:  GKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAIAGI
        G   R  D  RLL KMK +G  PNV  +  L++++GK+       +  +EMK   + P    Y ++I+AY +    E+    +R    +G          
Subjt:  GKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAIAGI

Query:  MVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALM
        ++  F +  R  E   +L+ MK  G + D   Y + + AL+
Subjt:  MVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALM

Q8S8P6 Pentatricopeptide repeat-containing protein At2g326301.4e-3323.46Show/hide
Query:  GVSPDRDTC---------SRLVSSCVKGFER--DSDVAMTAFE--AAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVMELFN
        G+S D  +C          R +  C++ F R  DS V +T +     + G  +    + +  + +      I+ ++  Y  ++ AY+K  D   V  +  
Subjt:  GVSPDRDTC---------SRLVSSCVKGFER--DSDVAMTAFE--AAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVMELFN

Query:  EVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSL
         ++    D   ++   Y +L E   K+G++ ++ + F +MR++GI  D  +Y++LI        +K A  L++E   K L      +  LI    + G +
Subjt:  EVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSL

Query:  EKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMYG
          A  ++  M+   + ++  +F  +++GY  +   D A  +Y+ + ++G +    T  +  + + R+  Y +A+     M E G     V+Y++LI +Y 
Subjt:  EKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMYG

Query:  KTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIM
        K G +++A RL  +M  KG QPN   YN+++  + K   +K+  KL   M+   + PD  +YTS+I     A   ++  + + E  + G   +     +M
Subjt:  KTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIM

Query:  VGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNAL
        +   SK  + DE   L  +MK +G  +D ++Y + + ++
Subjt:  VGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNAL

Q9LW84 Pentatricopeptide repeat-containing protein At3g160104.1e-3323.17Show/hide
Query:  FEAAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKK
        + A +  Y KL    S I +F  +K   ++     Y  ++  Y K+G  E+ ++LF E++      T ++   Y  L + L K+GRV E+  F++DM + 
Subjt:  FEAAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKK

Query:  GIAEDYTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLI-LMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVY
        G+  D    + L+     +  V+   ++++E    +       +  +I  ++  +  + +     + MK   +  S+  +  +++GY      + A+ + 
Subjt:  GIAEDYTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLI-LMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVY

Query:  EKLIEEGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQ
        E++ E+G  P    Y S INA  +   Y  A ++F E++E   +     Y+ +I  +GK G+L +A+ L  +MK +G  P+V+ YN LM    KA  + +
Subjt:  EKLIEEGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQ

Query:  VEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNAL
           L ++M+      D  S+  I++ + +     +  + +   + +G   D      ++G F+     +E  +++R+MK +G   D   Y S L+A+
Subjt:  VEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNAL

Arabidopsis top hitse value%identityAlignment
AT2G32630.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.0e-3423.46Show/hide
Query:  GVSPDRDTC---------SRLVSSCVKGFER--DSDVAMTAFE--AAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVMELFN
        G+S D  +C          R +  C++ F R  DS V +T +     + G  +    + +  + +      I+ ++  Y  ++ AY+K  D   V  +  
Subjt:  GVSPDRDTC---------SRLVSSCVKGFER--DSDVAMTAFE--AAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVMELFN

Query:  EVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSL
         ++    D   ++   Y +L E   K+G++ ++ + F +MR++GI  D  +Y++LI        +K A  L++E   K L      +  LI    + G +
Subjt:  EVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSL

Query:  EKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMYG
          A  ++  M+   + ++  +F  +++GY  +   D A  +Y+ + ++G +    T  +  + + R+  Y +A+     M E G     V+Y++LI +Y 
Subjt:  EKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMYG

Query:  KTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIM
        K G +++A RL  +M  KG QPN   YN+++  + K   +K+  KL   M+   + PD  +YTS+I     A   ++  + + E  + G   +     +M
Subjt:  KTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIM

Query:  VGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNAL
        +   SK  + DE   L  +MK +G  +D ++Y + + ++
Subjt:  VGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNAL

AT2G35130.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-3423.67Show/hide
Query:  FEAAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAK-SGRVFESLEFFRDMRK
        F   +  Y +   YK    ++ +L  +R       Y  +++AY   G  ER   +  E+++           +Y    E L K  G   E+++ F+ M++
Subjt:  FEAAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAK-SGRVFESLEFFRDMRK

Query:  KGIAEDYTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYD-AAVKV
                 Y+ +I  +    +  ++  LY E ++ +   +   +  L+  + ++G  EKA EI E +++  +     ++ A++  Y +R GY   A ++
Subjt:  KGIAEDYTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYD-AAVKV

Query:  YEKLIEEGCEPGQVTYASAINAYCRVGLYSKAEDIFGEME-----------------------------------EKGFDKCVVAYSSLILMYGKTGRLK
        +  +   GCEP + +Y   ++AY R GL+S AE +F EM+                                   E G +      +S++ +YG+ G+  
Subjt:  YEKLIEEGCEPGQVTYASAINAYCRVGLYSKAEDIFGEME-----------------------------------EKGFDKCVVAYSSLILMYGKTGRLK

Query:  DAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSK
           ++LA+M+   C  ++  YNIL+ ++GKA  L+++E+L+ E+K K   PD V++TS I AY +   + KC + + E   +G   D   A +++   S 
Subjt:  DAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSK

Query:  TSRVDELVKLLRDM
          +V+++  +LR M
Subjt:  TSRVDELVKLLRDM

AT2G35130.2 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-3423.67Show/hide
Query:  FEAAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAK-SGRVFESLEFFRDMRK
        F   +  Y +   YK    ++ +L  +R       Y  +++AY   G  ER   +  E+++           +Y    E L K  G   E+++ F+ M++
Subjt:  FEAAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAK-SGRVFESLEFFRDMRK

Query:  KGIAEDYTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYD-AAVKV
                 Y+ +I  +    +  ++  LY E ++ +   +   +  L+  + ++G  EKA EI E +++  +     ++ A++  Y +R GY   A ++
Subjt:  KGIAEDYTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYD-AAVKV

Query:  YEKLIEEGCEPGQVTYASAINAYCRVGLYSKAEDIFGEME-----------------------------------EKGFDKCVVAYSSLILMYGKTGRLK
        +  +   GCEP + +Y   ++AY R GL+S AE +F EM+                                   E G +      +S++ +YG+ G+  
Subjt:  YEKLIEEGCEPGQVTYASAINAYCRVGLYSKAEDIFGEME-----------------------------------EKGFDKCVVAYSSLILMYGKTGRLK

Query:  DAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSK
           ++LA+M+   C  ++  YNIL+ ++GKA  L+++E+L+ E+K K   PD V++TS I AY +   + KC + + E   +G   D   A +++   S 
Subjt:  DAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSK

Query:  TSRVDELVKLLRDM
          +V+++  +LR M
Subjt:  TSRVDELVKLLRDM

AT5G13770.1 Pentatricopeptide repeat (PPR-like) superfamily protein2.6e-15550.09Show/hide
Query:  NSKSHFIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERAGFRP
        N  S   PT     L   P     + H+ SS C S +LE+     P      D   F  P      DLN  L    ++P T  L  ++Y KAKE +  R 
Subjt:  NSKSHFIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVEDLNDFLCGLSQNPGTEDLIYDYYVKAKERAGFRP

Query:  EKSTLRHLI------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQRLK-SARIE
         K  + +L+             D  +    PD  TCS L+ SC++             F  D  +A++A +AAM+G+NKL MY STI VF RLK S  +E
Subjt:  EKSTLRHLI------------RDFVDFGVSPDRDTCSRLVSSCVKG------------FERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQRLK-SARIE

Query:  ADSGCYCRVMEAYLKLGDSERVMELFNEVES-RISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLY
           GCYCR+MEA+ K+G++ +V+ELF E +S R+S     S  IY I+C SLAKSGR FE+LE   +M+ KGI E   +YS LI  FA  +EV + E L+
Subjt:  ADSGCYCRVMEAYLKLGDSERVMELFNEVES-RISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLY

Query:  NEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGLYSK
         EA  KKLL+DP M LK++LMY+++G++E  LE+V  M+  ++ V+DCI CAIVNG++ +RG+  AVKVYE  ++E CE GQVTYA AINAYCR+  Y+K
Subjt:  NEAKAKKLLRDPAMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGLYSK

Query:  AEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKA
        AE +F EM +KGFDKCVVAYS+++ MYGKT RL DA+RL+AKMK++GC+PN+WIYN L++MHG+A +L++ EK+WKEMKR K+ PDKVSYTS+ISAY ++
Subjt:  AEDIFGEMEEKGFDKCVVAYSSLILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKA

Query:  SEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHY
         E E+C + Y+EFRMN G ID+A+AGIMVGVFSKTSR+DEL++LL+DMK+EGTRLD RLY SALNAL DAGL  Q +WLQ+ +
Subjt:  SEFEKCEQYYREFRMNGGTIDKAIAGIMVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHY

AT5G42310.1 Pentatricopeptide repeat (PPR-like) superfamily protein2.6e-3524.04Show/hide
Query:  DFVDFGVSPDRDTCSRLVSSCV-----KGFERDS-DVAMTAFEAAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVME---LF
        DFV++ +     T S  + S +     K  ERD  ++ +      + G+ K       + +    ++  + A +     ++ A   L DS R +E   LF
Subjt:  DFVDFGVSPDRDTCSRLVSSCV-----KGFERDS-DVAMTAFEAAMRGYNKLHMYKSTIMVFQRLKSARIEADSGCYCRVMEAYLKLGDSERVME---LF

Query:  NEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGS
         E+  R S   P  T+ Y  L +   K+G + ++     +M K+G++ D   YS LI  + +    + A  +  E +A  +  +  +F +L+  +  +G 
Subjt:  NEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDPAMFLKLILMYIQQGS

Query:  LEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMY
         +K  ++++ MK   +      +  +++ +      D A+  +++++ EG EP +VT+ + I+ +C+ G +  AE++F  ME +G   C   Y+ +I  Y
Subjt:  LEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGFDKCVVAYSSLILMY

Query:  GKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAIAGI
        G   R  D  RLL KMK +G  PNV  +  L++++GK+       +  +EMK   + P    Y ++I+AY +    E+    +R    +G          
Subjt:  GKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAIAGI

Query:  MVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALM
        ++  F +  R  E   +L+ MK  G + D   Y + + AL+
Subjt:  MVGVFSKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGTTGGAGAAGCAAAAGCATTATATATTATATACCACACACAAGCTTGGCGGTTTTCATGGAGGAAACACAAATGCTTACATGGAAAATCCCTTAACCCTTCTGTC
CAAAAGCTCCCCACTTGGCACAAATTCAAAATCCCACTTCATCCCCACCTCTAATTTCGCTCTCCTTTTCTCTCTTCCCACTTCAAATCTTCGATCCCTTCATCTAAACT
CCTCCGGTTGCCCTTCCCCAATCTTAGAACAATCCTCCATCGCCTTACCCGACATCCATTTAAATTCCGATCTTCAAGATTTTCAACTTCCCTCGTTGTCCAATGTTGAA
GATTTGAACGATTTCTTATGTGGGTTGTCGCAAAACCCCGGAACCGAGGATTTGATCTATGACTATTATGTGAAAGCGAAGGAGAGGGCTGGGTTTCGACCTGAGAAATC
GACATTGCGGCATCTAATCAGGGACTTTGTGGATTTTGGTGTTTCCCCTGATAGAGATACTTGTTCTAGATTGGTTAGTAGTTGTGTCAAAGGTTTTGAAAGGGATAGTG
ATGTTGCTATGACTGCTTTTGAAGCTGCCATGAGAGGCTACAATAAGCTTCACATGTACAAAAGCACTATCATGGTTTTCCAGCGGTTGAAATCTGCAAGAATTGAAGCA
GATTCTGGATGTTATTGTAGGGTCATGGAAGCCTATCTTAAACTTGGGGATTCTGAGAGAGTTATGGAGCTGTTTAATGAAGTGGAGAGTAGGATTTCGGATTTTACGCC
TTTTTCGACCAAGATTTATGGGATACTTTGCGAGTCCTTAGCGAAGTCGGGGCGGGTTTTTGAGTCGCTTGAGTTCTTTAGAGATATGAGGAAGAAAGGGATTGCAGAAG
ACTACACTATTTACTCTGCTTTGATATGTACTTTTGCTAGCATCCAGGAAGTTAAATTAGCTGAAGATCTTTACAATGAAGCAAAAGCCAAGAAGTTGTTGAGAGACCCT
GCGATGTTTCTAAAGCTCATATTGATGTACATTCAACAAGGATCATTAGAGAAGGCACTTGAGATTGTTGAAGTAATGAAAGACTTTAAAATTGGAGTCTCTGACTGTAT
TTTTTGTGCAATTGTCAATGGTTACGCCACGAGAAGGGGCTACGACGCTGCGGTTAAAGTTTATGAGAAGTTGATCGAAGAGGGGTGTGAGCCAGGACAAGTGACGTACG
CCTCAGCAATCAACGCCTACTGCCGTGTCGGACTCTACTCGAAAGCAGAGGACATTTTTGGAGAAATGGAGGAGAAAGGATTTGATAAATGTGTAGTAGCTTACTCCAGC
TTGATATTAATGTATGGAAAAACAGGGAGATTAAAGGATGCAATGAGGCTATTAGCAAAGATGAAAGAAAAAGGGTGTCAGCCAAATGTTTGGATATACAACATATTGAT
GGAGATGCATGGGAAGGCTAAGAATTTGAAGCAAGTTGAGAAGTTATGGAAGGAAATGAAGCGCAAAAAGATAGCACCTGATAAGGTTAGCTATACAAGTATCATAAGTG
CTTATGTCAAGGCATCAGAATTTGAGAAGTGCGAGCAATATTACCGGGAGTTTCGGATGAATGGGGGCACCATCGATAAGGCAATTGCGGGGATCATGGTTGGGGTGTTC
TCGAAGACGAGTCGGGTTGATGAGCTGGTGAAGCTTCTCAGGGACATGAAGTTAGAAGGAACAAGGTTGGATGGGAGGCTTTATAAGTCAGCATTGAATGCTTTGATGGA
TGCTGGGTTGCAAGTACAAGCAAAATGGTTGCAAGATCATTATGCTGGAAAATCAGGCTTTGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGTTGGAGAAGCAAAAGCATTATATATTATATACCACACACAAGCTTGGCGGTTTTCATGGAGGAAACACAAATGCTTACATGGAAAATCCCTTAACCCTTCTGTC
CAAAAGCTCCCCACTTGGCACAAATTCAAAATCCCACTTCATCCCCACCTCTAATTTCGCTCTCCTTTTCTCTCTTCCCACTTCAAATCTTCGATCCCTTCATCTAAACT
CCTCCGGTTGCCCTTCCCCAATCTTAGAACAATCCTCCATCGCCTTACCCGACATCCATTTAAATTCCGATCTTCAAGATTTTCAACTTCCCTCGTTGTCCAATGTTGAA
GATTTGAACGATTTCTTATGTGGGTTGTCGCAAAACCCCGGAACCGAGGATTTGATCTATGACTATTATGTGAAAGCGAAGGAGAGGGCTGGGTTTCGACCTGAGAAATC
GACATTGCGGCATCTAATCAGGGACTTTGTGGATTTTGGTGTTTCCCCTGATAGAGATACTTGTTCTAGATTGGTTAGTAGTTGTGTCAAAGGTTTTGAAAGGGATAGTG
ATGTTGCTATGACTGCTTTTGAAGCTGCCATGAGAGGCTACAATAAGCTTCACATGTACAAAAGCACTATCATGGTTTTCCAGCGGTTGAAATCTGCAAGAATTGAAGCA
GATTCTGGATGTTATTGTAGGGTCATGGAAGCCTATCTTAAACTTGGGGATTCTGAGAGAGTTATGGAGCTGTTTAATGAAGTGGAGAGTAGGATTTCGGATTTTACGCC
TTTTTCGACCAAGATTTATGGGATACTTTGCGAGTCCTTAGCGAAGTCGGGGCGGGTTTTTGAGTCGCTTGAGTTCTTTAGAGATATGAGGAAGAAAGGGATTGCAGAAG
ACTACACTATTTACTCTGCTTTGATATGTACTTTTGCTAGCATCCAGGAAGTTAAATTAGCTGAAGATCTTTACAATGAAGCAAAAGCCAAGAAGTTGTTGAGAGACCCT
GCGATGTTTCTAAAGCTCATATTGATGTACATTCAACAAGGATCATTAGAGAAGGCACTTGAGATTGTTGAAGTAATGAAAGACTTTAAAATTGGAGTCTCTGACTGTAT
TTTTTGTGCAATTGTCAATGGTTACGCCACGAGAAGGGGCTACGACGCTGCGGTTAAAGTTTATGAGAAGTTGATCGAAGAGGGGTGTGAGCCAGGACAAGTGACGTACG
CCTCAGCAATCAACGCCTACTGCCGTGTCGGACTCTACTCGAAAGCAGAGGACATTTTTGGAGAAATGGAGGAGAAAGGATTTGATAAATGTGTAGTAGCTTACTCCAGC
TTGATATTAATGTATGGAAAAACAGGGAGATTAAAGGATGCAATGAGGCTATTAGCAAAGATGAAAGAAAAAGGGTGTCAGCCAAATGTTTGGATATACAACATATTGAT
GGAGATGCATGGGAAGGCTAAGAATTTGAAGCAAGTTGAGAAGTTATGGAAGGAAATGAAGCGCAAAAAGATAGCACCTGATAAGGTTAGCTATACAAGTATCATAAGTG
CTTATGTCAAGGCATCAGAATTTGAGAAGTGCGAGCAATATTACCGGGAGTTTCGGATGAATGGGGGCACCATCGATAAGGCAATTGCGGGGATCATGGTTGGGGTGTTC
TCGAAGACGAGTCGGGTTGATGAGCTGGTGAAGCTTCTCAGGGACATGAAGTTAGAAGGAACAAGGTTGGATGGGAGGCTTTATAAGTCAGCATTGAATGCTTTGATGGA
TGCTGGGTTGCAAGTACAAGCAAAATGGTTGCAAGATCATTATGCTGGAAAATCAGGCTTTGTTTAA
Protein sequenceShow/hide protein sequence
MKLEKQKHYILYTTHKLGGFHGGNTNAYMENPLTLLSKSSPLGTNSKSHFIPTSNFALLFSLPTSNLRSLHLNSSGCPSPILEQSSIALPDIHLNSDLQDFQLPSLSNVE
DLNDFLCGLSQNPGTEDLIYDYYVKAKERAGFRPEKSTLRHLIRDFVDFGVSPDRDTCSRLVSSCVKGFERDSDVAMTAFEAAMRGYNKLHMYKSTIMVFQRLKSARIEA
DSGCYCRVMEAYLKLGDSERVMELFNEVESRISDFTPFSTKIYGILCESLAKSGRVFESLEFFRDMRKKGIAEDYTIYSALICTFASIQEVKLAEDLYNEAKAKKLLRDP
AMFLKLILMYIQQGSLEKALEIVEVMKDFKIGVSDCIFCAIVNGYATRRGYDAAVKVYEKLIEEGCEPGQVTYASAINAYCRVGLYSKAEDIFGEMEEKGFDKCVVAYSS
LILMYGKTGRLKDAMRLLAKMKEKGCQPNVWIYNILMEMHGKAKNLKQVEKLWKEMKRKKIAPDKVSYTSIISAYVKASEFEKCEQYYREFRMNGGTIDKAIAGIMVGVF
SKTSRVDELVKLLRDMKLEGTRLDGRLYKSALNALMDAGLQVQAKWLQDHYAGKSGFV