; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi11G000050 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi11G000050
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr11:33331..40551
RNA-Seq ExpressionLsi11G000050
SyntenyLsi11G000050
Gene Ontology termsGO:0009846 - pollen germination (biological process)
GO:0010208 - pollen wall assembly (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0048655 - anther wall tapetum morphogenesis (biological process)
GO:0055046 - microgametogenesis (biological process)
GO:0071367 - cellular response to brassinosteroid stimulus (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011011 - Zinc finger, FYVE/PHD-type
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588362.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0088.21Show/hide
Query:  SNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELF
        SN  +SP  DP+KLLKVAADAKNL FGR IHAHLIITN    DC+VNQ+NSLINLYVKCDE+ +ARQMFD MS+RNVVSW ALMAGYMQNGSPLEVFELF
Subjt:  SNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELF

Query:  KKMVVKDNIFPNEYVIATVISSC-DSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVE
        KKM+VKDNIFPNEYVIATVISSC DSQMYVEG+QCHG++LKSGLELHQYVKNALIQ+YSKCSDVRAAL+IL TVPGYDIFCYNLV++GLL+H+H  EA+E
Subjt:  KKMVVKDNIFPNEYVIATVISSC-DSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVE

Query:  VLKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEAL
        VLKLMI EG +WNNAT+VTIFR+CASLKDLK GK VHA+MLKSDIDYDVYIGSSIIDMYGKCGNVLSGRA FDQLQ++NVVSWT+IMAA+FQNGFFEEAL
Subjt:  VLKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEAL

Query:  DLFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKET
        +LFS MEIDHIPPNEYTLAVLLNSAAGLSALS GDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNM CCDSITWNAIITGHSHH +GKE 
Subjt:  DLFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKET

Query:  LSMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNY
        L++F DMLT RECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFM+SN INWDVV+WRTLLNACYVHRNY
Subjt:  LSMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNY

Query:  DKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDI
        DKGKQIAEYLLQ++ EDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPES+ IY+ VRDLL+KIRPLGY+PDI
Subjt:  DKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDI

Query:  ASVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW
        A VLHDIEDEQK++NLSYHSEKLAVAYGLMK+PSGAPIRVIKNLRMCDDCHTA+KLISK+ANRTIIVRDANRFHHFQDG CSCGDYW
Subjt:  ASVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW

XP_011655117.1 pentatricopeptide repeat-containing protein At5g39680 isoform X1 [Cucumis sativus]0.0e+0087.39Show/hide
Query:  FVSFFTRQRRTWSNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYM
        F  F +R     SNF +SPHQDPIKLLKVAADAKNL FGR IHAHL ITNH +RD KVNQLNSLINLYVKCDEV +AR++FDSM RRNVVSWSALMAGYM
Subjt:  FVSFFTRQRRTWSNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYM

Query:  QNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGL
        QNG+PLEVFELFKKMVVKDNIFPNEYVIAT ISSCDSQMYVEGKQCHGYALKSGLE HQYVKNALIQLYSKCSDV AA+QILYTVPG DIFCYNLV++GL
Subjt:  QNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGL

Query:  LQHTHTGEAVEVLKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAA
        LQHTH  EAV+VLKL+ISEGIEWNNATYVTIFRLCASLKD+ LGKQVHAQMLKSDID DVYIGSSIIDMYGKCGNVLSGR  FD+LQS+NVVSWTSI+AA
Subjt:  LQHTHTGEAVEVLKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAA

Query:  HFQNGFFEEALDLFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIIT
        +FQN FFEEAL+LFS MEID IPPNEYT+AVL NSAAGLSAL LGDQLHARAEKSGLKGNV+VGNALIIMY KSGDILAAQ VFSNM CC+ ITWNAIIT
Subjt:  HFQNGFFEEALDLFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIIT

Query:  GHSHHGLGKETLSMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRT
        GHSHHGLGKE LSMFQDM+ T E PNYVTFIGV+ ACAHL LVDEG YYFNHLMKQF IVPGLEHYTCIVGLLSRSGRLDEAENFM+S+QINWDVVSWRT
Subjt:  GHSHHGLGKETLSMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRT

Query:  LLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLS
        LLNACYVH++YDKG++IAEYLLQLEP DVGTYILLSNMHARVRRWD VV+IRKLMRERNVKKEPGVSWLEIRN+AHVFTSED KHPE+NLIY+ V+DLLS
Subjt:  LLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLS

Query:  KIRPLGYVPDIASVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW
        KIRPLGYVPDI +VLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPI VIKNLRMCDDCHTA+KLISKVANR I+VRDANRFHHFQ+GCCSCGDYW
Subjt:  KIRPLGYVPDIASVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW

XP_022933883.1 pentatricopeptide repeat-containing protein At5g39680 [Cucurbita moschata]0.0e+0088.21Show/hide
Query:  SNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELF
        SN  +SP  DP+KLLKVAADAKNL FGR IHAHLIITN    DC+VNQ+NSLINLYVKCDE+ +ARQMFD MS+RNVVSW ALMAGYMQNGSPLEVFELF
Subjt:  SNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELF

Query:  KKMVVKDNIFPNEYVIATVISSC-DSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVE
        KKM+VKDNIFPNEYVIATVISSC DSQMYVEG+QCHG++LKSGLELHQYVKNALIQ+YSKCSDVRAAL+IL TVPGYD+FCYNLV++GLL+H+H  EA+E
Subjt:  KKMVVKDNIFPNEYVIATVISSC-DSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVE

Query:  VLKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEAL
        VLKLMI EG +WNNAT+VTIFR+CASLKDLK GK VHA+MLKSDIDYDVYIGSSIIDMYGKCGNVLSGRA FDQLQ++NVVSWT+IMAA+FQNGFFEEAL
Subjt:  VLKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEAL

Query:  DLFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKET
        +LFS MEIDHIPPNEYTLAVLLNSAAGLSALS GDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQ VFSNM CCDSITWNAIITGHSHH +GKE 
Subjt:  DLFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKET

Query:  LSMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNY
        L++F DMLT RECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFM+SN INWDVV+WRTLLNACYVHRNY
Subjt:  LSMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNY

Query:  DKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDI
        DKGKQIAEYLLQ++ EDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPES+ IY+ VRDLL+KIRPLGYVPDI
Subjt:  DKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDI

Query:  ASVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW
        A VLHDIEDEQK+DNLSYHSEKLAVAYGLMK PSGAPIRVIKNLRMCDDCHTA+KLISK+ANRTIIVRDANRFHHFQDG CSCGDYW
Subjt:  ASVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW

XP_023002421.1 pentatricopeptide repeat-containing protein At5g39680 [Cucurbita maxima]0.0e+0088.21Show/hide
Query:  SNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELF
        SN  +SP  DP+KLLKVAADAKNL FGR+IHAHL+ITN   RDC+VNQ+NSLINLYVKCDE+ +ARQMFD MS+RNVVSW ALMAGYMQNGSPL+VFELF
Subjt:  SNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELF

Query:  KKMVVKDNIFPNEYVIATVISSC-DSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVE
        KKM+VKDNIFPNEYVIATVISSC DSQMYVEGKQCHG++LKSGLELHQYVKNALIQ+YSKCSDVRAAL+IL TVPGYD+FCYNLV++GLL+H+H GEA+E
Subjt:  KKMVVKDNIFPNEYVIATVISSC-DSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVE

Query:  VLKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEAL
        VLKLMI EG +WNNAT+VTIFR+CASLKDLKLGK VHA+MLKSDID DVYIGSSIIDMYGKCGNVLSGRA FDQLQ++NVVSWT+IMAA+FQNGFFEEAL
Subjt:  VLKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEAL

Query:  DLFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKET
        +LFS MEIDHIPPNEYTLAVLLNSAAGLSALS GDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQ VFSNM CCDSITWNAIITGHSHH +GKE 
Subjt:  DLFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKET

Query:  LSMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNY
        L++F DMLT RECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQ GIVPGLEHYTCIVGLLSRSGRLDEAENFM+SN INWDVV+WRTLLNACYVHRNY
Subjt:  LSMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNY

Query:  DKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDI
        DKGKQIAEYLLQ++ EDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPES+ IY+ +RDLL+KIRPLGYVPDI
Subjt:  DKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDI

Query:  ASVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW
        A VLHDIEDEQK+DNLSYHSEKLAVAYGLMK+PSGAPIRVIKNLRMCDDCHTA+KLISKVANRTIIVRDANRFHHFQDG CSCGDYW
Subjt:  ASVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW

XP_038892172.1 pentatricopeptide repeat-containing protein At5g39680 [Benincasa hispida]0.0e+0092.27Show/hide
Query:  SNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELF
        SNFF+SPHQ+PIKLLKVAADAKNL FGRI+HAHLIITNH+H DCKVNQLNSLINLYVKCDEV +ARQMFDSM RRNV+SWS LMAGYMQNGSPLEVFEL 
Subjt:  SNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELF

Query:  KKMVVKDNIFPNEYVIATVISSCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVEV
        KKMVVKDNI PNEYVIATVISSCDSQMY+EGKQCHGY  KSGLELHQYVKN LIQ+YSKCSDVRAA+QILYTVPGYDIFCYNLVMS LLQHTHTGEAVEV
Subjt:  KKMVVKDNIFPNEYVIATVISSCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVEV

Query:  LKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEALD
        LKLMISEGIEWNNAT+VTIFRLCASLKD+KLGKQVHAQMLKSDIDYDVYIGSS+IDMYGKCGNV SGRA FDQLQS+NVVSWTSIMAA+FQNGFFEEALD
Subjt:  LKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEALD

Query:  LFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKETL
        LFS ME+DHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLK NVIVGNALIIMYSKSGDILAA+ VFSNM CCD+ITWNA+ITGHSHHGLG   L
Subjt:  LFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKETL

Query:  SMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYD
        SMFQDML TRE PNYVTFIGVLSACAHLS VDEG YYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAE FM SNQINWDVVSWRTLLNACYVHRNYD
Subjt:  SMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYD

Query:  KGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDIA
        KGKQIAE LLQLEP+DVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRN+AHVFTSED KHPESNLIY KV+DLLSKIRPLGYVPDIA
Subjt:  KGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDIA

Query:  SVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW
        SVLHDIEDEQKVDNLSYHSEKLA+AYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANR IIVRDANRFHHFQDG CSCGDYW
Subjt:  SVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW

TrEMBL top hitse value%identityAlignment
A0A0A0KR26 DYW_deaminase domain-containing protein0.0e+0087.39Show/hide
Query:  FVSFFTRQRRTWSNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYM
        F  F +R     SNF +SPHQDPIKLLKVAADAKNL FGR IHAHL ITNH +RD KVNQLNSLINLYVKCDEV +AR++FDSM RRNVVSWSALMAGYM
Subjt:  FVSFFTRQRRTWSNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYM

Query:  QNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGL
        QNG+PLEVFELFKKMVVKDNIFPNEYVIAT ISSCDSQMYVEGKQCHGYALKSGLE HQYVKNALIQLYSKCSDV AA+QILYTVPG DIFCYNLV++GL
Subjt:  QNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGL

Query:  LQHTHTGEAVEVLKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAA
        LQHTH  EAV+VLKL+ISEGIEWNNATYVTIFRLCASLKD+ LGKQVHAQMLKSDID DVYIGSSIIDMYGKCGNVLSGR  FD+LQS+NVVSWTSI+AA
Subjt:  LQHTHTGEAVEVLKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAA

Query:  HFQNGFFEEALDLFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIIT
        +FQN FFEEAL+LFS MEID IPPNEYT+AVL NSAAGLSAL LGDQLHARAEKSGLKGNV+VGNALIIMY KSGDILAAQ VFSNM CC+ ITWNAIIT
Subjt:  HFQNGFFEEALDLFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIIT

Query:  GHSHHGLGKETLSMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRT
        GHSHHGLGKE LSMFQDM+ T E PNYVTFIGV+ ACAHL LVDEG YYFNHLMKQF IVPGLEHYTCIVGLLSRSGRLDEAENFM+S+QINWDVVSWRT
Subjt:  GHSHHGLGKETLSMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRT

Query:  LLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLS
        LLNACYVH++YDKG++IAEYLLQLEP DVGTYILLSNMHARVRRWD VV+IRKLMRERNVKKEPGVSWLEIRN+AHVFTSED KHPE+NLIY+ V+DLLS
Subjt:  LLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLS

Query:  KIRPLGYVPDIASVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW
        KIRPLGYVPDI +VLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPI VIKNLRMCDDCHTA+KLISKVANR I+VRDANRFHHFQ+GCCSCGDYW
Subjt:  KIRPLGYVPDIASVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW

A0A1S4E243 pentatricopeptide repeat-containing protein At5g39680 isoform X20.0e+0086.68Show/hide
Query:  FVSFFTRQRRTWSNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYM
        F  F +R     S+FF+SPHQDPIKLLKVAADAKNL FGR I AHL ITNH +RD KVNQLNSLINLYVKC EV +AR++FDSM RRNVVSWS LMAGYM
Subjt:  FVSFFTRQRRTWSNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYM

Query:  QNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGL
        QNG+P EVFELFKKMV+KDNI PN+YVIATVISSC+SQMYVEGKQCHGYALKSGLE HQYVKNALIQLYSKCSDV AA+QILYTVPG DIFCYNLV++GL
Subjt:  QNGSPLEVFELFKKMVVKDNIFPNEYVIATVISSCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGL

Query:  LQHTHTGEAVEVLKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAA
        LQHTH  EAV+VLKL+IS+GIEWN+ATYVTIFRLCASLKD+ LGKQVHAQMLKSDID DVYIGSSIIDMYGKCGNVLSGR  FD+LQS+NVVSWTSIMAA
Subjt:  LQHTHTGEAVEVLKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAA

Query:  HFQNGFFEEALDLFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIIT
        +FQN FFEEALDLFS MEID IPPNEYT+AVL NSAAGLSAL LGDQLHARAEKSGLKGNV+VGNALIIMY KSGDILAAQRVFSNM CCD ITWNAIIT
Subjt:  HFQNGFFEEALDLFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIIT

Query:  GHSHHGLGKETLSMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRT
        GHSHHGLGKE LSMFQDM+TT E PNYVTFIGV+SACAHL LVDEG YYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFM+S+QINWDVVSWRT
Subjt:  GHSHHGLGKETLSMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRT

Query:  LLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLS
        LLNACYVH++YDKGKQIAEYLLQLEP DVGTYILLSNMHARVRRWD VV+IRKLMRERNVKKEPGVSWLEIRN+AHVFTSED KHP++NLIY+ V++LLS
Subjt:  LLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLS

Query:  KIRPLGYVPDIASVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW
        KIRPLGYVPDI +VLHDIEDEQKV+NLSYHSEKLAVAYGLMKT SG PIRVIKNLRMCDDCHTA+KLIS+VANR IIVRD NRFHHFQ+GCCSCGDYW
Subjt:  KIRPLGYVPDIASVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW

A0A6J1CRJ5 pentatricopeptide repeat-containing protein At5g396800.0e+0088.21Show/hide
Query:  SNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELF
        SN+F+SP Q+PIKLLK+AADAKNL FGRIIHAHLIITNHT  DC+VNQ+NSLIN Y KCDE+LVARQMFD M +RNVVSWSALMAGYMQNGS LEVF L 
Subjt:  SNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELF

Query:  KKMVVKDNIFPNEYVIATVISS-CDSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVE
        KKMVV+D+I PNEYVIAT++SS C SQMYVEGKQCHGYALKSGLELHQYVKNALIQ+YSKCSDVRAA+QIL TVPGYDIFCYNLV++GLL+H+H  EA+E
Subjt:  KKMVVKDNIFPNEYVIATVISS-CDSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVE

Query:  VLKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEAL
        VL LMI E IEWNNATYVTIFRLCASLKDL+LGKQVHAQML++DIDYDVYIGSSIIDMYGKCG VLSGR  FD+LQSQNVVSWT+IMAA+FQNGFFEEAL
Subjt:  VLKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEAL

Query:  DLFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKET
        +LFS MEID IPPNEYTLAV LNSAAGLSALS GDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQ VFSNM CCDSITWNAIITGHSHHGLGKE 
Subjt:  DLFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKET

Query:  LSMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNY
        LSMFQDML T ECPNYVTFIGVLSACAHLSLV EG YYFNHLMKQFGIVPGLEHYTCI+GLLSRSG+LDEAENFM+SN INWDVV+WRTLL ACYVHRNY
Subjt:  LSMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNY

Query:  DKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDI
        DKGKQIAEYLLQ++PEDVG+YILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPES+ IY+KVRDLLS+I+PLGYVPDI
Subjt:  DKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDI

Query:  ASVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW
        A VLHDI+DEQK+DNLSYHSEKLAVAYGLMKTP GAPIRVIKNLRMCDDCHTAVKLISKVANR IIVRDANRFHHF+DGCCSCGDYW
Subjt:  ASVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW

A0A6J1F637 pentatricopeptide repeat-containing protein At5g396800.0e+0088.21Show/hide
Query:  SNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELF
        SN  +SP  DP+KLLKVAADAKNL FGR IHAHLIITN    DC+VNQ+NSLINLYVKCDE+ +ARQMFD MS+RNVVSW ALMAGYMQNGSPLEVFELF
Subjt:  SNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELF

Query:  KKMVVKDNIFPNEYVIATVISSC-DSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVE
        KKM+VKDNIFPNEYVIATVISSC DSQMYVEG+QCHG++LKSGLELHQYVKNALIQ+YSKCSDVRAAL+IL TVPGYD+FCYNLV++GLL+H+H  EA+E
Subjt:  KKMVVKDNIFPNEYVIATVISSC-DSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVE

Query:  VLKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEAL
        VLKLMI EG +WNNAT+VTIFR+CASLKDLK GK VHA+MLKSDIDYDVYIGSSIIDMYGKCGNVLSGRA FDQLQ++NVVSWT+IMAA+FQNGFFEEAL
Subjt:  VLKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEAL

Query:  DLFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKET
        +LFS MEIDHIPPNEYTLAVLLNSAAGLSALS GDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQ VFSNM CCDSITWNAIITGHSHH +GKE 
Subjt:  DLFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKET

Query:  LSMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNY
        L++F DMLT RECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFM+SN INWDVV+WRTLLNACYVHRNY
Subjt:  LSMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNY

Query:  DKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDI
        DKGKQIAEYLLQ++ EDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPES+ IY+ VRDLL+KIRPLGYVPDI
Subjt:  DKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDI

Query:  ASVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW
        A VLHDIEDEQK+DNLSYHSEKLAVAYGLMK PSGAPIRVIKNLRMCDDCHTA+KLISK+ANRTIIVRDANRFHHFQDG CSCGDYW
Subjt:  ASVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW

A0A6J1KL92 pentatricopeptide repeat-containing protein At5g396800.0e+0088.21Show/hide
Query:  SNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELF
        SN  +SP  DP+KLLKVAADAKNL FGR+IHAHL+ITN   RDC+VNQ+NSLINLYVKCDE+ +ARQMFD MS+RNVVSW ALMAGYMQNGSPL+VFELF
Subjt:  SNFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELF

Query:  KKMVVKDNIFPNEYVIATVISSC-DSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVE
        KKM+VKDNIFPNEYVIATVISSC DSQMYVEGKQCHG++LKSGLELHQYVKNALIQ+YSKCSDVRAAL+IL TVPGYD+FCYNLV++GLL+H+H GEA+E
Subjt:  KKMVVKDNIFPNEYVIATVISSC-DSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVE

Query:  VLKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEAL
        VLKLMI EG +WNNAT+VTIFR+CASLKDLKLGK VHA+MLKSDID DVYIGSSIIDMYGKCGNVLSGRA FDQLQ++NVVSWT+IMAA+FQNGFFEEAL
Subjt:  VLKLMISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEAL

Query:  DLFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKET
        +LFS MEIDHIPPNEYTLAVLLNSAAGLSALS GDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQ VFSNM CCDSITWNAIITGHSHH +GKE 
Subjt:  DLFSNMEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKET

Query:  LSMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNY
        L++F DMLT RECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQ GIVPGLEHYTCIVGLLSRSGRLDEAENFM+SN INWDVV+WRTLLNACYVHRNY
Subjt:  LSMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNY

Query:  DKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDI
        DKGKQIAEYLLQ++ EDVG+YILLSNMHARVRRWDGVVK+RKLMRERNVKKEPGVSWLEIRNIAHVFTSED KHPES+ IY+ +RDLL+KIRPLGYVPDI
Subjt:  DKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDI

Query:  ASVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW
        A VLHDIEDEQK+DNLSYHSEKLAVAYGLMK+PSGAPIRVIKNLRMCDDCHTA+KLISKVANRTIIVRDANRFHHFQDG CSCGDYW
Subjt:  ASVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW

SwissProt top hitse value%identityAlignment
Q67V61 PHD finger protein PERSISTENT TAPETAL CELL 15.3e-14243.01Show/hide
Query:  GKKRRRGGGSGSGERVFRFKNFGENGHPAELVGQ--FRENVEALLEFGRWESGECGGGFRCWSFRLQVQRQPPFHVVLFVVEEAVETATVAARQCKQCQY
        G  RRR      GE +FRF+ F + G+PA   G   FR+NV  LL F   E+G   G  +CWSF+L++ R PP  V LFVVEE  E A    RQC  C++
Subjt:  GKKRRRGGGSGSGERVFRFKNFGENGHPAELVGQ--FRENVEALLEFGRWESGECGGGFRCWSFRLQVQRQPPFHVVLFVVEEAVETATVAARQCKQCQY

Query:  VGWGQHMICNKKYHFVLPSKETMAAA--------------SSNSKSMLSLRLMELQGHILHGVFHSNGFAHLLSLNGLEKGNS-LPGYLLMDFWDRLCNA
        +GWG+H+IC+K+YHF+LP +E+ A A              +  + S  +      +GH+LHGV H NG+ HL++L+GLE G+  + G+ +MD WDR+C+A
Subjt:  VGWGQHMICNKKYHFVLPSKETMAAA--------------SSNSKSMLSLRLMELQGHILHGVFHSNGFAHLLSLNGLEKGNS-LPGYLLMDFWDRLCNA

Query:  LKPRKVSLRDIAQKKGMELRLLHSIAHGQSWFSQWGYKFSHGAFKVTIQMYQKAIETLQTLPLSLILHHFASTHPQIPFIFSKYQSLSNHPLLTLQDLLH
        L  R VSL D A+K  MELRLLH +A+G++WF +WGY++   ++ V +  Y++++  L ++PL +++ H +    ++P + +KYQ++S H LL+L DLL 
Subjt:  LKPRKVSLRDIAQKKGMELRLLHSIAHGQSWFSQWGYKFSHGAFKVTIQMYQKAIETLQTLPLSLILHHFASTHPQIPFIFSKYQSLSNHPLLTLQDLLH

Query:  FMLNLKS--PLHTQNSIEYSHTLAGFLLETSCRWSPKRVEMAVRVIIEALKRAE--FRWVSRQEVRDAARIYIGDTGLLDFVLKSLGNHVVGNYLVRRTL
        FML L++  P  +  +++Y     G + E SCRWS KRV+MA R +++AL+RAE   RWV+RQEVRDAAR YIGDTGLLDFVLKSLGNH+VGNY+VRRT+
Subjt:  FMLNLKS--PLHTQNSIEYSHTLAGFLLETSCRWSPKRVEMAVRVIIEALKRAE--FRWVSRQEVRDAARIYIGDTGLLDFVLKSLGNHVVGNYLVRRTL

Query:  NPVTKVLEYCLQDVCSSV-------------QMKPRCKVGRAELAKDISNLYKYILGDQKPLQAKGILPTLTMTSQIILDSKFLIKEYVGEPQAKLKVE-
        NPVTKVLEYCL+DV S +             +M+ R ++ RA+L +D+ +LY+++L +       G    + +  +++LD K  +K+Y  E QA      
Subjt:  NPVTKVLEYCLQDVCSSV-------------QMKPRCKVGRAELAKDISNLYKYILGDQKPLQAKGILPTLTMTSQIILDSKFLIKEYVGEPQAKLKVE-

Query:  ----NEDELNLLCTIRVKREEATITLPYECIALKKNGTIDELKEEVERKFKEIYWGLRSFVVKRIVDLEAKGRDL-MFELVEVGGKLVFEGS------GE
                +NL CT+ V      +  PYE + L  + T+ ELK E +R F E+Y GLRSF    +V + A    L +  LV+VG  +V +GS      GE
Subjt:  ----NEDELNLLCTIRVKREEATITLPYECIALKKNGTIDELKEEVERKFKEIYWGLRSFVVKRIVDLEAKGRDL-MFELVEVGGKLVFEGS------GE

Query:  DEGSFFFNNNNNNNNGGIMNKRIMECACGAVEDDGERMVCCDICEIWQHTR
        D              G    +R+++CACGAV+DDGERM CCDICE WQHTR
Subjt:  DEGSFFFNNNNNNNNGGIMNKRIMECACGAVEDDGERMVCCDICEIWQHTR

Q9FK93 Pentatricopeptide repeat-containing protein At5g396802.1e-21553.04Show/hide
Query:  KLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELFKKMVVKDNIFPN
        +LLKV A++  L  G  IHAHLI+TN + R     Q+NSLINLYVKC E + AR++FD M  RNVVSW A+M GY  +G   EV +LFK M       PN
Subjt:  KLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELFKKMVVKDNIFPN

Query:  EYVIATVISSC-DSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVEVLKLMISEGIEW
        E+V   V  SC +S    EGKQ HG  LK GL  H++V+N L+ +YS CS    A+++L  +P  D+  ++  +SG L+     E ++VL+   +E   W
Subjt:  EYVIATVISSC-DSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVEVLKLMISEGIEW

Query:  NNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEALDLFSNMEIDHIP
        NN TY++  RL ++L+DL L  QVH++M++   + +V    ++I+MYGKCG VL  +  FD   +QN+   T+IM A+FQ+  FEEAL+LFS M+   +P
Subjt:  NNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEALDLFSNMEIDHIP

Query:  PNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKETLSMFQDMLTTRE
        PNEYT A+LLNS A LS L  GD LH    KSG + +V+VGNAL+ MY+KSG I  A++ FS M   D +TWN +I+G SHHGLG+E L  F  M+ T E
Subjt:  PNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKETLSMFQDMLTTRE

Query:  CPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQ
         PN +TFIGVL AC+H+  V++GL+YFN LMK+F + P ++HYTCIVGLLS++G   +AE+FM++  I WDVV+WRTLLNACYV RNY  GK++AEY ++
Subjt:  CPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQ

Query:  LEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDIASVLHDIEDEQK
          P D G Y+LLSN+HA+ R W+GV K+R LM  R VKKEPGVSW+ IRN  HVF +ED +HPE  LIY KV++++SKI+PLGY PD+A   HD+++EQ+
Subjt:  LEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDIASVLHDIEDEQK

Query:  VDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW
         DNLSYHSEKLAVAYGL+KTP  +P+ V KN+R+CDDCH+A+KLISK++ R I++RD+NRFHHF DG CSC DYW
Subjt:  VDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW

Q9FMS5 PHD finger protein MALE STERILITY 13.7e-15949.08Show/hide
Query:  KKRRRGGGSGSGERVFRFKNFGENGHPAELVG-QFRENVEALLEFGRWESGECGGGFRCWSFRLQVQRQP-PFHVVLFVVEEAVETATVAARQCKQCQYV
        KKR+RG       RVFR K FGE+GHPAE+    FR+N+  LLEFG +ES    G    WSF+L++QR P P +V+LFVVEE +E +     +C  CQYV
Subjt:  KKRRRGGGSGSGERVFRFKNFGENGHPAELVG-QFRENVEALLEFGRWESGECGGGFRCWSFRLQVQRQP-PFHVVLFVVEEAVETATVAARQCKQCQYV

Query:  GWGQHMICNKKYHFVLPSKETMAA--------ASSNSKSMLSLRLMELQGHILHGVFHSNGFAHLLSLNGLEKGNSLPGYLLMDFWDRLCNALKPRKVSL
        GWG  MICNKKYHFV+PSKETMAA         +   K   S  L+ELQGH+LHG FHSNGF HLLSLNG+E G+ L G+ +MD WDRLC  LK RK+ L
Subjt:  GWGQHMICNKKYHFVLPSKETMAA--------ASSNSKSMLSLRLMELQGHILHGVFHSNGFAHLLSLNGLEKGNSLPGYLLMDFWDRLCNALKPRKVSL

Query:  RDIAQKKGMELRLLHSIAHGQSWFSQWGYKFSHGAFKVTIQMYQKAIETLQTLPLSLILHHFASTHPQIPFIFSKYQSLSNHPLLTLQDLLHFMLNLKSP
         D + KKGMELRLLH +A G+ WF +WGY+F  G + VT ++Y+KA+E+++ +PL L+ HH  S + + P + SKYQSLS  PL+TL DL  FML+L S 
Subjt:  RDIAQKKGMELRLLHSIAHGQSWFSQWGYKFSHGAFKVTIQMYQKAIETLQTLPLSLILHHFASTHPQIPFIFSKYQSLSNHPLLTLQDLLHFMLNLKSP

Query:  LHTQNSIEYSHTLAGFLLETSCRWSPKRVEMAVRVIIEALKRAEFRWVSRQEVRDAARIYIGDTGLLDFVLKSLGNHVVGNYLVRRTLNPVTKVLEYCLQ
        L   N +  S      +  T+CRWS KR++MA++V+IE+LKR E+RW+SRQEVRDAAR YIGDTGLLDFVLKSLGN VVGNYLVRR+LNPV KVLEY L+
Subjt:  LHTQNSIEYSHTLAGFLLETSCRWSPKRVEMAVRVIIEALKRAEFRWVSRQEVRDAARIYIGDTGLLDFVLKSLGNHVVGNYLVRRTLNPVTKVLEYCLQ

Query:  DVCS-------------------SVQMKPRCKVGRAELAKDISNLYKYILGDQKPLQAK--GILPTLTMTSQIILDSKFLIKE--YVGEPQAK-LKVENE
        D+ +                    +      K+ R ++ KD+   YK+IL D K +     GIL  + M S+ ILD+K+ IKE  Y+ +  AK L ++  
Subjt:  DVCS-------------------SVQMKPRCKVGRAELAKDISNLYKYILGDQKPLQAK--GILPTLTMTSQIILDSKFLIKE--YVGEPQAK-LKVENE

Query:  DELNLLCTIRVKREEATITL---PYECIALKKNGTIDELKEEVERKFKEIYWGLRSFVVKRIV--DLEAKGRDLMFELVEVGGK-LVFEGS----GEDEG
        +EL + CTI  K       +   P ECI +KK+ T+ E+  E ER F++IYW LR  VV+ +V   +E    D   E+   G K LV EG+       E 
Subjt:  DELNLLCTIRVKREEATITL---PYECIALKKNGTIDELKEEVERKFKEIYWGLRSFVVKRIV--DLEAKGRDLMFELVEVGGK-LVFEGS----GEDEG

Query:  SFFFNNNNNNNNGGIMNKRIMECACGAVEDDGERMVCCDICEIWQHTRAV
        +  + +++        +KRI EC CGA E+DGERMVCCDICE+WQHTR V
Subjt:  SFFFNNNNNNNNGGIMNKRIMECACGAVEDDGERMVCCDICEIWQHTRAV

Q9SMZ2 Pentatricopeptide repeat-containing protein At4g331705.2e-14539.14Show/hide
Query:  IKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELFKKMVVKDNIFP
        I +L  A    +L  G+ +H   +       D  +   NSLIN+Y K  +   AR +FD+MS R+++SW++++AG  QNG  +E   LF ++ ++  + P
Subjt:  IKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELFKKMVVKDNIFP

Query:  NEYVIATVISSCDS--QMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVEVLKLMISEGI
        ++Y + +V+ +  S  +     KQ H +A+K       +V  ALI  YS+   ++ A +IL+    +D+  +N +M+G  Q     + +++  LM  +G 
Subjt:  NEYVIATVISSCDS--QMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVEVLKLMISEGI

Query:  EWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEALDLFSNMEIDH
          ++ T  T+F+ C  L  +  GKQVHA  +KS  D D+++ S I+DMY KCG++ + +  FD +   + V+WT++++   +NG  E A  +FS M +  
Subjt:  EWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEALDLFSNMEIDH

Query:  IPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKETLSMFQDMLTT
        + P+E+T+A L  +++ L+AL  G Q+HA A K     +  VG +L+ MY+K G I  A  +F  +   +   WNA++ G + HG GKETL +F+ M + 
Subjt:  IPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKETLSMFQDMLTT

Query:  RECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYL
           P+ VTFIGVLSAC+H  LV E   +   +   +GI P +EHY+C+   L R+G + +AEN ++S  +      +RTLL AC V  + + GK++A  L
Subjt:  RECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYL

Query:  LQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDIASVLHDIEDE
        L+LEP D   Y+LLSNM+A   +WD +   R +M+   VKK+PG SW+E++N  H+F  +D  + ++ LIY+KV+D++  I+  GYVP+    L D+E+E
Subjt:  LQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDIASVLHDIEDE

Query:  QKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW
        +K   L YHSEKLAVA+GL+ TP   PIRVIKNLR+C DCH A+K I+KV NR I++RDANRFH F+DG CSCGDYW
Subjt:  QKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW

Q9SVP7 Pentatricopeptide repeat-containing protein At4g136501.2e-13836.51Show/hide
Query:  PHQDPIKLLKVAADAKNLNF-GRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELFKKMVV
        P  + +  L VA  A    F G+ +HA+         + K+    +L+NLY KC ++  A   F      NVV W+ ++  Y         F +F++M +
Subjt:  PHQDPIKLLKVAADAKNLNF-GRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELFKKMVV

Query:  KDNIFPNEYVIATVISSCDSQMYVE-GKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVEVLKLM
        ++ I PN+Y   +++ +C     +E G+Q H   +K+  +L+ YV + LI +Y+K   +  A  IL    G D+  +  +++G  Q+    +A+   + M
Subjt:  KDNIFPNEYVIATVISSCDSQMYVE-GKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVEVLKLM

Query:  ISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEALDLFSN
        +  GI  +          CA L+ LK G+Q+HAQ   S    D+   ++++ +Y +CG +      F+Q ++ + ++W ++++   Q+G  EEAL +F  
Subjt:  ISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEALDLFSN

Query:  MEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKETLSMFQ
        M  + I  N +T    + +A+  + +  G Q+HA   K+G      V NALI MY+K G I  A++ F  +   + ++WNAII  +S HG G E L  F 
Subjt:  MEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKETLSMFQ

Query:  DMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQ
         M+ +   PN+VT +GVLSAC+H+ LVD+G+ YF  +  ++G+ P  EHY C+V +L+R+G L  A+ F++   I  D + WRTLL+AC VH+N + G+ 
Subjt:  DMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQ

Query:  IAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDIASVLH
         A +LL+LEPED  TY+LLSN++A  ++WD     R+ M+E+ VKKEPG SW+E++N  H F   D  HP ++ I++  +DL  +   +GYV D  S+L+
Subjt:  IAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDIASVLH

Query:  DIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW
        +++ EQK   +  HSEKLA+++GL+  P+  PI V+KNLR+C+DCH  +K +SKV+NR IIVRDA RFHHF+ G CSC DYW
Subjt:  DIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW

Arabidopsis top hitse value%identityAlignment
AT4G13650.1 Pentatricopeptide repeat (PPR) superfamily protein8.7e-14036.51Show/hide
Query:  PHQDPIKLLKVAADAKNLNF-GRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELFKKMVV
        P  + +  L VA  A    F G+ +HA+         + K+    +L+NLY KC ++  A   F      NVV W+ ++  Y         F +F++M +
Subjt:  PHQDPIKLLKVAADAKNLNF-GRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELFKKMVV

Query:  KDNIFPNEYVIATVISSCDSQMYVE-GKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVEVLKLM
        ++ I PN+Y   +++ +C     +E G+Q H   +K+  +L+ YV + LI +Y+K   +  A  IL    G D+  +  +++G  Q+    +A+   + M
Subjt:  KDNIFPNEYVIATVISSCDSQMYVE-GKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVEVLKLM

Query:  ISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEALDLFSN
        +  GI  +          CA L+ LK G+Q+HAQ   S    D+   ++++ +Y +CG +      F+Q ++ + ++W ++++   Q+G  EEAL +F  
Subjt:  ISEGIEWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEALDLFSN

Query:  MEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKETLSMFQ
        M  + I  N +T    + +A+  + +  G Q+HA   K+G      V NALI MY+K G I  A++ F  +   + ++WNAII  +S HG G E L  F 
Subjt:  MEIDHIPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKETLSMFQ

Query:  DMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQ
         M+ +   PN+VT +GVLSAC+H+ LVD+G+ YF  +  ++G+ P  EHY C+V +L+R+G L  A+ F++   I  D + WRTLL+AC VH+N + G+ 
Subjt:  DMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQ

Query:  IAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDIASVLH
         A +LL+LEPED  TY+LLSN++A  ++WD     R+ M+E+ VKKEPG SW+E++N  H F   D  HP ++ I++  +DL  +   +GYV D  S+L+
Subjt:  IAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDIASVLH

Query:  DIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW
        +++ EQK   +  HSEKLA+++GL+  P+  PI V+KNLR+C+DCH  +K +SKV+NR IIVRDA RFHHF+ G CSC DYW
Subjt:  DIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW

AT4G33170.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.7e-14639.14Show/hide
Query:  IKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELFKKMVVKDNIFP
        I +L  A    +L  G+ +H   +       D  +   NSLIN+Y K  +   AR +FD+MS R+++SW++++AG  QNG  +E   LF ++ ++  + P
Subjt:  IKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELFKKMVVKDNIFP

Query:  NEYVIATVISSCDS--QMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVEVLKLMISEGI
        ++Y + +V+ +  S  +     KQ H +A+K       +V  ALI  YS+   ++ A +IL+    +D+  +N +M+G  Q     + +++  LM  +G 
Subjt:  NEYVIATVISSCDS--QMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVEVLKLMISEGI

Query:  EWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEALDLFSNMEIDH
          ++ T  T+F+ C  L  +  GKQVHA  +KS  D D+++ S I+DMY KCG++ + +  FD +   + V+WT++++   +NG  E A  +FS M +  
Subjt:  EWNNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEALDLFSNMEIDH

Query:  IPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKETLSMFQDMLTT
        + P+E+T+A L  +++ L+AL  G Q+HA A K     +  VG +L+ MY+K G I  A  +F  +   +   WNA++ G + HG GKETL +F+ M + 
Subjt:  IPPNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKETLSMFQDMLTT

Query:  RECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYL
           P+ VTFIGVLSAC+H  LV E   +   +   +GI P +EHY+C+   L R+G + +AEN ++S  +      +RTLL AC V  + + GK++A  L
Subjt:  RECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYL

Query:  LQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDIASVLHDIEDE
        L+LEP D   Y+LLSNM+A   +WD +   R +M+   VKK+PG SW+E++N  H+F  +D  + ++ LIY+KV+D++  I+  GYVP+    L D+E+E
Subjt:  LQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDIASVLHDIEDE

Query:  QKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW
        +K   L YHSEKLAVA+GL+ TP   PIRVIKNLR+C DCH A+K I+KV NR I++RDANRFH F+DG CSCGDYW
Subjt:  QKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW

AT5G09950.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.2e-13838.22Show/hide
Query:  ADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELFKKMVVKDNIFPNEYVIAT
        A+   L  GR +H H+I T     D  V   N L+N+Y KC  +  AR++F  M+ ++ VSW++++ G  QNG  +E  E +K M  + +I P  + + +
Subjt:  ADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELFKKMVVKDNIFPNEYVIAT

Query:  VISSCDSQMYVE-GKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHT-GEAVEVLKLMISEGIEWNNATY
         +SSC S  + + G+Q HG +LK G++L+  V NAL+ LY++   +    +I  ++P +D   +N ++  L +   +  EAV         G + N  T+
Subjt:  VISSCDSQMYVE-GKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHT-GEAVEVLKLMISEGIEWNNATY

Query:  VTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQL-QSQNVVSWTSIMAAHFQNGFFEEALDLFSNMEIDHIPPNEY
         ++    +SL   +LGKQ+H   LK++I  +    +++I  YGKCG +      F ++ + ++ V+W S+++ +  N    +ALDL   M       + +
Subjt:  VTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQL-QSQNVVSWTSIMAAHFQNGFFEEALDLFSNMEIDHIPPNEY

Query:  TLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKETLSMFQDM-LTTRECPN
          A +L++ A ++ L  G ++HA + ++ L+ +V+VG+AL+ MYSK G +  A R F+ MP  +S +WN++I+G++ HG G+E L +F+ M L  +  P+
Subjt:  TLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKETLSMFQDM-LTTRECPN

Query:  YVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNAC--YVHRNYDKGKQIAEYLLQL
        +VTF+GVLSAC+H  L++EG  +F  +   +G+ P +EH++C+  +L R+G LD+ E+F++   +  +V+ WRT+L AC     R  + GK+ AE L QL
Subjt:  YVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNAC--YVHRNYDKGKQIAEYLLQL

Query:  EPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDIASVLHDIEDEQKV
        EPE+   Y+LL NM+A   RW+ +VK RK M++ +VKKE G SW+ +++  H+F + D  HP++++IYKK+++L  K+R  GYVP     L+D+E E K 
Subjt:  EPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDIASVLHDIEDEQKV

Query:  DNLSYHSEKLAVAYGL-MKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW
        + LSYHSEKLAVA+ L  +  S  PIR++KNLR+C DCH+A K ISK+  R II+RD+NRFHHFQDG CSC D+W
Subjt:  DNLSYHSEKLAVAYGL-MKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW

AT5G22260.1 RING/FYVE/PHD zinc finger superfamily protein2.6e-16049.08Show/hide
Query:  KKRRRGGGSGSGERVFRFKNFGENGHPAELVG-QFRENVEALLEFGRWESGECGGGFRCWSFRLQVQRQP-PFHVVLFVVEEAVETATVAARQCKQCQYV
        KKR+RG       RVFR K FGE+GHPAE+    FR+N+  LLEFG +ES    G    WSF+L++QR P P +V+LFVVEE +E +     +C  CQYV
Subjt:  KKRRRGGGSGSGERVFRFKNFGENGHPAELVG-QFRENVEALLEFGRWESGECGGGFRCWSFRLQVQRQP-PFHVVLFVVEEAVETATVAARQCKQCQYV

Query:  GWGQHMICNKKYHFVLPSKETMAA--------ASSNSKSMLSLRLMELQGHILHGVFHSNGFAHLLSLNGLEKGNSLPGYLLMDFWDRLCNALKPRKVSL
        GWG  MICNKKYHFV+PSKETMAA         +   K   S  L+ELQGH+LHG FHSNGF HLLSLNG+E G+ L G+ +MD WDRLC  LK RK+ L
Subjt:  GWGQHMICNKKYHFVLPSKETMAA--------ASSNSKSMLSLRLMELQGHILHGVFHSNGFAHLLSLNGLEKGNSLPGYLLMDFWDRLCNALKPRKVSL

Query:  RDIAQKKGMELRLLHSIAHGQSWFSQWGYKFSHGAFKVTIQMYQKAIETLQTLPLSLILHHFASTHPQIPFIFSKYQSLSNHPLLTLQDLLHFMLNLKSP
         D + KKGMELRLLH +A G+ WF +WGY+F  G + VT ++Y+KA+E+++ +PL L+ HH  S + + P + SKYQSLS  PL+TL DL  FML+L S 
Subjt:  RDIAQKKGMELRLLHSIAHGQSWFSQWGYKFSHGAFKVTIQMYQKAIETLQTLPLSLILHHFASTHPQIPFIFSKYQSLSNHPLLTLQDLLHFMLNLKSP

Query:  LHTQNSIEYSHTLAGFLLETSCRWSPKRVEMAVRVIIEALKRAEFRWVSRQEVRDAARIYIGDTGLLDFVLKSLGNHVVGNYLVRRTLNPVTKVLEYCLQ
        L   N +  S      +  T+CRWS KR++MA++V+IE+LKR E+RW+SRQEVRDAAR YIGDTGLLDFVLKSLGN VVGNYLVRR+LNPV KVLEY L+
Subjt:  LHTQNSIEYSHTLAGFLLETSCRWSPKRVEMAVRVIIEALKRAEFRWVSRQEVRDAARIYIGDTGLLDFVLKSLGNHVVGNYLVRRTLNPVTKVLEYCLQ

Query:  DVCS-------------------SVQMKPRCKVGRAELAKDISNLYKYILGDQKPLQAK--GILPTLTMTSQIILDSKFLIKE--YVGEPQAK-LKVENE
        D+ +                    +      K+ R ++ KD+   YK+IL D K +     GIL  + M S+ ILD+K+ IKE  Y+ +  AK L ++  
Subjt:  DVCS-------------------SVQMKPRCKVGRAELAKDISNLYKYILGDQKPLQAK--GILPTLTMTSQIILDSKFLIKE--YVGEPQAK-LKVENE

Query:  DELNLLCTIRVKREEATITL---PYECIALKKNGTIDELKEEVERKFKEIYWGLRSFVVKRIV--DLEAKGRDLMFELVEVGGK-LVFEGS----GEDEG
        +EL + CTI  K       +   P ECI +KK+ T+ E+  E ER F++IYW LR  VV+ +V   +E    D   E+   G K LV EG+       E 
Subjt:  DELNLLCTIRVKREEATITL---PYECIALKKNGTIDELKEEVERKFKEIYWGLRSFVVKRIV--DLEAKGRDLMFELVEVGGK-LVFEGS----GEDEG

Query:  SFFFNNNNNNNNGGIMNKRIMECACGAVEDDGERMVCCDICEIWQHTRAV
        +  + +++        +KRI EC CGA E+DGERMVCCDICE+WQHTR V
Subjt:  SFFFNNNNNNNNGGIMNKRIMECACGAVEDDGERMVCCDICEIWQHTRAV

AT5G39680.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-21653.04Show/hide
Query:  KLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELFKKMVVKDNIFPN
        +LLKV A++  L  G  IHAHLI+TN + R     Q+NSLINLYVKC E + AR++FD M  RNVVSW A+M GY  +G   EV +LFK M       PN
Subjt:  KLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELFKKMVVKDNIFPN

Query:  EYVIATVISSC-DSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVEVLKLMISEGIEW
        E+V   V  SC +S    EGKQ HG  LK GL  H++V+N L+ +YS CS    A+++L  +P  D+  ++  +SG L+     E ++VL+   +E   W
Subjt:  EYVIATVISSC-DSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVEVLKLMISEGIEW

Query:  NNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEALDLFSNMEIDHIP
        NN TY++  RL ++L+DL L  QVH++M++   + +V    ++I+MYGKCG VL  +  FD   +QN+   T+IM A+FQ+  FEEAL+LFS M+   +P
Subjt:  NNATYVTIFRLCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEALDLFSNMEIDHIP

Query:  PNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKETLSMFQDMLTTRE
        PNEYT A+LLNS A LS L  GD LH    KSG + +V+VGNAL+ MY+KSG I  A++ FS M   D +TWN +I+G SHHGLG+E L  F  M+ T E
Subjt:  PNEYTLAVLLNSAAGLSALSLGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKETLSMFQDMLTTRE

Query:  CPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQ
         PN +TFIGVL AC+H+  V++GL+YFN LMK+F + P ++HYTCIVGLLS++G   +AE+FM++  I WDVV+WRTLLNACYV RNY  GK++AEY ++
Subjt:  CPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQ

Query:  LEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDIASVLHDIEDEQK
          P D G Y+LLSN+HA+ R W+GV K+R LM  R VKKEPGVSW+ IRN  HVF +ED +HPE  LIY KV++++SKI+PLGY PD+A   HD+++EQ+
Subjt:  LEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDIASVLHDIEDEQK

Query:  VDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW
         DNLSYHSEKLAVAYGL+KTP  +P+ V KN+R+CDDCH+A+KLISK++ R I++RD+NRFHHF DG CSC DYW
Subjt:  VDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVANRTIIVRDANRFHHFQDGCCSCGDYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACACCACCACCCTCATTTTTATAACTCATCCACTTCGCTATAAATACACTTATTTTCATCCTCTGCTTGGGCGGCCAACGGTGATCACGACGATGACCGAAGTGTT
CGGGAAGAAGAGGAGGAGGGGCGGTGGCAGTGGCAGTGGAGAGAGGGTGTTTAGATTCAAAAACTTTGGGGAGAATGGGCATCCGGCAGAGCTTGTGGGGCAGTTCCGGG
AGAATGTGGAGGCGCTGCTAGAGTTCGGGCGGTGGGAGAGCGGAGAGTGTGGTGGTGGATTCCGGTGTTGGTCGTTTCGGCTACAGGTTCAGCGACAGCCGCCTTTCCAC
GTCGTGCTGTTTGTGGTGGAGGAGGCAGTGGAGACGGCCACCGTGGCGGCGAGGCAGTGTAAACAATGCCAGTATGTTGGGTGGGGGCAGCATATGATATGCAATAAAAA
GTATCACTTTGTGTTGCCATCAAAGGAGACAATGGCGGCTGCTTCTTCAAACAGCAAAAGCATGTTGAGTTTGAGGCTTATGGAATTACAAGGTCATATCCTTCATGGGG
TTTTCCATTCTAATGGCTTTGCCCATTTGTTGTCTCTCAATGGACTAGAAAAGGGCAATTCTTTACCTGGCTATCTTCTCATGGACTTTTGGGATCGTTTGTGCAATGCA
CTTAAACCAAGGAAGGTGAGTTTGAGGGACATTGCACAAAAGAAAGGAATGGAACTAAGGCTACTCCACAGCATTGCACATGGGCAATCATGGTTTAGCCAATGGGGATA
CAAATTCAGCCATGGAGCCTTCAAAGTGACCATTCAAATGTACCAAAAAGCCATTGAAACCCTTCAAACCCTCCCTCTCTCTCTCATTCTCCACCACTTTGCCTCTACTC
ATCCTCAAATTCCCTTCATTTTCTCTAAGTACCAATCCCTTTCCAACCACCCTCTTCTCACCCTCCAAGACCTCCTCCATTTCATGCTCAACCTCAAATCCCCTCTCCAC
ACACAAAACTCCATCGAATACTCCCATACCTTGGCCGGATTCTTGCTCGAGACCTCTTGTCGATGGTCACCAAAGCGTGTCGAGATGGCAGTTCGAGTCATCATTGAGGC
ATTGAAACGAGCAGAGTTTCGATGGGTTTCACGACAGGAGGTCCGAGATGCAGCCCGTATTTATATTGGTGATACAGGGTTGTTGGATTTCGTGTTGAAGTCATTGGGAA
ATCATGTTGTTGGAAATTATTTGGTTCGAAGAACATTGAATCCTGTGACGAAAGTGCTTGAGTATTGCTTGCAAGACGTTTGTAGTAGTGTTCAAATGAAACCTAGGTGT
AAAGTTGGGAGAGCTGAGTTAGCAAAAGATATCTCCAACCTCTATAAATACATTCTTGGAGACCAAAAGCCATTGCAAGCCAAAGGAATTCTTCCCACTCTAACAATGAC
TTCGCAGATCATTCTCGACTCGAAGTTCCTCATCAAGGAGTACGTGGGGGAGCCGCAAGCAAAACTCAAGGTCGAAAATGAAGACGAATTAAACCTCTTGTGCACAATTA
GAGTGAAAAGAGAAGAAGCAACAATAACACTTCCATATGAGTGCATTGCATTGAAGAAGAATGGGACCATTGATGAGCTAAAGGAGGAAGTGGAAAGGAAGTTCAAAGAA
ATATATTGGGGATTGAGGAGCTTTGTAGTAAAAAGGATAGTGGATTTGGAGGCAAAAGGGAGGGATTTGATGTTTGAATTAGTGGAAGTTGGTGGGAAGTTGGTGTTTGA
AGGAAGTGGAGAAGATGAAGGAAGCTTTTTCTTCAATAATAATAATAATAATAATAATGGTGGAATAATGAATAAGAGAATTATGGAGTGTGCTTGTGGAGCTGTGGAAG
ATGATGGGGAGAGGATGGTGTGCTGTGATATTTGTGAAATTTGGCAGCACACAAGAGCCGTAGCTCCACCGTTCGTCTCCTTCTTCACACGGCAGCGCCGAACCTGGTCC
AATTTCTTTTCTTCTCCTCACCAGGACCCAATAAAGCTGTTGAAAGTAGCTGCAGATGCCAAGAACTTAAACTTTGGTAGAATAATCCATGCCCATCTGATCATTACCAA
TCACACCCATAGAGACTGCAAAGTAAACCAACTGAATTCCCTTATTAATTTGTACGTGAAATGTGATGAAGTATTGGTTGCTCGGCAAATGTTTGATAGTATGTCTAGAA
GAAATGTGGTATCTTGGAGCGCTTTAATGGCTGGGTACATGCAAAATGGGAGTCCCTTGGAAGTTTTTGAGTTGTTCAAAAAGATGGTCGTGAAGGATAATATCTTCCCC
AATGAATATGTGATTGCTACTGTTATATCTTCTTGTGATAGTCAAATGTATGTAGAGGGGAAGCAATGTCATGGGTATGCATTAAAGTCTGGATTGGAGCTTCATCAATA
TGTTAAGAATGCACTTATTCAGCTGTACTCTAAATGTTCAGATGTAAGGGCAGCATTGCAGATATTATATACTGTCCCCGGTTATGACATATTTTGTTATAATTTGGTAA
TGAGTGGGCTTCTACAGCACACACATACGGGAGAAGCTGTAGAAGTTCTGAAGTTAATGATTAGTGAAGGCATAGAATGGAATAATGCCACTTATGTTACAATTTTTCGC
CTTTGTGCTAGTCTTAAAGATTTGAAATTAGGTAAGCAAGTTCATGCTCAAATGTTGAAAAGTGATATTGACTATGATGTCTATATTGGAAGTTCTATCATTGACATGTA
CGGGAAGTGTGGTAATGTGTTGAGTGGAAGAGCCTGTTTCGATCAGTTGCAAAGCCAAAATGTCGTTTCTTGGACATCAATCATGGCAGCTCATTTTCAGAATGGATTCT
TTGAAGAAGCACTGGATCTGTTTTCAAATATGGAAATTGATCATATTCCTCCCAATGAATATACACTGGCAGTGTTGTTAAACTCTGCTGCAGGCTTGTCTGCTCTAAGC
CTTGGGGATCAGTTACATGCACGTGCTGAGAAATCAGGTCTCAAGGGCAATGTTATAGTAGGTAATGCCTTGATCATTATGTATTCCAAGAGTGGGGACATTTTAGCAGC
ACAACGTGTGTTTTCAAATATGCCGTGTTGTGATTCCATTACCTGGAATGCAATAATCACTGGCCATTCACACCATGGTCTTGGCAAAGAAACTTTAAGCATGTTCCAGG
ACATGTTGACTACTAGAGAGTGTCCTAATTATGTAACCTTTATTGGTGTTCTCTCTGCTTGTGCCCATTTAAGCCTGGTGGATGAAGGATTATACTATTTTAATCATTTG
ATGAAACAGTTTGGTATTGTTCCTGGATTAGAGCACTATACCTGTATTGTTGGGCTTCTAAGTAGATCTGGACGACTGGATGAAGCTGAGAACTTTATGAAGTCAAATCA
AATAAATTGGGATGTTGTATCCTGGCGCACCCTTCTCAATGCTTGTTATGTTCATAGAAATTATGATAAAGGGAAGCAAATAGCAGAGTACTTGCTACAGTTGGAGCCTG
AGGATGTAGGAACTTATATTCTATTATCAAACATGCACGCGAGAGTTAGGAGGTGGGATGGTGTTGTTAAGATTCGAAAATTGATGAGGGAAAGAAATGTCAAGAAAGAG
CCTGGAGTGAGCTGGTTAGAAATAAGAAATATTGCCCATGTTTTTACATCTGAGGATACTAAACACCCAGAGTCCAATCTGATTTATAAAAAGGTAAGGGACTTATTATC
TAAGATTCGACCTTTGGGGTATGTTCCTGATATTGCTAGTGTATTGCACGATATTGAGGACGAGCAAAAGGTAGACAATCTTAGCTATCACAGTGAGAAGCTTGCTGTAG
CATATGGCCTGATGAAGACACCATCGGGCGCACCAATCAGGGTGATTAAGAACCTTAGGATGTGCGATGATTGTCACACTGCTGTCAAACTTATCTCCAAGGTCGCTAAT
AGGACTATAATTGTTAGAGATGCCAATCGCTTCCATCATTTTCAAGATGGTTGTTGCTCATGTGGAGATTATTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCACACCACCACCCTCATTTTTATAACTCATCCACTTCGCTATAAATACACTTATTTTCATCCTCTGCTTGGGCGGCCAACGGTGATCACGACGATGACCGAAGTGTT
CGGGAAGAAGAGGAGGAGGGGCGGTGGCAGTGGCAGTGGAGAGAGGGTGTTTAGATTCAAAAACTTTGGGGAGAATGGGCATCCGGCAGAGCTTGTGGGGCAGTTCCGGG
AGAATGTGGAGGCGCTGCTAGAGTTCGGGCGGTGGGAGAGCGGAGAGTGTGGTGGTGGATTCCGGTGTTGGTCGTTTCGGCTACAGGTTCAGCGACAGCCGCCTTTCCAC
GTCGTGCTGTTTGTGGTGGAGGAGGCAGTGGAGACGGCCACCGTGGCGGCGAGGCAGTGTAAACAATGCCAGTATGTTGGGTGGGGGCAGCATATGATATGCAATAAAAA
GTATCACTTTGTGTTGCCATCAAAGGAGACAATGGCGGCTGCTTCTTCAAACAGCAAAAGCATGTTGAGTTTGAGGCTTATGGAATTACAAGGTCATATCCTTCATGGGG
TTTTCCATTCTAATGGCTTTGCCCATTTGTTGTCTCTCAATGGACTAGAAAAGGGCAATTCTTTACCTGGCTATCTTCTCATGGACTTTTGGGATCGTTTGTGCAATGCA
CTTAAACCAAGGAAGGTGAGTTTGAGGGACATTGCACAAAAGAAAGGAATGGAACTAAGGCTACTCCACAGCATTGCACATGGGCAATCATGGTTTAGCCAATGGGGATA
CAAATTCAGCCATGGAGCCTTCAAAGTGACCATTCAAATGTACCAAAAAGCCATTGAAACCCTTCAAACCCTCCCTCTCTCTCTCATTCTCCACCACTTTGCCTCTACTC
ATCCTCAAATTCCCTTCATTTTCTCTAAGTACCAATCCCTTTCCAACCACCCTCTTCTCACCCTCCAAGACCTCCTCCATTTCATGCTCAACCTCAAATCCCCTCTCCAC
ACACAAAACTCCATCGAATACTCCCATACCTTGGCCGGATTCTTGCTCGAGACCTCTTGTCGATGGTCACCAAAGCGTGTCGAGATGGCAGTTCGAGTCATCATTGAGGC
ATTGAAACGAGCAGAGTTTCGATGGGTTTCACGACAGGAGGTCCGAGATGCAGCCCGTATTTATATTGGTGATACAGGGTTGTTGGATTTCGTGTTGAAGTCATTGGGAA
ATCATGTTGTTGGAAATTATTTGGTTCGAAGAACATTGAATCCTGTGACGAAAGTGCTTGAGTATTGCTTGCAAGACGTTTGTAGTAGTGTTCAAATGAAACCTAGGTGT
AAAGTTGGGAGAGCTGAGTTAGCAAAAGATATCTCCAACCTCTATAAATACATTCTTGGAGACCAAAAGCCATTGCAAGCCAAAGGAATTCTTCCCACTCTAACAATGAC
TTCGCAGATCATTCTCGACTCGAAGTTCCTCATCAAGGAGTACGTGGGGGAGCCGCAAGCAAAACTCAAGGTCGAAAATGAAGACGAATTAAACCTCTTGTGCACAATTA
GAGTGAAAAGAGAAGAAGCAACAATAACACTTCCATATGAGTGCATTGCATTGAAGAAGAATGGGACCATTGATGAGCTAAAGGAGGAAGTGGAAAGGAAGTTCAAAGAA
ATATATTGGGGATTGAGGAGCTTTGTAGTAAAAAGGATAGTGGATTTGGAGGCAAAAGGGAGGGATTTGATGTTTGAATTAGTGGAAGTTGGTGGGAAGTTGGTGTTTGA
AGGAAGTGGAGAAGATGAAGGAAGCTTTTTCTTCAATAATAATAATAATAATAATAATGGTGGAATAATGAATAAGAGAATTATGGAGTGTGCTTGTGGAGCTGTGGAAG
ATGATGGGGAGAGGATGGTGTGCTGTGATATTTGTGAAATTTGGCAGCACACAAGAGCCGTAGCTCCACCGTTCGTCTCCTTCTTCACACGGCAGCGCCGAACCTGGTCC
AATTTCTTTTCTTCTCCTCACCAGGACCCAATAAAGCTGTTGAAAGTAGCTGCAGATGCCAAGAACTTAAACTTTGGTAGAATAATCCATGCCCATCTGATCATTACCAA
TCACACCCATAGAGACTGCAAAGTAAACCAACTGAATTCCCTTATTAATTTGTACGTGAAATGTGATGAAGTATTGGTTGCTCGGCAAATGTTTGATAGTATGTCTAGAA
GAAATGTGGTATCTTGGAGCGCTTTAATGGCTGGGTACATGCAAAATGGGAGTCCCTTGGAAGTTTTTGAGTTGTTCAAAAAGATGGTCGTGAAGGATAATATCTTCCCC
AATGAATATGTGATTGCTACTGTTATATCTTCTTGTGATAGTCAAATGTATGTAGAGGGGAAGCAATGTCATGGGTATGCATTAAAGTCTGGATTGGAGCTTCATCAATA
TGTTAAGAATGCACTTATTCAGCTGTACTCTAAATGTTCAGATGTAAGGGCAGCATTGCAGATATTATATACTGTCCCCGGTTATGACATATTTTGTTATAATTTGGTAA
TGAGTGGGCTTCTACAGCACACACATACGGGAGAAGCTGTAGAAGTTCTGAAGTTAATGATTAGTGAAGGCATAGAATGGAATAATGCCACTTATGTTACAATTTTTCGC
CTTTGTGCTAGTCTTAAAGATTTGAAATTAGGTAAGCAAGTTCATGCTCAAATGTTGAAAAGTGATATTGACTATGATGTCTATATTGGAAGTTCTATCATTGACATGTA
CGGGAAGTGTGGTAATGTGTTGAGTGGAAGAGCCTGTTTCGATCAGTTGCAAAGCCAAAATGTCGTTTCTTGGACATCAATCATGGCAGCTCATTTTCAGAATGGATTCT
TTGAAGAAGCACTGGATCTGTTTTCAAATATGGAAATTGATCATATTCCTCCCAATGAATATACACTGGCAGTGTTGTTAAACTCTGCTGCAGGCTTGTCTGCTCTAAGC
CTTGGGGATCAGTTACATGCACGTGCTGAGAAATCAGGTCTCAAGGGCAATGTTATAGTAGGTAATGCCTTGATCATTATGTATTCCAAGAGTGGGGACATTTTAGCAGC
ACAACGTGTGTTTTCAAATATGCCGTGTTGTGATTCCATTACCTGGAATGCAATAATCACTGGCCATTCACACCATGGTCTTGGCAAAGAAACTTTAAGCATGTTCCAGG
ACATGTTGACTACTAGAGAGTGTCCTAATTATGTAACCTTTATTGGTGTTCTCTCTGCTTGTGCCCATTTAAGCCTGGTGGATGAAGGATTATACTATTTTAATCATTTG
ATGAAACAGTTTGGTATTGTTCCTGGATTAGAGCACTATACCTGTATTGTTGGGCTTCTAAGTAGATCTGGACGACTGGATGAAGCTGAGAACTTTATGAAGTCAAATCA
AATAAATTGGGATGTTGTATCCTGGCGCACCCTTCTCAATGCTTGTTATGTTCATAGAAATTATGATAAAGGGAAGCAAATAGCAGAGTACTTGCTACAGTTGGAGCCTG
AGGATGTAGGAACTTATATTCTATTATCAAACATGCACGCGAGAGTTAGGAGGTGGGATGGTGTTGTTAAGATTCGAAAATTGATGAGGGAAAGAAATGTCAAGAAAGAG
CCTGGAGTGAGCTGGTTAGAAATAAGAAATATTGCCCATGTTTTTACATCTGAGGATACTAAACACCCAGAGTCCAATCTGATTTATAAAAAGGTAAGGGACTTATTATC
TAAGATTCGACCTTTGGGGTATGTTCCTGATATTGCTAGTGTATTGCACGATATTGAGGACGAGCAAAAGGTAGACAATCTTAGCTATCACAGTGAGAAGCTTGCTGTAG
CATATGGCCTGATGAAGACACCATCGGGCGCACCAATCAGGGTGATTAAGAACCTTAGGATGTGCGATGATTGTCACACTGCTGTCAAACTTATCTCCAAGGTCGCTAAT
AGGACTATAATTGTTAGAGATGCCAATCGCTTCCATCATTTTCAAGATGGTTGTTGCTCATGTGGAGATTATTGGTGA
Protein sequenceShow/hide protein sequence
MHTTTLIFITHPLRYKYTYFHPLLGRPTVITTMTEVFGKKRRRGGGSGSGERVFRFKNFGENGHPAELVGQFRENVEALLEFGRWESGECGGGFRCWSFRLQVQRQPPFH
VVLFVVEEAVETATVAARQCKQCQYVGWGQHMICNKKYHFVLPSKETMAAASSNSKSMLSLRLMELQGHILHGVFHSNGFAHLLSLNGLEKGNSLPGYLLMDFWDRLCNA
LKPRKVSLRDIAQKKGMELRLLHSIAHGQSWFSQWGYKFSHGAFKVTIQMYQKAIETLQTLPLSLILHHFASTHPQIPFIFSKYQSLSNHPLLTLQDLLHFMLNLKSPLH
TQNSIEYSHTLAGFLLETSCRWSPKRVEMAVRVIIEALKRAEFRWVSRQEVRDAARIYIGDTGLLDFVLKSLGNHVVGNYLVRRTLNPVTKVLEYCLQDVCSSVQMKPRC
KVGRAELAKDISNLYKYILGDQKPLQAKGILPTLTMTSQIILDSKFLIKEYVGEPQAKLKVENEDELNLLCTIRVKREEATITLPYECIALKKNGTIDELKEEVERKFKE
IYWGLRSFVVKRIVDLEAKGRDLMFELVEVGGKLVFEGSGEDEGSFFFNNNNNNNNGGIMNKRIMECACGAVEDDGERMVCCDICEIWQHTRAVAPPFVSFFTRQRRTWS
NFFSSPHQDPIKLLKVAADAKNLNFGRIIHAHLIITNHTHRDCKVNQLNSLINLYVKCDEVLVARQMFDSMSRRNVVSWSALMAGYMQNGSPLEVFELFKKMVVKDNIFP
NEYVIATVISSCDSQMYVEGKQCHGYALKSGLELHQYVKNALIQLYSKCSDVRAALQILYTVPGYDIFCYNLVMSGLLQHTHTGEAVEVLKLMISEGIEWNNATYVTIFR
LCASLKDLKLGKQVHAQMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRACFDQLQSQNVVSWTSIMAAHFQNGFFEEALDLFSNMEIDHIPPNEYTLAVLLNSAAGLSALS
LGDQLHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQRVFSNMPCCDSITWNAIITGHSHHGLGKETLSMFQDMLTTRECPNYVTFIGVLSACAHLSLVDEGLYYFNHL
MKQFGIVPGLEHYTCIVGLLSRSGRLDEAENFMKSNQINWDVVSWRTLLNACYVHRNYDKGKQIAEYLLQLEPEDVGTYILLSNMHARVRRWDGVVKIRKLMRERNVKKE
PGVSWLEIRNIAHVFTSEDTKHPESNLIYKKVRDLLSKIRPLGYVPDIASVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIRVIKNLRMCDDCHTAVKLISKVAN
RTIIVRDANRFHHFQDGCCSCGDYW