; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006395 (gene) of Snake gourd v1 genome

Gene IDTan0006395
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTP_methylase domain-containing protein
Genome locationLG06:4121290..4124980
RNA-Seq ExpressionTan0006395
SyntenyTan0006395
Gene Ontology termsGO:0019354 - siroheme biosynthetic process (biological process)
GO:0032259 - methylation (biological process)
GO:0004851 - uroporphyrin-III C-methyltransferase activity (molecular function)
InterPro domainsIPR000878 - Tetrapyrrole methylase
IPR003043 - Uroporphiryn-III C-methyltransferase, conserved site
IPR006366 - Uroporphyrin-III C-methyltransferase
IPR014776 - Tetrapyrrole methylase, subdomain 2
IPR014777 - Tetrapyrrole methylase, subdomain 1
IPR035996 - Tetrapyrrole methylase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575668.1 hypothetical protein SDJN03_26307, partial [Cucurbita argyrosperma subsp. sororia]3.1e-19093.82Show/hide
Query:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQ
        MARV +LQSLSS F SRPTIPRS NFKPICSFHC SSSSSSSSPFTEKHSVKRYQRDDWLYKNQAD+SS+A SCSIP DSESIR+NDIALQLPELKKLLQ
Subjt:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQ

Query:  VLREKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVV
        VLREKR +SG DDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVG DARLLYVGKTAG+HSRTQEEIHELLLNFAEAGATVV
Subjt:  VLREKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVV

Query:  RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKL
        RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGI+AELGIPLTHRGVATSVRFLTGHSR+GGTDPLFVAENAADPDSTLVVYMGLSTLPSLA+KL
Subjt:  RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKL

Query:  MQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA
        M HGLPPDTPAAA+ERGTTPQQR VFAELK+LADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA
Subjt:  MQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA

XP_008461254.1 PREDICTED: uroporphyrinogen-III C-methyltransferase [Cucumis melo]2.2e-19192.47Show/hide
Query:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQ
        MAR  +LQSLSSPF S PTIPRSPNFKPI SFHCS SSSSSSSPFTEKHS+KRYQRDDWLYKNQ+DQ S+  SCSIP DSES+R+NDIA+QLPELKKLL+
Subjt:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQ

Query:  VLREKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVV
        VLREKRV+SGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVL+LVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVV
Subjt:  VLREKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVV

Query:  RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKL
        RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGI+AELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLA+KL
Subjt:  RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKL

Query:  MQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA
        M HGLPPDTPAAAVERGTTPQQRTVFA LK+LADEIKAAELVSPTLI+IG+VVSLSPHWSLSS EASSLVEA
Subjt:  MQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA

XP_022954326.1 uncharacterized protein LOC111456604 [Cucurbita moschata]1.1e-19093.33Show/hide
Query:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHC---SSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKK
        MARV +LQSLSS F SRPTIPRS NFKPICSFHC   SSSSSSSSSPFTEKHSVKRYQRDDWLYKNQAD+SS+A SCSIP DSESIR+NDIALQLPELKK
Subjt:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHC---SSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKK

Query:  LLQVLREKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGA
        LLQVLREKR +SG DDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVG DARLLYVGKTAG+HSRTQEEIHELLLNFAEAGA
Subjt:  LLQVLREKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGA

Query:  TVVRLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLA
        TVVRLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGI+AELGIPLTHRGVATSVRFLTGHSR+GGTDPLFVAENAADPDSTLVVYMGLSTLPSLA
Subjt:  TVVRLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLA

Query:  IKLMQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA
        +KLM HGLPPDTPAAA+ERGTTPQQR VFAELK+LADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA
Subjt:  IKLMQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA

XP_022991905.1 uncharacterized protein LOC111488402 [Cucurbita maxima]3.1e-19093.01Show/hide
Query:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQ
        MARV +LQSLSS F SRPTIPRS NFKPICSFHC    SSSSSPFTEKHSVKRYQRDDW+YKNQAD+SS+A SCSI CDSESIR+NDIALQLPELKKLLQ
Subjt:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQ

Query:  VLREKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVV
        VLREKR +SGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVG DARLLYVGKTAG+HSRTQEEIHELLLNFAEAGATVV
Subjt:  VLREKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVV

Query:  RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKL
        RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGI+AELGIPLTHRGVATSVRFLTGHSR+GGTDPLFVAENAADPDSTLVVYMGLSTLPSLA+KL
Subjt:  RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKL

Query:  MQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA
        M HGLPPDTPAAA+ERGTTPQQR VFAELK+LADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA
Subjt:  MQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA

XP_023548332.1 uncharacterized protein LOC111807000 [Cucurbita pepo subsp. pepo]1.8e-19093.28Show/hide
Query:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQ
        MARV +LQSLSS F SRPTIPRS NFKPICSFHC    SSSSSPFTEKHSVKRYQRDDWLYKNQAD+SS+A SCSIP DSESIR+NDIALQLPELKKLLQ
Subjt:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQ

Query:  VLREKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVV
        VLREKR +SGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVG DARLLYVGKTAG+HSRTQEEIHELLLNFAEAGATVV
Subjt:  VLREKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVV

Query:  RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKL
        RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGI+AELGIPLTHRGVATSVRFLTGHSR+GGTDPLFVAENAADPDSTLVVYMGLSTLPSLA+KL
Subjt:  RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKL

Query:  MQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA
        M HGLPPDTPAAA+ERGTTPQQR VFAELK+LADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA
Subjt:  MQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA

TrEMBL top hitse value%identityAlignment
A0A0A0KBI8 TP_methylase domain-containing protein1.5e-19091.94Show/hide
Query:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQ
        MAR  +LQSLSSPF S PTIPRSPNFKPI SFHCSS+SSSSSSPFTEKHSVKRYQRDDWLYK Q+DQ S+  SCSIP DSESIR+NDIA+QLPELKKLL+
Subjt:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQ

Query:  VLREKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVV
        VLREKRV++GCDDGKCGPG+VFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVL+LVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVV
Subjt:  VLREKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVV

Query:  RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKL
        RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGI+AELGIPLTHRGVATSVRFLTGHSRKGGTDPL+VAENAADPDSTLVVYMGLSTLPSLA+KL
Subjt:  RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKL

Query:  MQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA
        M HGLPPDTPAAAVERGTTPQQRTVFA+LK+LADEIKAAELVSPTLI+IG+VVSLSPHWSLSS EASSLVEA
Subjt:  MQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA

A0A1S3CDT8 uroporphyrinogen-III C-methyltransferase1.0e-19192.47Show/hide
Query:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQ
        MAR  +LQSLSSPF S PTIPRSPNFKPI SFHCS SSSSSSSPFTEKHS+KRYQRDDWLYKNQ+DQ S+  SCSIP DSES+R+NDIA+QLPELKKLL+
Subjt:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQ

Query:  VLREKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVV
        VLREKRV+SGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVL+LVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVV
Subjt:  VLREKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVV

Query:  RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKL
        RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGI+AELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLA+KL
Subjt:  RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKL

Query:  MQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA
        M HGLPPDTPAAAVERGTTPQQRTVFA LK+LADEIKAAELVSPTLI+IG+VVSLSPHWSLSS EASSLVEA
Subjt:  MQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA

A0A6J1D4K3 uncharacterized protein LOC1110175353.9e-18690.88Show/hide
Query:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQ
        MARVY+LQSLSSPF SRP  PR+P FKPI S HC    SSSSSPFTEKHS+KRYQRDDW+YKNQADQ+S A SCS+PCDS+SIR++DIALQLPELK+LL 
Subjt:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQ

Query:  VLREKRVNSGCDD-GKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATV
        VLREKRVN GCDD G+CGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATV
Subjt:  VLREKRVNSGCDD-GKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATV

Query:  VRLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIK
        VRLKGGDPLVFGRGGEEMDFLQQ+GIQVKIVPGITAASGI+AELGIPLTHRGVAT+VRFLTGHSRKGGTDPLFVAENAAD DSTLVVYMGLSTLPSLA+K
Subjt:  VRLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIK

Query:  LMQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA
        LM HGLPPDTPAAAVERGTTPQQRTVFAELK+LADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEAS LVEA
Subjt:  LMQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA

A0A6J1GQM1 uncharacterized protein LOC1114566045.2e-19193.33Show/hide
Query:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHC---SSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKK
        MARV +LQSLSS F SRPTIPRS NFKPICSFHC   SSSSSSSSSPFTEKHSVKRYQRDDWLYKNQAD+SS+A SCSIP DSESIR+NDIALQLPELKK
Subjt:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHC---SSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKK

Query:  LLQVLREKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGA
        LLQVLREKR +SG DDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVG DARLLYVGKTAG+HSRTQEEIHELLLNFAEAGA
Subjt:  LLQVLREKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGA

Query:  TVVRLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLA
        TVVRLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGI+AELGIPLTHRGVATSVRFLTGHSR+GGTDPLFVAENAADPDSTLVVYMGLSTLPSLA
Subjt:  TVVRLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLA

Query:  IKLMQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA
        +KLM HGLPPDTPAAA+ERGTTPQQR VFAELK+LADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA
Subjt:  IKLMQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA

A0A6J1JU84 uncharacterized protein LOC1114884021.5e-19093.01Show/hide
Query:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQ
        MARV +LQSLSS F SRPTIPRS NFKPICSFHC    SSSSSPFTEKHSVKRYQRDDW+YKNQAD+SS+A SCSI CDSESIR+NDIALQLPELKKLLQ
Subjt:  MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQ

Query:  VLREKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVV
        VLREKR +SGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVG DARLLYVGKTAG+HSRTQEEIHELLLNFAEAGATVV
Subjt:  VLREKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVV

Query:  RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKL
        RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGI+AELGIPLTHRGVATSVRFLTGHSR+GGTDPLFVAENAADPDSTLVVYMGLSTLPSLA+KL
Subjt:  RLKGGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKL

Query:  MQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA
        M HGLPPDTPAAA+ERGTTPQQR VFAELK+LADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA
Subjt:  MQHGLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA

SwissProt top hitse value%identityAlignment
A0KP37 Siroheme synthase 25.7e-6251.98Show/hide
Query:  EKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLK
        E+ +N G D  K   G V LVG GPGDP LLTLKA++ IQ A+++LYD+LVS ++LDLV  DA L+ VGK AG HS  QEE + LL+ +A+AG  VVRLK
Subjt:  EKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLK

Query:  GGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKLMQH
        GGDP +FGRGGEE++ L ++GI   +VPGITAA+G +A  GIPLTHR  A S  F+TGH +  G +P +  +  A    TLV+YMGL     +  +L+ H
Subjt:  GGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKLMQH

Query:  GLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSL
        G    TP A +ERGTT +QR +   L +LA+   AA+ VSP+LI+IG+VV+L
Subjt:  GLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSL

B8GUD3 Siroheme synthase2.0e-6249.6Show/hide
Query:  EKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLK
        EK + +G D    G G VFLVG GPGDP+LLT +A++++Q AD+++YD LVS  +++LV  DA ++Y GK    H+  QEEI++LL+  A+ G  V+RLK
Subjt:  EKRVNSGCDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLK

Query:  GGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKLMQH
        GGDP +FGRGGEE+D L Q+GI  ++VPGITAA+G ++  GIPLTHR  A +V F TGH R G  D  +  +  A P  T+V YMGL  LP +  +LM H
Subjt:  GGDPLVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKLMQH

Query:  GLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSL
        G+ PD P A VE+GTT  QR +   L ++ D +K  ++  PTLII+G+VV L
Subjt:  GLPPDTPAAAVERGTTPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSL

Q0VQ05 Siroheme synthase4.9e-6150.21Show/hide
Query:  GNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLKGGDPLVFGRGGEEMD
        G V+LVG GPGDP+LLT +A++++Q AD++LYDRLV   ++DL   DA L+YVGK    H+  Q+ I+ELL+++A+ G  V RLKGGDP +FGRGGEE+D
Subjt:  GNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLKGGDPLVFGRGGEEMD

Query:  FLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKLMQHGLPPDTPAAAVERGT
         +  +GI  ++VPGITAASG ++  GIPLTHR  A SVRF+TGH + G  D     ++      T+V YMGL  L  +  +L+ HG   DTP A V RGT
Subjt:  FLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKLMQHGLPPDTPAAAVERGT

Query:  TPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSP
        T  Q  +   L  L D+I+  E+ +PTLII+G VVSL P
Subjt:  TPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSLSP

Q42606 S-adenosyl-L-methionine-dependent uroporphyrinogen III methyltransferase, chloroplastic7.3e-15080.4Show/hide
Query:  NFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQVLREKRVNSGCDDGKCGPGNVFLV
        N  PIC  H  +++SSSSSPFTEKHSV+RYQRD WLYK         PS S   D   +R+NDIA QLPELKKLL VL+EKRV  GC  G CGPG+V+LV
Subjt:  NFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQVLREKRVNSGCDDGKCGPGNVFLV

Query:  GTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLKGGDPLVFGRGGEEMDFLQQQG
        GTGPGDPELLTLKAV+VIQSADLLLYDRLVSNDVL+LV PDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLKGGDPLVFGRGGEEMDFLQQQG
Subjt:  GTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLKGGDPLVFGRGGEEMDFLQQQG

Query:  IQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKLMQHGLPPDTPAAAVERGTTPQQRT
        I+V+++PGITAASGI+AELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPD+TLVVYMGL TLPSLA KLM HGLP DTPA AVERGTTP QRT
Subjt:  IQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKLMQHGLPPDTPAAAVERGTTPQQRT

Query:  VFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVE
        VFAELK+ A EI++A LVSPTLIIIGKVV LSP W   +KE+S LVE
Subjt:  VFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVE

Q6F8G6 Siroheme synthase5.7e-6253.59Show/hide
Query:  GNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLKGGDPLVFGRGGEEMD
        G V+LVG GPGDPELLTLKA++++Q AD+++YDRLVS  +L+L   DA  +YVGK    HS  QE I+ LL+ +A+AG  V RLKGGDP +FGRGGEE+ 
Subjt:  GNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLKGGDPLVFGRGGEEMD

Query:  FLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKLMQHGLPPDTPAAAVERGT
         L   GI  ++VPGITAASG SA  GIPLTHR  A SVRFLTGH ++G   P          + TLV+YMGL  L  +  +L+ HG  PD P A V +GT
Subjt:  FLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKLMQHGLPPDTPAAAVERGT

Query:  TPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSL
        TP+Q+ V   L N+A +I   ++ +PTL IIG+VVSL
Subjt:  TPQQRTVFAELKNLADEIKAAELVSPTLIIIGKVVSL

Arabidopsis top hitse value%identityAlignment
AT1G45110.1 Tetrapyrrole (Corrin/Porphyrin) Methylases7.8e-0629.49Show/hide
Query:  DDGKCGP--GNVFLVGTGPGDPELLTLKAVKVIQSADLLL-YDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLK-GGDP
        DD K GP    ++LVGT  G+ E +TL+A++V++SAD++L  D   S  +L      A+LL       YH   + +  + +L   + G  V  +   G P
Subjt:  DDGKCGP--GNVFLVGTGPGDPELLTLKAVKVIQSADLLL-YDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLK-GGDP

Query:  LVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHS
         +   G +      ++ I V  +PG   A  + A L          T V FL  HS
Subjt:  LVFGRGGEEMDFLQQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHS

AT5G40850.1 urophorphyrin methylase 15.2e-15180.4Show/hide
Query:  NFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQVLREKRVNSGCDDGKCGPGNVFLV
        N  PIC  H  +++SSSSSPFTEKHSV+RYQRD WLYK         PS S   D   +R+NDIA QLPELKKLL VL+EKRV  GC  G CGPG+V+LV
Subjt:  NFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQVLREKRVNSGCDDGKCGPGNVFLV

Query:  GTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLKGGDPLVFGRGGEEMDFLQQQG
        GTGPGDPELLTLKAV+VIQSADLLLYDRLVSNDVL+LV PDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLKGGDPLVFGRGGEEMDFLQQQG
Subjt:  GTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLKGGDPLVFGRGGEEMDFLQQQG

Query:  IQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKLMQHGLPPDTPAAAVERGTTPQQRT
        I+V+++PGITAASGI+AELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPD+TLVVYMGL TLPSLA KLM HGLP DTPA AVERGTTP QRT
Subjt:  IQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKLMQHGLPPDTPAAAVERGTTPQQRT

Query:  VFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVE
        VFAELK+ A EI++A LVSPTLIIIGKVV LSP W   +KE+S LVE
Subjt:  VFAELKNLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVE

AT5G40850.2 urophorphyrin methylase 13.1e-13578.86Show/hide
Query:  NFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQVLREKRVNSGCDDGKCGPGNVFLV
        N  PIC  H  +++SSSSSPFTEKHSV+RYQRD WLYK         PS S   D   +R+NDIA QLPELKKLL VL+EKRV  GC  G CGPG+V+LV
Subjt:  NFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQVLREKRVNSGCDDGKCGPGNVFLV

Query:  GTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLKGGDPLVFGRGGEEMDFLQQQG
        GTGPGDPELLTLKAV+VIQSADLLLYDRLVSNDVL+LV PDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLKGGDPLVFGRGGEEMDFLQQQG
Subjt:  GTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLKGGDPLVFGRGGEEMDFLQQQG

Query:  IQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKLMQHGLPPDTPAAAVERGTTPQQRT
        I+V+++PGITAASGI+AELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPD+TLVVYMGL TLPSLA KLM HGLP DTPA AVERGTTP QRT
Subjt:  IQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKLMQHGLPPDTPAAAVERGTTPQQRT

Query:  VFAELKNLADEIKAAEL
         F  L      +K  +L
Subjt:  VFAELKNLADEIKAAEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCGTGTTTACGAGCTTCAATCGCTTTCATCTCCATTTTTGTCTCGCCCAACAATACCCAGGTCCCCAAATTTCAAACCCATTTGCTCGTTTCACTGCAGCTCTTC
TTCCTCTTCTTCTTCTTCCCCATTTACAGAGAAACACTCCGTCAAGAGATACCAAAGAGACGATTGGCTGTACAAGAACCAAGCGGACCAAAGTTCAATTGCTCCATCGT
GTTCTATTCCCTGTGATTCTGAGTCCATACGGAAGAATGACATTGCCTTGCAGCTGCCGGAGCTGAAGAAATTGCTGCAGGTGCTAAGGGAAAAGAGGGTCAATAGTGGA
TGCGATGATGGAAAATGTGGGCCTGGGAATGTATTTCTGGTGGGGACTGGCCCTGGAGATCCAGAGCTTTTGACATTGAAGGCGGTGAAAGTTATTCAGAGTGCTGATTT
GCTTTTGTATGATCGATTGGTCTCTAATGATGTGTTGGATTTGGTGGGTCCTGATGCTAGGCTTCTCTATGTGGGCAAGACTGCAGGTTACCATAGCAGAACCCAGGAGG
AGATTCATGAGCTACTTCTGAACTTTGCTGAAGCTGGAGCTACAGTTGTGAGACTTAAAGGAGGAGACCCTCTTGTGTTTGGAAGGGGTGGGGAGGAGATGGATTTCTTG
CAACAACAAGGGATTCAAGTAAAAATTGTTCCTGGTATAACTGCTGCTTCAGGTATATCAGCTGAATTGGGGATTCCTTTAACACACAGGGGCGTTGCAACGAGTGTCAG
GTTCCTCACTGGTCACTCGAGAAAGGGTGGAACCGATCCTCTATTCGTAGCAGAAAATGCAGCTGATCCAGATTCAACTCTGGTGGTATATATGGGTCTGTCGACTCTTC
CATCTCTTGCCATTAAGTTGATGCAACACGGTCTGCCACCGGATACCCCAGCTGCTGCTGTAGAACGAGGGACAACACCTCAACAAAGAACTGTTTTTGCGGAACTGAAG
AACCTTGCAGATGAAATCAAAGCAGCAGAGTTGGTTTCACCCACTTTAATCATAATTGGAAAAGTGGTTTCTCTCTCACCACATTGGTCTCTTTCTTCCAAAGAAGCTTC
CAGTTTGGTGGAGGCTTAA
mRNA sequenceShow/hide mRNA sequence
TGAAACACCAAGTTGTCAAATTCATGGAAGGAAGCCAATGGCTAGACAGTCCAAAGGAATTTCGATTTGGTCTCTCGCACATCTCAAAAGCCAAAAGGCCACTCCTTTTG
CCTTCTCATCTTCTTCAAAGTCCATGAATTAGAGCGGCGAGCCGCCTCTTATTTATCCCTTATTTATTCCGACCAAATTCCCTTGGGGACATCGCATTTTTCCCCTTTCT
CCTTCGCCATGGCTCGTGTTTACGAGCTTCAATCGCTTTCATCTCCATTTTTGTCTCGCCCAACAATACCCAGGTCCCCAAATTTCAAACCCATTTGCTCGTTTCACTGC
AGCTCTTCTTCCTCTTCTTCTTCTTCCCCATTTACAGAGAAACACTCCGTCAAGAGATACCAAAGAGACGATTGGCTGTACAAGAACCAAGCGGACCAAAGTTCAATTGC
TCCATCGTGTTCTATTCCCTGTGATTCTGAGTCCATACGGAAGAATGACATTGCCTTGCAGCTGCCGGAGCTGAAGAAATTGCTGCAGGTGCTAAGGGAAAAGAGGGTCA
ATAGTGGATGCGATGATGGAAAATGTGGGCCTGGGAATGTATTTCTGGTGGGGACTGGCCCTGGAGATCCAGAGCTTTTGACATTGAAGGCGGTGAAAGTTATTCAGAGT
GCTGATTTGCTTTTGTATGATCGATTGGTCTCTAATGATGTGTTGGATTTGGTGGGTCCTGATGCTAGGCTTCTCTATGTGGGCAAGACTGCAGGTTACCATAGCAGAAC
CCAGGAGGAGATTCATGAGCTACTTCTGAACTTTGCTGAAGCTGGAGCTACAGTTGTGAGACTTAAAGGAGGAGACCCTCTTGTGTTTGGAAGGGGTGGGGAGGAGATGG
ATTTCTTGCAACAACAAGGGATTCAAGTAAAAATTGTTCCTGGTATAACTGCTGCTTCAGGTATATCAGCTGAATTGGGGATTCCTTTAACACACAGGGGCGTTGCAACG
AGTGTCAGGTTCCTCACTGGTCACTCGAGAAAGGGTGGAACCGATCCTCTATTCGTAGCAGAAAATGCAGCTGATCCAGATTCAACTCTGGTGGTATATATGGGTCTGTC
GACTCTTCCATCTCTTGCCATTAAGTTGATGCAACACGGTCTGCCACCGGATACCCCAGCTGCTGCTGTAGAACGAGGGACAACACCTCAACAAAGAACTGTTTTTGCGG
AACTGAAGAACCTTGCAGATGAAATCAAAGCAGCAGAGTTGGTTTCACCCACTTTAATCATAATTGGAAAAGTGGTTTCTCTCTCACCACATTGGTCTCTTTCTTCCAAA
GAAGCTTCCAGTTTGGTGGAGGCTTAATAGAATTAAAGAGCCTAGAAAAGGTTATGCAAACCAGGAATAGTTCTTCATTCTGTCCTTTACAAATTACTTCGACGAGCAAC
TGAGATAAGAAGCTCGAGATGTCGAGTTTTACCACAAGATTTTTGTCGATTTTGGGGAGAAAACATTGAAACCATGACAGATGGAAAAAGAAGAATTAGGGACTCTCGTG
TCCCTTCGCATTTGATTCGTCTGCTCATTGCATTCGAATGACTAGACTCGGAATTTTTCTGAAGGATTGGATTGATGCAGCAATCTTCTTTCTCGGATTCTCGGAGCACG
GTGAGCAGCATATTCTCATACCATTTCTTGCATTGATTCAAATAAAAGTTCCAAACTTCAATTTTTTCTTAGTTTATGAGGGATTCTAATGTGGAGAGATTACTTAATCA
GCAATCAATTTTTTTGAAAAAGTATAGTTGATAAAGTGTAGGTGAAAATCTTGAAGAAATTTCCTTGGAATATCTGATGTATTGATTAGACTCCCCTTTTGTTTTGTTTT
GTTTGTTTTTTTTTTCTTGGGTTACTTCCAGTTCCAGAATTATCTTATCTTGAGTAGAATTTTTAGAAATAACAACATATGCAATAGCTTAGAAATTCACCTCAAACATG
ACTCTATAGTTCCTGCAGACTGGGATTCATCTTTCTGAGATATAATGATATAAATTTGTAGTGTTTGGGGAATGTTAATTATTGTATTTCTTTGACCCAGATTGGCTTAA
ATTTGGATTAAAAGGTTGGTTTGATTAAGAGCTTTTTGGAAGTGTCTGGTTTTTAGGCATTATTATGATCTCGTTTGGAAGTTAAGTTGTTGAGTTCAACAGGTAAAAGT
GGGGAGATTTAAACCTTTGACCTTTTGGTTGAGAACTTAAAAGTATATGCCTCGACCAGTTGTGCTATGTTCAACTTGGCAATTAAAAGCTACAATGGAAACTTTTTTTT
TTCTTGGTAATTTGGTTAAATTCAATAAGCTAGGTGTCTTAGTTTGATTGGAAGTGAACCGTCAATATTTTTGTAATTATTTAAGCGGAATCAACTAAAAAC
Protein sequenceShow/hide protein sequence
MARVYELQSLSSPFLSRPTIPRSPNFKPICSFHCSSSSSSSSSPFTEKHSVKRYQRDDWLYKNQADQSSIAPSCSIPCDSESIRKNDIALQLPELKKLLQVLREKRVNSG
CDDGKCGPGNVFLVGTGPGDPELLTLKAVKVIQSADLLLYDRLVSNDVLDLVGPDARLLYVGKTAGYHSRTQEEIHELLLNFAEAGATVVRLKGGDPLVFGRGGEEMDFL
QQQGIQVKIVPGITAASGISAELGIPLTHRGVATSVRFLTGHSRKGGTDPLFVAENAADPDSTLVVYMGLSTLPSLAIKLMQHGLPPDTPAAAVERGTTPQQRTVFAELK
NLADEIKAAELVSPTLIIIGKVVSLSPHWSLSSKEASSLVEA