CuGenDBv2

Lag0034271 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0034271
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Genome location: chr3:5867340..5870775
RNA-Seq Expression: Lag0034271
Synteny: Lag0034271
Gene Ontology terms:
GO:0016491 - oxidoreductase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR026992 - Non-haem dioxygenase N-terminal domain
IPR027443 - Isopenicillin N synthase-like superfamily
IPR044861 - Isopenicillin N synthase-like, Fe(2+) 2OG dioxygenase domain


Homology
GenBank top hits: e value, % identity, alignment
KAG6587867.1 Protein SRG1, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.6e-162; % identity: 80.56)
Query:  MEEQSHPPRPQTLQQQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRIS
        M ++SHP   QTLQQQLL+NGGDTPESYIYKGGY  GDS +N+PLP+AEIPVVDLSQLS S  GEA LEE RLALSSWGCFQA NHGISSSFL+KMR+IS
Subjt:  MEEQSHPPRPQTLQQQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRIS

Query:  EEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQ
        E+FFALP+EEKNRY RE DG EG G+DLILSE QILDWT RLYL+VNPED+R L+FWP+NP SF ED+ EFT+KVKEI+E VL+AMAASL +E KSFS+Q
Subjt:  EEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQ

Query:  MGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRIS
        +G RPVLVTRFNFYPPC TPHLVLGLKEHSDGSA TIVLLDK+VEGL+++KD+  YR+PVPAIADSLLIN+GEQ +IMSNGIFKSP+HRAVTNSERQRIS
Subjt:  MGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRIS

Query:  VACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYV-ETYFQYYQKGQRPVDGLKI
        VACFCCPEKDKEIKPIEGLIDE RPRLF +VKNYV ETYFQYYQKGQR VD LKI
Subjt:  VACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYV-ETYFQYYQKGQRPVDGLKI

TYK10347.1 protein SRG1-like [Cucumis melo var. makuwa] (e value: 1.9e-152; % identity: 74.65)
Query:  MEEQSHPPRPQTLQQQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRIS
        M + S  P+ QTLQQQLL+NGG TPESYIYKGGY  GDS++N PLPLA+IPV+DLSQLS ++ GEAPL  LRLALS+WGCFQA NHGISSSFL+KMR+IS
Subjt:  MEEQSHPPRPQTLQQQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRIS

Query:  EEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSF-WEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSE
        E+FF+LPIEEK RY RE DG+EG GNDLI S QQILDW+ RLYL+ NP+D+R L+FWP NP SF  ED+ E+TVK+ EI+ETVL+AMA SLN+E  SF++
Subjt:  EEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSF-WEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSE

Query:  QMGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRI
        Q+G RP L  RFNFYPPC TPHLVLGLKEHSDG+AIT++LLDKQVEGL+LRKDD  YRVPVPA+ADSLLI +GEQ +IMSNGIFKS IHRAVTNSERQRI
Subjt:  QMGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRI

Query:  SVACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        SV  FCCPEKD EIKPIEGLIDE RPRLF S KNY++TYFQ YQKG+R VDGL+I
Subjt:  SVACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

XP_008465534.1 PREDICTED: protein SRG1-like [Cucumis melo] (e value: 5.0e-153; % identity: 74.29)
Query:  MEEQSHPPRPQTLQQQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRIS
        M + S  P+ QTLQQQLL+NGG TPESYIYKGGY  GDS++N PLPLA+IPV+DLSQLS ++ GEAPL  LRLALS+WGCFQA NHGISSSFL+KMR+IS
Subjt:  MEEQSHPPRPQTLQQQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRIS

Query:  EEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQ
        E+FF+LPIEEK RY RE DG+EG GNDLI S QQILDW+ RLYL+ NP+D+R L+FWP NP SF ED+ E+TVK+ +I++TVL+AMA SLN+E  SF++Q
Subjt:  EEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQ

Query:  MGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRIS
        +G RP L  RFNFYPPC TPHLVLGLKEHSDG+AIT++LLDKQVEGL+LRKDD  YRVPVPA+ADSLLI +GEQ +IMSNGIFKS IHRAVTNSERQRIS
Subjt:  MGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRIS

Query:  VACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        V  FCCPEKD EIKPIEGLIDE RPRLF S KNY++TYFQ YQKG+R VDGL+I
Subjt:  VACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

XP_011655280.1 probable 2-oxoglutarate-dependent dioxygenase ANS isoform X2 [Cucumis sativus] (e value: 1.4e-152; % identity: 74.29)
Query:  MEEQSHPPRPQTLQQQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRIS
        M + S   + QTLQQQLL+NGG TPESYIYKGGY  G S++N PLPLAEIPVVDLSQLS  +AGE PL +LRLALS+WGCFQA NH ISSSFL+K+R+IS
Subjt:  MEEQSHPPRPQTLQQQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRIS

Query:  EEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQ
        E+FF+LPIEEK RY RE DG+EG GNDL  S QQ LDW+ RLY + +PED+R L  WP NP SF ED+ E+TVK+ EI+ETVL+AMA SLN+E  SF++Q
Subjt:  EEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQ

Query:  MGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRIS
        +G RP L TRFNFYPPC TPHLVLGLKEHSDGSAITI+LLDKQVEGLQLRKDD  YRVPVPAIADSLL+ +GEQ ++MSNGIFKS +HRAVTNSERQRIS
Subjt:  MGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRIS

Query:  VACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        V CFCCPEKD EIKP+EGLIDE RPRLF SVKNY+ETYFQ YQ+GQR VDGL+I
Subjt:  VACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

XP_011655698.2 probable 2-oxoglutarate-dependent dioxygenase ANS [Cucumis sativus] (e value: 3.0e-158; % identity: 77.12)
Query:  MEEQSHPPRPQTLQQQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRIS
        M + S   + QTLQQQLL+NGG TPESYIYKGGY  G S++N PLPLAEIPVVDLSQLS  +AGE PL +LRLALS+WGCFQAINH ISSSFL+KMR+IS
Subjt:  MEEQSHPPRPQTLQQQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRIS

Query:  EEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQ
        E+FF+LPIEEK RY RE DG+EG GNDLILSEQQILDW+ RLY + NPED+R L+ WP NP SF ED+DE+TVK+ EI+ETVL+AMA+SL++E  SF++Q
Subjt:  EEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQ

Query:  MGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRIS
        +G RP L+TRFNFYPPC TPHLVLGLKEHSDGSAITI+LLDKQVEGLQLRKDD  YRVPVPAIADSLLI +GEQ ++MSNGIFKS IHRAVTNSERQRIS
Subjt:  MGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRIS

Query:  VACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        + CFCCPEKD EIKPIEGLIDE RPRLF SVKNY+ETYFQ YQKG+RPVDGL+I
Subjt:  VACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

TrEMBL top hits: e value, % identity, alignment
A0A0A0LW96 Uncharacterized protein (e value: 1.2e-128; % identity: 74.41)
Query:  LSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFW
        L   +AGE PL +LRLALS+WGCFQA NH ISSSFL+K+R+ISE+FF+LPIEEK RY RE DG+EG GNDL  S QQ LDW+ RLY + +PED+R L  W
Subjt:  LSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFW

Query:  PQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYR
        P NP SF ED+ E+TVK+ EI+ETVL+AMA SLN+E  SF++Q+G RP L TRFNFYPPC TPHLVLGLKEHSDGSAITI+LLDKQVEGLQLRKDD  YR
Subjt:  PQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYR

Query:  VPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        VPVPAIADSLL+ +GEQ ++MSNGIFKS +HRAVTNSERQRISV CFCCPEKD EIKP+EGLIDE RPRLF SVKNY+ETYFQ YQ+GQR VDGL+I
Subjt:  VPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

A0A1S3CP07 protein SRG1-like (e value: 2.4e-153; % identity: 74.29)
Query:  MEEQSHPPRPQTLQQQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRIS
        M + S  P+ QTLQQQLL+NGG TPESYIYKGGY  GDS++N PLPLA+IPV+DLSQLS ++ GEAPL  LRLALS+WGCFQA NHGISSSFL+KMR+IS
Subjt:  MEEQSHPPRPQTLQQQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRIS

Query:  EEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQ
        E+FF+LPIEEK RY RE DG+EG GNDLI S QQILDW+ RLYL+ NP+D+R L+FWP NP SF ED+ E+TVK+ +I++TVL+AMA SLN+E  SF++Q
Subjt:  EEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQ

Query:  MGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRIS
        +G RP L  RFNFYPPC TPHLVLGLKEHSDG+AIT++LLDKQVEGL+LRKDD  YRVPVPA+ADSLLI +GEQ +IMSNGIFKS IHRAVTNSERQRIS
Subjt:  MGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRIS

Query:  VACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        V  FCCPEKD EIKPIEGLIDE RPRLF S KNY++TYFQ YQKG+R VDGL+I
Subjt:  VACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

A0A5A7U2F3 Protein SRG1-like (e value: 2.7e-152; % identity: 74.65)
Query:  MEEQSHPPRPQTLQQQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRIS
        M + S  P+ QTLQQQLL+NGG TPESYIYKGGY  GDS++N PLPLA+IPV+DLSQLS ++ GEAPL  LRLALS+WGCFQA NHGISSSFL+KMR+IS
Subjt:  MEEQSHPPRPQTLQQQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRIS

Query:  EEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSF-WEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSE
        E+FF+LPIEEK RY RE DG+EG GNDLI S QQILDW+ RLYL+ NPED R L+FWP NP SF  ED+ E+TVK+ EI+ETVL+AMA SLN+E  SF++
Subjt:  EEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSF-WEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSE

Query:  QMGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRI
        Q+G RP L  RFNFYPPC TPHLVLGLKEHSDG+AIT++LLDKQVEGL+LRKDD  YRVPVPA+ADSLLI +GEQ +IMSNGIFKS IHRAVTNSER+RI
Subjt:  QMGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRI

Query:  SVACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        SV  FCCPEKD EIKPIEGLIDE RPRLF S KNY++TYFQ YQKG+R VDGL+I
Subjt:  SVACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

A0A5D3CEG2 Protein SRG1-like (e value: 9.1e-153; % identity: 74.65)
Query:  MEEQSHPPRPQTLQQQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRIS
        M + S  P+ QTLQQQLL+NGG TPESYIYKGGY  GDS++N PLPLA+IPV+DLSQLS ++ GEAPL  LRLALS+WGCFQA NHGISSSFL+KMR+IS
Subjt:  MEEQSHPPRPQTLQQQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRIS

Query:  EEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSF-WEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSE
        E+FF+LPIEEK RY RE DG+EG GNDLI S QQILDW+ RLYL+ NP+D+R L+FWP NP SF  ED+ E+TVK+ EI+ETVL+AMA SLN+E  SF++
Subjt:  EEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSF-WEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSE

Query:  QMGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRI
        Q+G RP L  RFNFYPPC TPHLVLGLKEHSDG+AIT++LLDKQVEGL+LRKDD  YRVPVPA+ADSLLI +GEQ +IMSNGIFKS IHRAVTNSERQRI
Subjt:  QMGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRI

Query:  SVACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        SV  FCCPEKD EIKPIEGLIDE RPRLF S KNY++TYFQ YQKG+R VDGL+I
Subjt:  SVACFCCPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

A0A6J1BYU1 uncharacterized protein LOC111006992 (e value: 5.0e-151; % identity: 75.29)
Query:  QTLQQQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEE
        +T QQ+LL+NGGDTPESYIYK GYG GDS +NNPLPLAEIPVVDL+QLS S    A LE+LRLAL+SWGCFQAINH ISSSFL+K+ +IS +FF+LP+EE
Subjt:  QTLQQQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEE

Query:  KNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRPVLVTR
        KN+ CRE  G+EG G D++ SEQQILDWT RLYL VNPED+R L++WPQNPQSF ED+ EFT+K+K+I+ETVLMAMA S+N+EA SF+EQ+G RP L TR
Subjt:  KNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRPVLVTR

Query:  FNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKD
        FNFYPPC  P LVLGLKEHSDGSAITIVLLD++VEGLQ RKDD  +RVPVPA+ADSLLIN+GEQ +IMSNG+FKS +HRAVTNSE+QRISVACFCCPEKD
Subjt:  FNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKD

Query:  KEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        +EI+PIEGLIDE RPRL+ +VKNYV +YFQ YQKGQRPVD LKI
Subjt:  KEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

SwissProt top hits: e value, % identity, alignment
A2A1A0 S-norcoclaurine synthase 1 (e value: 3.4e-48; % identity: 35.4)
Query:  EIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNP
        EIPV+DLS+L         L +   A   WG FQ INHG+    ++KM+  +E+FF LP +EKN Y +  +G+EG G   + SE+Q LDW    +LI  P
Subjt:  EIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNP

Query:  EDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQ
          +R+++FWP +P SF E +++++++++++   +   MA +L +E++  ++ +  R V        P   +    LGL  HSD + +T+++   +V GL 
Subjt:  EDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQ

Query:  LRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDE
        ++KD+    VP+  I  + ++N+G+  +IMSNGI+KS  HRAV N++++R+S+A F  PE   +I P+  L+ E
Subjt:  LRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDE

D4N502 Codeine O-demethylase (e value: 6.9e-49; % identity: 36.3)
Query:  IPVVDLSQ-LSPS-AAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVN
        +PV+DL   LSP    G+  L++L  A   WG FQ +NHG+ +  +D ++   + FF LP+ EK +Y ++    EG G   I SE Q LDWT    ++  
Subjt:  IPVVDLSQ-LSPS-AAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVN

Query:  PEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGL
        P   R    +P+ P  F E ++ +  K+K++   V   +  SL +        +    +   R N+YPPCP P LVLGL  HSD S +TI+L   +VEGL
Subjt:  PEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGL

Query:  QLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLF
        Q+RK++    + +  + D+ ++N+G+  +IM+NGI++S  HRAV NS ++R+S+A F   + + EI PI  L+    P LF
Subjt:  QLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLF

O80449 Jasmonate-induced oxygenase 4 (e value: 1.9e-51; % identity: 35.06)
Query:  EIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNP
        EIPV+D++ +     G   L  +R A   WG FQ +NHG++ S ++++R    EFF LP+EEK +Y    D  EG G+ L + +   LDW+   +L   P
Subjt:  EIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNP

Query:  EDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQM--GNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEG
           R+   WP  P    E ++++  +V+++ E +   ++ SL ++     + +  G++     R NFYP CP P L LGL  HSD   ITI+L D++V G
Subjt:  EDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQM--GNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEG

Query:  LQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLFTSVK--NYVETYFQYYQK
        LQ+R+ D    V + ++ ++L++N+G+Q QI+SNGI+KS  H+ + NS  +R+S+A F  P  D  + PIE L+   RP L+  ++   Y     Q    
Subjt:  LQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLFTSVK--NYVETYFQYYQK

Query:  GQRPVDGL
        G+  VD L
Subjt:  GQRPVDGL

Q39224 Protein SRG1 (e value: 1.2e-53; % identity: 33.98)
Query:  EIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNP
        EIP++D+ +L  S   ++ +E+L  A   WG FQ +NHGI SSFLDK++   ++FF LP+EEK ++ +  D +EG G   ++SE Q LDW    +  V P
Subjt:  EIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNP

Query:  EDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGN-RPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGL
         + R    +P+ P  F + ++ ++ +V+ + + ++  MA +L I+ +   +   +   V   R N+YPPCP P  V+GL  HSD   +T+++    VEGL
Subjt:  EDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGN-RPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGL

Query:  QLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLF--TSVKNYVETYFQYYQKG
        Q++KD     VPV  + ++ ++N+G+  +I++NG ++S  HR V NSE++R+S+A F      KE+ P + L++  +   F   ++K Y +  F     G
Subjt:  QLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLF--TSVKNYVETYFQYYQKG

Query:  QRPVDGLKI
        +  +D L+I
Subjt:  QRPVDGLKI

Q94LP4 2-oxoglutarate-dependent dioxygenase 11 (e value: 2.4e-49; % identity: 32.33)
Query:  PESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKNRYCREGDGVEGC
        PE YI      S +   NN      IP++DL +L    + E    +LR A   WG F  INHG+    +  ++R   +FF+ P++ K  Y +  + +EG 
Subjt:  PESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKNRYCREGDGVEGC

Query:  GNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRPVLVTRFNFYPPCPTPHLVL
        G   + SE Q LDW   LYL V+P D R L+FWP +P SF + +D ++ + K +   +   MA ++  + +S  +    +P    R  +YPPC     V+
Subjt:  GNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRPVLVTRFNFYPPCPTPHLVL

Query:  GLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGR
        GL  HSD   +T++L    V+GLQ++KD   + +  P    +L+ N+G+  +I+SNG F+S  HRAV N  ++RIS A F  P ++  I P+   + +G+
Subjt:  GLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGR

Query:  PRLFT-SVKNYVETYFQYYQKGQRPVDGLKI
         +  + S  ++++  F     G+  V+ LK+
Subjt:  PRLFT-SVKNYVETYFQYYQKGQRPVDGLKI

Arabidopsis top hits: e value, % identity, alignment
AT1G49390.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 2.0e-96; % identity: 48.54)
Query:  QQLLVNGGDTPESYIYKGGYGSGDSDS-NNPLPLAEIPVVDLSQL-SPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKN
        Q+++  G   PE Y++    G G+S   N  +P  +IP +DLS L S S  G+  +++L  ALS+WG  Q +NHGI+ +FLDK+ +++++FFALP EEK+
Subjt:  QQLLVNGGDTPESYIYKGGYGSGDSDS-NNPLPLAEIPVVDLSQL-SPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKN

Query:  RYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRPVLVTRFN
        +  RE   ++G GND+ILS+ Q+LDW  RL+L   PED+R L+FWPQ P  F E +DE+T+K + ++E    AMA SL +E   F E  G   V+ +RFN
Subjt:  RYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRPVLVTRFN

Query:  FYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKE
        F+PPCP P  V+G+K H+DGSAIT++L DK VEGLQ  KD   Y+ P+  + D++LI LG+Q +IMSNGI+KSP+HR VTN E++RISVA FC P  DKE
Subjt:  FYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKE

Query:  IKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        I P +GL+ E RPRL+ +V  YV+ +++YYQ+G+R ++   I
Subjt:  IKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

AT3G21420.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 4.2e-57; % identity: 38.14)
Query:  EIPVVDLSQLSPSAAGEAPLEELRL--ALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIV
        +IPV+DLS+LS     +   E L+L  A   WG FQ INHGI    ++ +  ++ EFF +P+EEK +Y  E   V+G G   I SE Q LDW +   L V
Subjt:  EIPVVDLSQLSPSAAGEAPLEELRL--ALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIV

Query:  NPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDK-QVE
        +P   R+ + WP  P  F E ++ ++ +++E+ + +L  +A SL ++ + F E  G   V   R N+YPPC +P LVLGL  HSDGSA+T++   K    
Subjt:  NPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRPVLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDK-QVE

Query:  GLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKEIKPIEGLI-DEGRPRLFTSVK--NYVETYFQYY
        GLQ+ KD+    VPV  + ++L+IN+G+  +++SNG +KS  HRAVTN E++R+++  F  P  + EI+P+  L+ DE  P  + S    +Y   Y    
Subjt:  GLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKEIKPIEGLI-DEGRPRLFTSVK--NYVETYFQYY

Query:  QKGQRPVDGLKI
         +G++ +D  KI
Subjt:  QKGQRPVDGLKI

AT5G20400.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 4.9e-98; % identity: 49.27)
Query:  QQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLS-QLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKNR
        Q+++  G   PE Y++           N  +P  +IP +DL+  LS S AG+  L +L  ALS+WG  Q +NHGI+ +FLDK+ ++++EFFALP EEK +
Subjt:  QQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLS-QLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKNR

Query:  YCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRPVLVTRFNF
          RE D ++G GND+IL + Q+LDW  RLY+   PEDQR L FWP+ P  F E + E+T+K + ++E    AMA SL +E  SF +  G    L TRFN 
Subjt:  YCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRPVLVTRFNF

Query:  YPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKEI
        YPPCP+P  V+G+K H+DGSAIT++L DK V GLQ +KD   Y+ P+  + D++LIN+G+Q +IMSNGI+KSP+HR VTN E++RISVA FC P  DKEI
Subjt:  YPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKEI

Query:  KPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
        +P+  L+ E RPRL+ +VK YVE YF+YYQ+G+RP++   I
Subjt:  KPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI

AT5G20550.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 3.3e-94; % identity: 48.81)
Query:  QQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLS-QLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKNR
        Q+++  G   PE Y+            N  +P+ +IP +DLS  LSPS  G   L +L  ALS+WG  Q INHGI+ + LDK+ ++++EF ALP EEK +
Subjt:  QQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLS-QLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEEKNR

Query:  YCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRPVLVTRFNF
        Y RE   ++G GND+IL + Q+LDW  RLY+   PEDQR L+FWP  P  F E + E+T+K   +   V  AMA SL +E   F +  G    + TRFN 
Subjt:  YCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRPVLVTRFNF

Query:  YPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKEI
        YPPCP P  V+G++ H+D SA T++L DK VEGLQ  KD   Y+ PV A +D++LIN+G+Q +IMSNGI+KSP+HR VTN+E++RISVA FC P  DKEI
Subjt:  YPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKEI

Query:  KPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPV
        +P++GL+ E RPRL+  VKNYV+   +YY +GQRP+
Subjt:  KPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPV

AT5G54000.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 2.5e-94; % identity: 47.28)
Query:  PRPQTLQQQLLVNGGDTPESYIY-KGGYGSGDSDSNNPLPLAEIPVVDLSQL-SPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFA
        P+ +T+ Q+++  G   PE Y+Y   G G GD   N  LP  +I ++DL+ L S S  G   L +L  A+S+WG  Q +NHGIS + LDK+  ++++FF 
Subjt:  PRPQTLQQQLLVNGGDTPESYIY-KGGYGSGDSDSNNPLPLAEIPVVDLSQL-SPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFA

Query:  LPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRP
        LP +EK +Y RE    +G GND+ILS+ Q+LDW  RLYLI  PEDQR L+FWP+NP  F E + E+T+K + ++E    A+A SL +E   F E  G   
Subjt:  LPIEEKNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRP

Query:  VLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFC
         L TRFN YPPCP P  VLGLK HSDGSA T++L DK VEGLQ  KD   Y+  +  +  ++LIN+G+  ++MSNGI+KSP+HR V N +++RI VA FC
Subjt:  VLVTRFNFYPPCPTPHLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFC

Query:  CPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
          ++DKEI+P+ GL+ E RPRL+ +VK   + +F YYQ+G+RP++   I
Subjt:  CPEKDKEIKPIEGLIDEGRPRLFTSVKNYVETYFQYYQKGQRPVDGLKI
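
The middle line of each alignment block above appears to follow the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The Python sketch below counts these for a single block. The function name `alignment_stats` is hypothetical and is not part of CuGenDB. The % identity values listed for each hit are computed over the whole alignment, so a single block will generally give a different figure.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block.

    Assumes the midline uses letters for identities and '+' for
    conservative substitutions; trailing spaces may have been stripped.
    """
    length = len(query)
    # Restore stripped trailing spaces so both lines cover the same columns.
    midline = midline.ljust(length)[:length]
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    percent = round(100.0 * identities / length, 2) if length else 0.0
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": percent,
    }


# First 13 columns of the KAG6587867.1 alignment shown above.
print(alignment_stats("MEEQSHPPRPQTL", "M ++SHP   QTL"))
```

To approximate a hit's overall figure, the counts from all of its blocks would need to be summed before dividing by the total aligned length.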


Sequences
CDS sequence
ATGGAAGAACAATCTCACCCCCCACGCCCACAAACACTCCAACAACAACTTCTCGTCAACGGCGGCGACACGCCGGAAAGCTATATTTACAAAGGCGGCTACGGCAGCGG
AGATTCCGACAGTAATAATCCACTTCCACTGGCAGAGATTCCAGTCGTTGACCTCTCCCAACTGTCTCCGTCGGCGGCCGGCGAGGCTCCGTTAGAGGAGCTCCGGCTGG
CTCTGAGTTCATGGGGATGTTTTCAGGCGATTAATCACGGCATTTCAAGTTCGTTTCTGGACAAGATGCGTCGAATAAGTGAGGAATTTTTTGCACTGCCGATTGAAGAG
AAGAACAGATATTGCAGAGAAGGTGATGGGGTTGAAGGATGTGGGAATGATTTGATTTTGTCAGAGCAACAAATTCTTGATTGGACCCATCGCTTGTATCTTATTGTGAA
TCCAGAGGATCAGAGACACCTCCAGTTTTGGCCTCAAAATCCTCAATCTTTCTGGGAAGATGTAGACGAGTTTACAGTAAAAGTAAAGGAGATAATGGAAACTGTGCTGA
TGGCCATGGCAGCATCCTTAAACATAGAGGCGAAGAGCTTTTCAGAGCAGATGGGAAACCGCCCTGTTTTAGTGACAAGGTTCAACTTCTATCCGCCATGTCCGACGCCT
CACCTTGTTCTTGGTCTCAAAGAACACTCCGATGGCTCAGCCATCACCATTGTTTTACTCGACAAACAAGTTGAAGGCCTCCAATTGCGCAAGGATGACCACTGCTACAG
AGTCCCTGTTCCTGCCATTGCCGATTCTCTTCTCATCAACCTTGGGGAACAACCTCAGATTATGAGCAATGGGATCTTCAAAAGCCCTATTCATCGGGCAGTGACGAATT
CAGAGAGGCAGAGGATTTCAGTGGCATGTTTCTGTTGCCCAGAAAAAGATAAAGAGATCAAGCCCATCGAGGGATTGATCGACGAGGGGAGGCCGAGATTGTTCACAAGT
GTGAAGAACTATGTTGAAACCTATTTTCAGTACTACCAGAAGGGTCAGAGACCAGTTGATGGATTGAAGATTTAG
mRNA sequence
ATGGAAGAACAATCTCACCCCCCACGCCCACAAACACTCCAACAACAACTTCTCGTCAACGGCGGCGACACGCCGGAAAGCTATATTTACAAAGGCGGCTACGGCAGCGG
AGATTCCGACAGTAATAATCCACTTCCACTGGCAGAGATTCCAGTCGTTGACCTCTCCCAACTGTCTCCGTCGGCGGCCGGCGAGGCTCCGTTAGAGGAGCTCCGGCTGG
CTCTGAGTTCATGGGGATGTTTTCAGGCGATTAATCACGGCATTTCAAGTTCGTTTCTGGACAAGATGCGTCGAATAAGTGAGGAATTTTTTGCACTGCCGATTGAAGAG
AAGAACAGATATTGCAGAGAAGGTGATGGGGTTGAAGGATGTGGGAATGATTTGATTTTGTCAGAGCAACAAATTCTTGATTGGACCCATCGCTTGTATCTTATTGTGAA
TCCAGAGGATCAGAGACACCTCCAGTTTTGGCCTCAAAATCCTCAATCTTTCTGGGAAGATGTAGACGAGTTTACAGTAAAAGTAAAGGAGATAATGGAAACTGTGCTGA
TGGCCATGGCAGCATCCTTAAACATAGAGGCGAAGAGCTTTTCAGAGCAGATGGGAAACCGCCCTGTTTTAGTGACAAGGTTCAACTTCTATCCGCCATGTCCGACGCCT
CACCTTGTTCTTGGTCTCAAAGAACACTCCGATGGCTCAGCCATCACCATTGTTTTACTCGACAAACAAGTTGAAGGCCTCCAATTGCGCAAGGATGACCACTGCTACAG
AGTCCCTGTTCCTGCCATTGCCGATTCTCTTCTCATCAACCTTGGGGAACAACCTCAGATTATGAGCAATGGGATCTTCAAAAGCCCTATTCATCGGGCAGTGACGAATT
CAGAGAGGCAGAGGATTTCAGTGGCATGTTTCTGTTGCCCAGAAAAAGATAAAGAGATCAAGCCCATCGAGGGATTGATCGACGAGGGGAGGCCGAGATTGTTCACAAGT
GTGAAGAACTATGTTGAAACCTATTTTCAGTACTACCAGAAGGGTCAGAGACCAGTTGATGGATTGAAGATTTAG
Protein sequence
MEEQSHPPRPQTLQQQLLVNGGDTPESYIYKGGYGSGDSDSNNPLPLAEIPVVDLSQLSPSAAGEAPLEELRLALSSWGCFQAINHGISSSFLDKMRRISEEFFALPIEE
KNRYCREGDGVEGCGNDLILSEQQILDWTHRLYLIVNPEDQRHLQFWPQNPQSFWEDVDEFTVKVKEIMETVLMAMAASLNIEAKSFSEQMGNRPVLVTRFNFYPPCPTP
HLVLGLKEHSDGSAITIVLLDKQVEGLQLRKDDHCYRVPVPAIADSLLINLGEQPQIMSNGIFKSPIHRAVTNSERQRISVACFCCPEKDKEIKPIEGLIDEGRPRLFTS
VKNYVETYFQYYQKGQRPVDGLKI
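
The CDS and protein sequences above can be checked against each other by translating the CDS in frame. The Python sketch below does this with the standard genetic code (NCBI translation table 1), which is assumed here because the record does not state a table. The function name `translate` is illustrative, and only the first 30 and last 18 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 nucleotides of the CDS listed above.
cds_start = "ATGGAAGAACAATCTCACCCCCCACGCCCA"
cds_end = "GATGGATTGAAGATTTAG"
print(translate(cds_start))  # MEEQSHPPRP, the start of the protein sequence
print(translate(cds_end))    # DGLKI, the end of the protein sequence
```

Passing the full CDS, with its lines joined, to the same function should reproduce the complete protein sequence if the two records are consistent.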