; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012334 (gene) of Snake gourd v1 genome

Gene IDTan0012334
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSarcosine oxidase
Genome locationLG01:102209939..102211573
RNA-Seq ExpressionTan0012334
SyntenyTan0012334
Gene Ontology termsGO:0046653 - tetrahydrofolate metabolic process (biological process)
GO:0008115 - sarcosine oxidase activity (molecular function)
InterPro domainsIPR006076 - FAD dependent oxidoreductase
IPR006281 - Sarcosine oxidase, monomeric
IPR036188 - FAD/NAD(P)-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7024517.1 putative sarcosine oxidase, partial [Cucurbita argyrosperma subsp. argyrosperma]8.9e-21085.96Show/hide
Query:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD
        MA S T +DVIV+G GVMGSSTAYHLAKTGN+VLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELW+ AEAEIGYRVYFPAEQLDIG SDD
Subjt:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD

Query:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG
        KSLAA V+TC+KHSIPHLVLD  QLAEKYSGRVEIPADWV VWSKYGGVIKPTKAVSMFQ+LA++NGA LKDNAEVV+IK+D SSGG+VVSIANGERFRG
Subjt:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG

Query:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS
        KKCVVTVGAWA+KLVKSVGGI+LPIQPLE T+ YWRIK+GAE EYAIGGDFPTFASYGD YIYGTPSLEFPGLIK+AVHGGH+CDP+KR WG  G+    
Subjt:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS

Query:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK
        + +TALKEWIEGRFGGRVDSSEP +TQLCMYSMTPDEDFVIDFLGG F KDVVIG GFSGHGFKMSP +GRILADLALKGVAEG+ELKYFRI RFEEN K
Subjt:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK

Query:  GNVKNFADQVKLH
        GNVK+FADQV+LH
Subjt:  GNVKNFADQVKLH

XP_022936028.1 probable sarcosine oxidase [Cucurbita moschata]1.5e-20985.96Show/hide
Query:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD
        MA S T +DVIV+G GVMGSSTAYHLAKTGN+VL LEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWR AEAEIGYRVYFPAEQLDIG SDD
Subjt:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD

Query:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG
        KSLAA V+TC+KHSIPHLVLD  QLAEKYSGRVEIPADWV VWSKYGGVIKPTKAVSMFQ+LA+KNGA LKDNAEVV+IK++ SSGG+VVS ANGERFRG
Subjt:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG

Query:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS
        KKCVVTVGAWA+KLVKSVGGI+LPIQPLE T+ YWRIK+GAE EYAIGGDFPTFASYGD YIYGTPSLEFPGLIK+AVHGGH+CDPDKR WG  G+    
Subjt:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS

Query:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK
        + +TALKEWIEGRFGGRVDSSEP +TQLCMYSMTPDEDFVIDFLGG F KDVVIG GFSGHGFKMSP +GRILADLALKGVAEG+ELKYFRI RFEEN K
Subjt:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK

Query:  GNVKNFADQVKLH
        GNVK+FADQV+LH
Subjt:  GNVKNFADQVKLH

XP_022975268.1 probable sarcosine oxidase [Cucurbita maxima]3.1e-21086.2Show/hide
Query:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD
        MA S T +DVIV+G GVMGSSTAYHLAKTGN+VLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWR AEAEIGYRVYFPAEQLDIG SDD
Subjt:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD

Query:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG
        KSLAA V+TC+KHSIPHLVLD  QLAEKYSGRVEIPADWV VWSKYGGVIKPTKAVSMFQ+LA+KNGA LKDNAEVV+IK+D SSGG+VVS+ANGERFRG
Subjt:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG

Query:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS
        KKCVVTVGAWA+KLVKSVGGI+LPIQPLE T+ YWRIK+GAE EYAIGG+FPTFASYGD YIYGTPSLEFPGLIK+AVHGGH+CDPDKR WG  GQ    
Subjt:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS

Query:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK
        + +TALK+WIEGRFGGRVDSSEP +TQLCMYSMTPDEDFVIDFLGG F KDVVIG GFSGHGFKMSP +GRILADLALKGVAEG+ELKYFRI RFEEN K
Subjt:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK

Query:  GNVKNFADQVKLH
        GNVK FADQV+LH
Subjt:  GNVKNFADQVKLH

XP_023534889.1 probable sarcosine oxidase [Cucurbita pepo subsp. pepo]1.2e-20986.2Show/hide
Query:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD
        MA S T +DVIV+G GVMGSSTAYHLAKTGN+VLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWR AEAEIGYRVYFPAEQLDIG SDD
Subjt:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD

Query:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG
        KSLAA V+TC+KHSIPHLVLD  QLAEKY GRVEIPADWV VWSKYGGVIKPTKAVSMFQ+LA+KNGA LKDNAEVV+IK+D SSGG+VVSIANGERFRG
Subjt:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG

Query:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS
        KKCVVTVGAWA+KLVKSVGGI+LPIQPLE T+ YWRIK+GAE EYAIGGDFPTFASYGD YIYGTPSLEFPGLIK+AVHGGH+CDPDKR WG  G+    
Subjt:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS

Query:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK
        + +TALKEWIEGRFGGRVDSSEP +TQLCMYSMTPDEDFVIDFLGG F KDVVIG GFSGHGFKMSP +GRILADLALKGVAEG+ELKYFRI RFE N K
Subjt:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK

Query:  GNVKNFADQVKLH
        GNVK+FADQV+LH
Subjt:  GNVKNFADQVKLH

XP_038896765.1 probable sarcosine oxidase [Benincasa hispida]1.9e-20785.68Show/hide
Query:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD
        MA S T FDVIV+G GVMGSST YHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYH LVMESYELWR AEAEIGYRVYFPAEQLDIGPSDD
Subjt:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD

Query:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG
        KSLAAVV+TCRKHSIPH+VLD  QLAEKYSGRVEIP+DWV VWSKYGGVIKPTKAVSMFQ LAYKNGAVLKDNAEVV+IK+DES+G IVVSIANGE F G
Subjt:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG

Query:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS
        KKCVVTVGAWA+KLVKSV  I+LPIQPLE T+ YWRIK+GAEAEYAIGGDFPTFASYG+PY+YGTPSLEFPGLIK+A+HGGHQC+PDKR WG  G++ +S
Subjt:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS

Query:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK
            ALKEWIE RFGGRVDSS+P ATQLCMYSMTPDEDFVIDFLGG FEKDVVIG GFSGHGFKMSP +GRILA+LALKG AEGVELKYF+I RFEEN K
Subjt:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK

Query:  GNVKNFADQVKL
        GN+K+FADQVKL
Subjt:  GNVKNFADQVKL

TrEMBL top hitse value%identityAlignment
A0A0A0LJG6 Sarcosine oxidase4.3e-20282.08Show/hide
Query:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD
        MA SDTLFDVIV+G GVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYY+ LVMESYELWR AE EIGY+VYFP EQLDIG  DD
Subjt:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD

Query:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG
        KSL AVV+TCRKHSIPHLVLD  +L EKYSGRVEIPADWVGVWSKYGGVIKPTKAVSM+Q+LAYKNGAV+KDNAEVV+IK+DES+G IVVSIANGE FRG
Subjt:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG

Query:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS
        KKCVVTVGAW+KKLVKSVGGI+LPI+PLE ++ YWRIK+G EAEYAIGG FPT ASYG+PY+YGTPSLEFPGLIK+A+HGGH+C+PDKR WG  G++ ++
Subjt:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS

Query:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK
            ALKEWI+ +FGGRVDSS P +TQ CMYSMTPD DFVIDFLGG FEKDVVIG GFSGHGFKMSP +GRILA+LAL G AEGVELKYF++ RFEEN K
Subjt:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK

Query:  GNVKNFADQVKLH
        GNVK+FADQVKLH
Subjt:  GNVKNFADQVKLH

A0A1S3CR44 Sarcosine oxidase5.5e-19780Show/hide
Query:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD
        MA SDT FDVIV+G GVMGSSTAYHLAKTGNRVL+LEQFDFLHHRGSSHGESRTIRATYPEDYYH LVMESYELWR AEAEIG++VY+PAEQLDIGPS+ 
Subjt:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD

Query:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG
        +SL AVV+TC KHSIPHLVLD  +L EKYSGRVEIPA+WV V SKYGGVIKPTKAVSMFQ+LAYKNG VLKDNAEVV+IK+DES+G IVVS ANGE F G
Subjt:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG

Query:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS
        KKCVVTVGAW+KKLVKSVGGI+LPI PLE ++ YWRIK+G EAEYAI G FPT ASYG+PY+YGTPSLEFPGLIK+A+H G+ C+PDKR WG +G++ ++
Subjt:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS

Query:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK
            ALKEWI+ +FGGRVDSS P ++QLCMYSMTPDEDFVIDFLGG FEKDVVIG GFSGHGFKMSP +GRILA+LALKG AEGVELKYF++ RFEEN K
Subjt:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK

Query:  GNVKNFADQVKLHQH
        GNVK+FADQVKLHQ+
Subjt:  GNVKNFADQVKLHQH

A0A5D3E5F8 Sarcosine oxidase9.3e-19780Show/hide
Query:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD
        MA SDT FDVIV+G GVMGSSTAYHLAKTGNRVL+LEQFDFLHHRGSSHGESRTIRATYPEDYYH LVMESYELWR AEAEIG++VY+PAEQLDIGPS+ 
Subjt:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD

Query:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG
        +SL AVV+TC KHSIPHLVLD  +L EKYSGRVEIPA+WV V SKYGGVIKPTKAVSMFQ+LAYKNG VLKDNAEVV+IK+DES+G IVVS ANGE F G
Subjt:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG

Query:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS
        KKCVVTVGAW+KKLVKSVGGI+LPI PLE ++ YWRIK+G EAEYAI G FPT ASYG+PY+YGTPSLEFPGLIK+A+H G+ C+PDKR WG  G++ ++
Subjt:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS

Query:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK
            ALKEWI+ +FGGRVDSS P ++QLCMYSMTPDEDFVIDFLGG FEKDVVIG GFSGHGFKMSP +GRILA+LALKG AEGVELKYF++ RFEEN K
Subjt:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK

Query:  GNVKNFADQVKLHQH
        GNVK+FADQVKLHQ+
Subjt:  GNVKNFADQVKLHQH

A0A6J1FC48 Sarcosine oxidase7.4e-21085.96Show/hide
Query:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD
        MA S T +DVIV+G GVMGSSTAYHLAKTGN+VL LEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWR AEAEIGYRVYFPAEQLDIG SDD
Subjt:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD

Query:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG
        KSLAA V+TC+KHSIPHLVLD  QLAEKYSGRVEIPADWV VWSKYGGVIKPTKAVSMFQ+LA+KNGA LKDNAEVV+IK++ SSGG+VVS ANGERFRG
Subjt:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG

Query:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS
        KKCVVTVGAWA+KLVKSVGGI+LPIQPLE T+ YWRIK+GAE EYAIGGDFPTFASYGD YIYGTPSLEFPGLIK+AVHGGH+CDPDKR WG  G+    
Subjt:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS

Query:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK
        + +TALKEWIEGRFGGRVDSSEP +TQLCMYSMTPDEDFVIDFLGG F KDVVIG GFSGHGFKMSP +GRILADLALKGVAEG+ELKYFRI RFEEN K
Subjt:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK

Query:  GNVKNFADQVKLH
        GNVK+FADQV+LH
Subjt:  GNVKNFADQVKLH

A0A6J1IIQ6 Sarcosine oxidase1.5e-21086.2Show/hide
Query:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD
        MA S T +DVIV+G GVMGSSTAYHLAKTGN+VLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWR AEAEIGYRVYFPAEQLDIG SDD
Subjt:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD

Query:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG
        KSLAA V+TC+KHSIPHLVLD  QLAEKYSGRVEIPADWV VWSKYGGVIKPTKAVSMFQ+LA+KNGA LKDNAEVV+IK+D SSGG+VVS+ANGERFRG
Subjt:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG

Query:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS
        KKCVVTVGAWA+KLVKSVGGI+LPIQPLE T+ YWRIK+GAE EYAIGG+FPTFASYGD YIYGTPSLEFPGLIK+AVHGGH+CDPDKR WG  GQ    
Subjt:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMS

Query:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK
        + +TALK+WIEGRFGGRVDSSEP +TQLCMYSMTPDEDFVIDFLGG F KDVVIG GFSGHGFKMSP +GRILADLALKGVAEG+ELKYFRI RFEEN K
Subjt:  MSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQK

Query:  GNVKNFADQVKLH
        GNVK FADQV+LH
Subjt:  GNVKNFADQVKLH

SwissProt top hitse value%identityAlignment
P79371 Peroxisomal sarcosine oxidase3.4e-6334.07Show/hide
Query:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD
        MA    L+D IVIG G+ G  T YHL K   R+L+LEQF   H RGSSHG+SR IR  Y ED+Y  ++ E Y++W   E E G +++     L +G  ++
Subjt:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD

Query:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGER-FR
        + L  +     +  + H  L   +L +++   + +P   VG+    GGVI   KA+   Q    + G +++D  +VV+I     + G++V++    R ++
Subjt:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGER-FR

Query:  GKKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYG--DPYIYGTPSLEFPGLIKIAVHGGHQCDPDKR--PWGGKG
         K  V+T G W  +L++ + GI++P+Q L   + YWR  +     Y +   FP F   G    +IYG P+ E+PGL+K++ H G+  DP++R  P     
Subjt:  GKKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYG--DPYIYGTPSLEFPGLIKIAVHGGHQCDPDKR--PWGGKG

Query:  QMSMSMSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRF
           + +  + +++ +           EPA  + CMY+ TPDE F++D        ++VIGAGFSGHGFK++PVVG+IL +L++K +    +L  FRI RF
Subjt:  QMSMSMSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRF

Query:  EENQKGNV
            K ++
Subjt:  EENQKGNV

Q29RU9 Peroxisomal sarcosine oxidase1.6e-6535.87Show/hide
Query:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD
        MA    L+D IVIG G+ G  TAYHLAK   +VL+LEQF   H RGSSHG+SR IR  YPED+Y  ++ E Y LW   E E G ++Y     L +G  ++
Subjt:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD

Query:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG
          L  +  T  +  + H  L   +L +++   + +    VG+    GGV+   KA+   Q    + G ++ D  +VV+IK    SG  V+       ++ 
Subjt:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRG

Query:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYG----DPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQ
        K  ++T G W  +L++ +G  +LP+Q L   + YW+ K      Y++   FP F   G      +IYG PS E+PGL+K+  H G+  DP++R       
Subjt:  KKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYG----DPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQ

Query:  MSMSMSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFE
         S    +  L  ++           EPA  + CMY+ TPD  FV+D        ++VIGAGFSGHGFK+SPVVG+IL +L++K +    +L  FRI RF 
Subjt:  MSMSMSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFE

Query:  ENQKGNV
           K ++
Subjt:  ENQKGNV

Q9D826 Peroxisomal sarcosine oxidase1.1e-6134.98Show/hide
Query:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD
        MA     +D IVIG G+ G  TAYHLAK    VL+LEQF   H RGSSHG+SR IR  YPED+Y  ++ E Y+ W   E E G +++   E L +G  ++
Subjt:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD

Query:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANG-ERFR
          L  +  T  +  I H  L    L +++   +      VG+  K GGV+   KA+   Q +  + G  + D  +VV+I+      G+ V++    + ++
Subjt:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANG-ERFR

Query:  GKKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTF--ASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQM
            V+T G W  +L+  + GI+LP+Q L   + YWR K      Y +   FP          +IYG P+ E+PGL+KI  H G   DP++R        
Subjt:  GKKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTF--ASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQM

Query:  SMSMSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEE
        S    +  L  ++     G    +EP   + CMY+ TPDE F++D        ++VIGAGFSGHGFK++PVVG+IL +L++K +    +L  FR+ RF  
Subjt:  SMSMSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEE

Query:  NQKGNV
          K ++
Subjt:  NQKGNV

Q9P0Z9 Peroxisomal sarcosine oxidase4.1e-6434.56Show/hide
Query:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD
        MA    L+D IVIG G+ G  TAYHLAK   R+L+LEQF   H RGSSHG+SR IR  Y ED+Y  ++ E Y++W   E E G +++     L +G  ++
Subjt:  MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDD

Query:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGER-FR
        + L  +     +  + H  L   +L +++   + +P   VG+    GGVI   KA+   Q    + G +++D  +VV+I     + G++V++    R ++
Subjt:  KSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGER-FR

Query:  GKKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYG--DPYIYGTPSLEFPGLIKIAVHGGHQCDPDKR--PWGGKG
         K  V+T G W  +L++ + GI++P+Q L   + YWR  +     Y +   FP F   G    +IYG P+ E+PGL+K++ H G+  DP++R  P     
Subjt:  GKKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYG--DPYIYGTPSLEFPGLIKIAVHGGHQCDPDKR--PWGGKG

Query:  QMSMSMSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRF
           + +  + +++ +           EPA  + CMY+ TPDE F++D        ++VIGAGFSGHGFK++PVVG+IL +L++K +    +L  FRI RF
Subjt:  QMSMSMSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRF

Query:  EENQKGNV
            K ++
Subjt:  EENQKGNV

Q9SJA7 Probable sarcosine oxidase4.7e-16163.94Show/hide
Query:  MAYSDT-LFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSD
        M YSD   FDVIV+G GVMGSS AY LAK G + L+LEQFDFLHHRGSSHGESRTIRATYPEDYY+S+V ES  LW AA++EIGY+V+FP +Q D+GP+D
Subjt:  MAYSDT-LFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSD

Query:  DKSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSG-GIVVSIANGERF
         +SL +VV TC+KH + H V+D   ++E +SGR+ IP +W+GV ++ GG+IKPTKAVSMFQ+LA  +GA+L+DN +V +IK+D  SG G++V    G++F
Subjt:  DKSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSG-GIVVSIANGERF

Query:  RGKKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMS
         GKKC+VT GAW  KLVK+V GID P++PLETT+ YWRIK+G E ++ I G+FPTFASYG PY+YGTPSLE+PGLIK+AVHGG+ CDPDKRPWG      
Subjt:  RGKKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMS

Query:  MSMSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVA--EGVELKYFRIGRFE
          + +  LKEWI+ RFGG VDS  P ATQLCMYSMTPDEDFVIDFLGG F +DVV+G GFSGHGFKM+P VGRILAD+A++  A   GVE+K F + RFE
Subjt:  MSMSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVA--EGVELKYFRIGRFE

Query:  ENQKGNVKNFADQVKL
        +N KGN K + DQV L
Subjt:  ENQKGNVKNFADQVKL

Arabidopsis top hitse value%identityAlignment
AT2G24580.1 FAD-dependent oxidoreductase family protein3.3e-16263.94Show/hide
Query:  MAYSDT-LFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSD
        M YSD   FDVIV+G GVMGSS AY LAK G + L+LEQFDFLHHRGSSHGESRTIRATYPEDYY+S+V ES  LW AA++EIGY+V+FP +Q D+GP+D
Subjt:  MAYSDT-LFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSD

Query:  DKSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSG-GIVVSIANGERF
         +SL +VV TC+KH + H V+D   ++E +SGR+ IP +W+GV ++ GG+IKPTKAVSMFQ+LA  +GA+L+DN +V +IK+D  SG G++V    G++F
Subjt:  DKSLAAVVETCRKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSG-GIVVSIANGERF

Query:  RGKKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMS
         GKKC+VT GAW  KLVK+V GID P++PLETT+ YWRIK+G E ++ I G+FPTFASYG PY+YGTPSLE+PGLIK+AVHGG+ CDPDKRPWG      
Subjt:  RGKKCVVTVGAWAKKLVKSVGGIDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMS

Query:  MSMSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVA--EGVELKYFRIGRFE
          + +  LKEWI+ RFGG VDS  P ATQLCMYSMTPDEDFVIDFLGG F +DVV+G GFSGHGFKM+P VGRILAD+A++  A   GVE+K F + RFE
Subjt:  MSMSMTALKEWIEGRFGGRVDSSEPAATQLCMYSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVA--EGVELKYFRIGRFE

Query:  ENQKGNVKNFADQVKL
        +N KGN K + DQV L
Subjt:  ENQKGNVKNFADQVKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTATTCCGACACTCTATTCGACGTGATTGTCATCGGCGGCGGCGTAATGGGAAGTTCAACGGCATACCATCTCGCCAAAACAGGGAACAGAGTTCTGATTCTCGA
GCAATTCGATTTTCTTCACCACAGAGGCTCTTCTCACGGCGAATCTCGCACCATACGCGCCACCTATCCGGAGGATTACTATCACAGCCTCGTTATGGAATCTTACGAAT
TATGGCGGGCGGCGGAGGCGGAAATCGGCTACAGAGTCTACTTTCCGGCGGAACAGCTCGATATCGGACCTTCCGACGACAAAAGTCTGGCCGCCGTCGTGGAGACCTGC
CGGAAACATTCGATCCCTCATCTGGTACTCGATGGGCGGCAACTGGCGGAGAAGTACTCCGGGAGGGTGGAGATTCCGGCGGATTGGGTGGGGGTGTGGAGCAAGTACGG
CGGCGTAATAAAGCCGACGAAGGCGGTTTCGATGTTCCAAAGTTTGGCCTACAAAAACGGCGCCGTTTTGAAGGACAATGCGGAAGTAGTAGATATCAAGAAAGATGAGA
GTAGTGGAGGAATAGTTGTTTCAATAGCGAATGGGGAGAGATTTAGGGGAAAAAAATGTGTGGTGACAGTTGGAGCTTGGGCTAAAAAGTTAGTTAAATCAGTTGGTGGG
ATTGATTTGCCAATTCAGCCATTAGAGACTACAATTTTCTACTGGAGAATCAAGGACGGGGCCGAGGCCGAGTATGCGATCGGAGGGGACTTCCCGACCTTCGCTAGCTA
TGGCGACCCGTATATTTACGGGACGCCTTCGCTCGAGTTTCCAGGGTTGATTAAGATCGCCGTGCACGGCGGGCATCAATGTGATCCGGACAAGCGCCCGTGGGGGGGCA
AAGGGCAAATGTCAATGTCGATGTCGATGACAGCATTGAAGGAATGGATAGAGGGGAGGTTTGGGGGGAGAGTTGATTCAAGCGAACCGGCGGCGACGCAGTTGTGTATG
TACTCAATGACGCCGGATGAGGACTTTGTGATTGATTTTTTGGGAGGGGGATTTGAGAAGGATGTCGTGATCGGCGCCGGGTTTTCGGGACACGGGTTCAAAATGTCGCC
GGTGGTCGGGCGGATTCTGGCGGATCTAGCATTGAAGGGGGTGGCGGAGGGGGTGGAGCTGAAGTATTTTAGGATAGGAAGGTTTGAGGAGAATCAGAAAGGGAATGTCA
AGAACTTTGCGGATCAAGTAAAGCTTCACCAGCACCTTTCCTCTACTATTTGA
mRNA sequenceShow/hide mRNA sequence
GCATCTTCCAAGAAGCTTCTGTATTTCATCTTTCAACTCCACAACGCGCCTGTTCCATTGCACTCATCGGAGACTCCGCCGAAGAGAGACAGAGACAGAGAGAGAAAAAA
AACAAAAAAAATCCAAATCGGAGAGAGAAATGGCCTATTCCGACACTCTATTCGACGTGATTGTCATCGGCGGCGGCGTAATGGGAAGTTCAACGGCATACCATCTCGCC
AAAACAGGGAACAGAGTTCTGATTCTCGAGCAATTCGATTTTCTTCACCACAGAGGCTCTTCTCACGGCGAATCTCGCACCATACGCGCCACCTATCCGGAGGATTACTA
TCACAGCCTCGTTATGGAATCTTACGAATTATGGCGGGCGGCGGAGGCGGAAATCGGCTACAGAGTCTACTTTCCGGCGGAACAGCTCGATATCGGACCTTCCGACGACA
AAAGTCTGGCCGCCGTCGTGGAGACCTGCCGGAAACATTCGATCCCTCATCTGGTACTCGATGGGCGGCAACTGGCGGAGAAGTACTCCGGGAGGGTGGAGATTCCGGCG
GATTGGGTGGGGGTGTGGAGCAAGTACGGCGGCGTAATAAAGCCGACGAAGGCGGTTTCGATGTTCCAAAGTTTGGCCTACAAAAACGGCGCCGTTTTGAAGGACAATGC
GGAAGTAGTAGATATCAAGAAAGATGAGAGTAGTGGAGGAATAGTTGTTTCAATAGCGAATGGGGAGAGATTTAGGGGAAAAAAATGTGTGGTGACAGTTGGAGCTTGGG
CTAAAAAGTTAGTTAAATCAGTTGGTGGGATTGATTTGCCAATTCAGCCATTAGAGACTACAATTTTCTACTGGAGAATCAAGGACGGGGCCGAGGCCGAGTATGCGATC
GGAGGGGACTTCCCGACCTTCGCTAGCTATGGCGACCCGTATATTTACGGGACGCCTTCGCTCGAGTTTCCAGGGTTGATTAAGATCGCCGTGCACGGCGGGCATCAATG
TGATCCGGACAAGCGCCCGTGGGGGGGCAAAGGGCAAATGTCAATGTCGATGTCGATGACAGCATTGAAGGAATGGATAGAGGGGAGGTTTGGGGGGAGAGTTGATTCAA
GCGAACCGGCGGCGACGCAGTTGTGTATGTACTCAATGACGCCGGATGAGGACTTTGTGATTGATTTTTTGGGAGGGGGATTTGAGAAGGATGTCGTGATCGGCGCCGGG
TTTTCGGGACACGGGTTCAAAATGTCGCCGGTGGTCGGGCGGATTCTGGCGGATCTAGCATTGAAGGGGGTGGCGGAGGGGGTGGAGCTGAAGTATTTTAGGATAGGAAG
GTTTGAGGAGAATCAGAAAGGGAATGTCAAGAACTTTGCGGATCAAGTAAAGCTTCACCAGCACCTTTCCTCTACTATTTGATTAATCTATAGGCATGACATTCGTGGTT
TTCTCTTTCTTCTTCTTTCTCTTATGATTCGGATTATCTCTCTCTGTTTTGGACTTTTAATAATTTAAGTGTCCGTGTGATAGCTATTTAGACTCAATCGATAACAGTTT
CATTTTTTGTTTTTGAAAATTTAGGATAGATAAAACCATGGTAGAAAAATTGGGAGAAAACAAGCTTTATTTTCACATAGTTTATCAAATAGGGC
Protein sequenceShow/hide protein sequence
MAYSDTLFDVIVIGGGVMGSSTAYHLAKTGNRVLILEQFDFLHHRGSSHGESRTIRATYPEDYYHSLVMESYELWRAAEAEIGYRVYFPAEQLDIGPSDDKSLAAVVETC
RKHSIPHLVLDGRQLAEKYSGRVEIPADWVGVWSKYGGVIKPTKAVSMFQSLAYKNGAVLKDNAEVVDIKKDESSGGIVVSIANGERFRGKKCVVTVGAWAKKLVKSVGG
IDLPIQPLETTIFYWRIKDGAEAEYAIGGDFPTFASYGDPYIYGTPSLEFPGLIKIAVHGGHQCDPDKRPWGGKGQMSMSMSMTALKEWIEGRFGGRVDSSEPAATQLCM
YSMTPDEDFVIDFLGGGFEKDVVIGAGFSGHGFKMSPVVGRILADLALKGVAEGVELKYFRIGRFEENQKGNVKNFADQVKLHQHLSSTI