CuGenDBv2

Clc08G06090 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc08G06090
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: TOM1-like protein 4 isoform X1
Genome location: ClcChr08:17505675..17513538
RNA-Seq Expression: Clc08G06090
Synteny: Clc08G06090
Gene Ontology terms:
GO:0043328 - protein transport to vacuole involved in ubiquitin-dependent protein catabolic process via the multivesicular body sorting pathway (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0035091 - phosphatidylinositol binding (molecular function)
GO:0043130 - ubiquitin binding (molecular function)
InterPro domains:
IPR002014 - VHS domain
IPR004152 - GAT domain
IPR008942 - ENTH/VHS
IPR038425 - GAT domain superfamily
IPR044836 - TOM1-like protein, plant
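
The genome location field can be split into a chromosome name and a coordinate pair. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the `chromosome:start..end` layout seen in this record with 1-based, inclusive coordinates, which the page itself does not state.

```python
import re

# Hypothetical helpers, not part of any database API.
# Assumption: locations look like "chrom:start..end" with 1-based inclusive coordinates.
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (chromosome, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start must not exceed end")
    return match["chrom"], start, end


def span_length(text):
    """Length of the interval in base pairs, counting both endpoints."""
    _, start, end = parse_location(text)
    return end - start + 1


if __name__ == "__main__":
    location = "ClcChr08:17505675..17513538"
    print(parse_location(location))
    print(span_length(location))
```

Under the inclusive-coordinate assumption, the locus for this gene spans 7864 bp; with half-open coordinates the figure would be one less.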


Homology
GenBank top hits (e value, % identity, alignment)
KAG6578929.1 TOM1-like protein 3, partial [Cucurbita argyrosperma subsp. sororia] | e value: 0.0e+00 | % identity: 77.26
Query:  TSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWGSNSRTSPQFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDG
        T+EDCIIFRGWDSAA  DDDSQSESGVCSPTLWGSNSRT+ QFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHL+NS+R QDG
Subjt:  TSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWGSNSRTSPQFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDG

Query:  DAASLSRDDSSSETSFRRDRSKNRSETRALVTRSKSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTDSGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEG
        D ASLSRDDS+SETSFRRD SK RSETRALVTRS+SVD+GGFYLKMF PLPFGQ+SAKKK NLRTDSGL+ SSRVSPKPPPVDR+WWRKRS      NEG
Subjt:  DAASLSRDDSSSETSFRRDRSKNRSETRALVTRSKSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTDSGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEG

Query:  SISGGSMTSSGSSNSTSSERS-NSRS------------------DPSAHFGFRVDVVNKFTMPFMSWTRKQSGTRWPVVLVQHENAEIQGQKGL---FGC
        S+SG      GSSNSTSSERS NS S                  DPSA   FRVDV++K T P M   R     +W  ++     A  +G        G 
Subjt:  SISGGSMTSSGSSNSTSSERS-NSRS------------------DPSAHFGFRVDVVNKFTMPFMSWTRKQSGTRWPVVLVQHENAEIQGQKGL---FGC

Query:  KRI-REMSTN--AAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKI
        ++I R+MSTN  AAACAERATND LIAPDWAINIELCDIINMDPRQAKDALKILKKRLA KNPK QLLAL+ L+A+S+NCGDTVF+LIVDRNILHEMVKI
Subjt:  KRI-REMSTN--AAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKI

Query:  VKKKPDSTVRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQSDSSGLSLPEIQN
        VKKKPDS VRDKIL LVDAWQA FGGG KGKYPQYYAAYNELKNAGF+FPPR ENVGQF SPPQI PV E  VS  YDDLA QVSLQSD+SGLSLPEIQN
Subjt:  VKKKPDSTVRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQSDSSGLSLPEIQN

Query:  AQGLADVLLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAE-PPVPSVPYINL
        AQGLADVLLE+LGALD KTPEALKQEVIVDLVDQCRSY SRVVILVNE+TDEELLCQGLVLNDSLQRVLSYHDDIAKGTF TEARR E PPVPSVPY++ 
Subjt:  AQGLADVLLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAE-PPVPSVPYINL

Query:  EDDESEDDFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPS------------TSSSPFYTRQPLFD
        E+DESEDDFTPL+RR TRDH+Y RDRKLANGQSSRVSPLPSPS  K+ GAEM+DHLSGD+YK E SPR VEPPS            +SSSPFYTRQPLF 
Subjt:  EDDESEDDFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPS------------TSSSPFYTRQPLFD

Query:  EPPPRSMPTNTLPATPRDAQSPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSP-TPTRSAEHEEALFKDLVDFAKAKSS
        EPPPRSM TN   AT      PS+LPPPPSR+NQRQ YFEQQKAVTGGT PHLSN Y+SYD IVGNTKNLSL P TP R+AEHEEALFKDL+DF+KA +S
Subjt:  EPPPRSMPTNTLPATPRDAQSPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSP-TPTRSAEHEEALFKDLVDFAKAKSS

Query:  SSSKSNRPF
        SSSKSNRPF
Subjt:  SSSKSNRPF
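
In alignments like the one above, the unlabeled line between Query and Subjt is the match line: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The following is a minimal sketch of how per-segment identity could be tallied from that line; the function names are hypothetical, and the percentages reported on this page are computed over the whole alignment by the search tool, so a single segment will not reproduce them.

```python
# Hypothetical helpers for BLAST-style pairwise text output.
# Assumption: the match line uses letters for identities, '+' for positives,
# and spaces for mismatches or gaps, column-aligned with the query line.

def midline_stats(query, midline):
    """Return (identical, positive, aligned_length) for one alignment segment."""
    # Trailing spaces on the match line are often lost in copies; pad them back.
    midline = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in midline)
    positive = identical + midline.count("+")
    return identical, positive, len(midline)


def percent_identity(query, midline):
    """Identical columns as a percentage of the aligned segment length."""
    identical, _, length = midline_stats(query, midline)
    if length == 0:
        raise ValueError("empty alignment segment")
    return 100.0 * identical / length


if __name__ == "__main__":
    # First 20 columns of the first segment above.
    query = "TSEDCIIFRGWDSAAAVDDD"
    midline = "T+EDCIIFRGWDSAA  DDD"
    print(midline_stats(query, midline))
    print(percent_identity(query, midline))
```

Summing the identical counts and lengths across all segments of a hit, rather than averaging per-segment percentages, gives the closest analogue to the reported % identity.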

KAG7016452.1 TOM1-like protein 2 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 9.3e-309 | % identity: 74.39
Query:  TSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWGSNSRTSPQFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDG
        T+EDCIIFRGWDSAA  DDDSQSESGVCSPTLWGSNSRT+ QFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHL+NS+R QDG
Subjt:  TSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWGSNSRTSPQFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHLTNSKRQQDG

Query:  DAASLSRDDSSSETSFRRDRSKNRSETRALVTRSKSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTDSGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEG
        D ASLSRDDS+SETSFRRD SK RSETRALVTRS+SVD+GGFYLKMF PLPFGQ+SAKKK NLRTDSGL+ SSRVSPKPPPVDR+WWRKRS      NEG
Subjt:  DAASLSRDDSSSETSFRRDRSKNRSETRALVTRSKSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTDSGLSGSSRVSPKPPPVDRDWWRKRSSVAGGENEG

Query:  SISGGSMTSSGSSNSTSSERS-NSRS------------------DPSAHFGFRVDVVNKFTMPFM--SWTRK-------QSGTRWPVVLVQHENAEIQGQ
        S+SG      GSSNSTSSERS NS S                  DPSA   FRVDV++K T P M   W  K        S  R     +Q+ +A  Q  
Subjt:  SISGGSMTSSGSSNSTSSERS-NSRS------------------DPSAHFGFRVDVVNKFTMPFM--SWTRK-------QSGTRWPVVLVQHENAEIQGQ

Query:  KGL---FGCKRI-REMSTN--AAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDR
               G ++I R+MSTN  AAACAERATND LIAPDWAINIELCDIINMDPRQAKDALKILKKRLA KNPK QLLAL+ L+A+S+NCGDTVF+LIVDR
Subjt:  KGL---FGCKRI-REMSTN--AAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDR

Query:  NILHEMVKIVKKK-PDSTVRDKILALVDAWQAAFGGGPKGK-----YPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVS
        NILHEMVKIVKKK PDS VRDKIL LVDAWQA FGGG KG      +         ++NAGF+FPPR ENVGQF SPPQI PV E  VS  YDDLA QVS
Subjt:  NILHEMVKIVKKK-PDSTVRDKILALVDAWQAAFGGGPKGK-----YPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVS

Query:  LQSDSSGLSLPEIQNAQGLADVLLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEAR
        LQSD+SGLSLPEI NAQGLADVLLE+L ALD KTPEALKQEVIVDLVDQCRSY SRVVILVNE+TDEELLCQGLVLNDSLQRVLSYHDDIAKGTF TEAR
Subjt:  LQSDSSGLSLPEIQNAQGLADVLLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEAR

Query:  RAE-PPVPSVPYINLEDDESEDDFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPS-----------
        R E PPVPSVPY++ E+DESEDDFTPL+RR TRDH+Y RDRKLANGQSSRVSPLPSPS  K+ GAEM+DHLSGD+YK E SPR VEPPS           
Subjt:  RAE-PPVPSVPYINLEDDESEDDFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPS-----------

Query:  -TSSSPFYTRQPLFDEPPPRSMPTNTLPATPRDAQSPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSP-TPTRSAEHEE
         +SSSPFYTRQPLF EPPPRSM TN   AT      PS+LPPPPSR+NQRQ YFEQQKAVTGGT PHLSN Y+SYD IVGNTKNLSL P TP R+AEHEE
Subjt:  -TSSSPFYTRQPLFDEPPPRSMPTNTLPATPRDAQSPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSP-TPTRSAEHEE

Query:  ALFKDLVDFAKAKSSSSSKSNRPF
        ALFKDL+DF+KA +SSSSKSNRPF
Subjt:  ALFKDLVDFAKAKSSSSSKSNRPF

TYK23786.1 target of Myb protein 1 [Cucumis melo var. makuwa] | e value: 5.1e-246 | % identity: 88.89
Query:  EIQGQKGLFGCKRIREMSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRN
        EI+ ++  +  K  R MSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRL  KNPKIQLLAL+ LEA+SKNCGDTVF+LIVDRN
Subjt:  EIQGQKGLFGCKRIREMSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRN

Query:  ILHEMVKIVKKK-PDSTVRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQSDSS
        ILHEMVKIVKKK PDSTVR+KILALVDAWQAAFGGG KGKYPQYYAAYN+LKNAGFQFPPREENV QFFSPPQ QPV E PVSA YDDLAVQ SLQSDSS
Subjt:  ILHEMVKIVKKK-PDSTVRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQSDSS

Query:  GLSLPEIQNAQGLADVLLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPV
        GLSLPEIQNAQGL DVLLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHD+IAKGTF TEARRAEPPV
Subjt:  GLSLPEIQNAQGLADVLLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPV

Query:  PSVPYINLEDDESEDDFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPSTSSSPFYTRQPLFDEPPP
        PSVPYIN EDDESEDDFTPLSRRPTRDH+YERDRKLANGQSSRVSPLPSPSS K+A  EM+DHLSGD+YKPE SP+IV+PPSTSSSPFYTRQPLFDEPPP
Subjt:  PSVPYINLEDDESEDDFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPSTSSSPFYTRQPLFDEPPP

Query:  RSMPTNTLPATPRDAQSPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSPTPTRSAEHEEALFKDLVDFAKAKSSSSSKS
        RSMPT+ L  TPRDAQSPS LPPPPSR+NQRQ YFEQQKA TGG QPHLSNDY SYD IVGNTK LSLSPT TRSAEHEEALFKDLVDFAKAK SSSSKS
Subjt:  RSMPTNTLPATPRDAQSPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSPTPTRSAEHEEALFKDLVDFAKAKSSSSSKS

Query:  NRPF
        NRPF
Subjt:  NRPF

XP_008441589.1 PREDICTED: target of Myb protein 1 [Cucumis melo] | e value: 1.6e-244 | % identity: 90.98
Query:  MSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKK-PDS
        MSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRL  KNPKIQLLAL+ LEA+SKNCGDTVF+LIVDRNILHEMVKIVKKK PDS
Subjt:  MSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKK-PDS

Query:  TVRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQSDSSGLSLPEIQNAQGLADV
        TVR+KILALVDAWQAAFGGG KGKYPQYYAAYN+LKNAGFQFPPREENV QFFSPPQ QPV E PVSA YDDLAVQ SLQSDSSGLSLPEIQNAQGL DV
Subjt:  TVRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQSDSSGLSLPEIQNAQGLADV

Query:  LLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPVPSVPYINLEDDESEDD
        LLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHD+IAKGTF TEARRAEPPVPSVPYIN EDDESEDD
Subjt:  LLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPVPSVPYINLEDDESEDD

Query:  FTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPSTSSSPFYTRQPLFDEPPPRSMPTNTLPATPRDAQ
        FTPLSRRPTRDH+YERDRKLANGQSSRVSPLPSPSS K+A  EM+DHLSGD+YKPE SP+IV+PPSTSSSPFYTRQPLFDEPPPRSMPT+ L  TPRDAQ
Subjt:  FTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPSTSSSPFYTRQPLFDEPPPRSMPTNTLPATPRDAQ

Query:  SPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSPTPTRSAEHEEALFKDLVDFAKAKSSSSSKSNRPF
        SPS LPPPPSR+NQRQ YFEQQKA TGG QPHLSNDY SYD IVGNTK LSLSPT TRSAEHEEALFKDLVDFAKAK SSSSKSNRPF
Subjt:  SPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSPTPTRSAEHEEALFKDLVDFAKAKSSSSSKSNRPF

XP_038886345.1 TOM1-like protein 4 isoform X2 [Benincasa hispida] | e value: 1.5e-242 | % identity: 89.66
Query:  KRIREMSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKK
        K  R+MSTNA ACAERATNDVL APDWAINIELCDIINMDPRQAKDALKILKKRLA KNPKIQLLAL+ LEA+SKNCGDTVF+LIVDRNILHEMVKIVKK
Subjt:  KRIREMSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKK

Query:  KPDSTVRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQSDSSGLSLPEIQNAQG
        KPDSTVRDKIL LVDAWQAA GGG KGK+PQYYAAYNELKNAGFQFPPREENV QFFSPPQIQPV EHPVSA YDDLAVQ SLQSDSSGL LPEIQNAQ 
Subjt:  KPDSTVRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQSDSSGLSLPEIQNAQG

Query:  LADVLLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPVPSVPYINLEDDE
        LA VLLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELL QGLVLNDSLQRVLS HDDIAKGTF  EAR AEPPVPSVPYIN EDD+
Subjt:  LADVLLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPVPSVPYINLEDDE

Query:  SEDDFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPSTSS-SPFYTRQPLFDEPPPRSMPTNTLPAT
        SEDDFTPLSRRPTRD++YERDRKLANG SSRVSPLPSPSS K+A  EM+DHLSGDMYKPE SPRIVEPPSTSS SPFYTRQPLFDEPPPRS+ TN L  T
Subjt:  SEDDFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPSTSS-SPFYTRQPLFDEPPPRSMPTNTLPAT

Query:  PRDAQSPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSPTPTRSAEHEEALFKDLVDFAKAKSSSSSKSNRPF
        PRD QSPSALPPPPSR+NQRQ YFEQQKAVTGG+QPHLSNDYSSYD IVGNTKNLSLSPTPTRS EHEE LFKDLVDFAKAKSSSSSK NRPF
Subjt:  PRDAQSPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSPTPTRSAEHEEALFKDLVDFAKAKSSSSSKSNRPF

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KIF1 Uncharacterized protein | e value: 5.3e-241 | % identity: 90.16
Query:  MSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKK-PDS
        MSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRL  KNPKIQLLAL+ LEA+SKNCGDTVF+LIVDRNILHEMVKIVKKK PDS
Subjt:  MSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKK-PDS

Query:  TVRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQSDSSGLSLPEIQNAQGLADV
        TVR+KILALVDAWQAAFGGG +GKYPQYY AYN+LKNAGF+FPPREENV QFFSPPQIQPV E PVSA Y+DLAVQ SLQSDSSGLSLPEIQNAQGL DV
Subjt:  TVRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQSDSSGLSLPEIQNAQGLADV

Query:  LLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPVPSVPYINLEDDESEDD
        LLEMLGALDPKTPEALKQEVI DLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTF  EARR EPPVPSVPYIN EDD SEDD
Subjt:  LLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPVPSVPYINLEDDESEDD

Query:  FTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPSTSSSPFYTRQPLFDEPPPRSMPTNTLPATPRDAQ
         TPLSRRPTRDH+YERDRKLANGQSSRVSPLPSPSS  +A  EM+DHLSGD+YKPE SPRIVEPPST SSPFYTRQPLFDEPPPRSMPTN L  TPRDAQ
Subjt:  FTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPSTSSSPFYTRQPLFDEPPPRSMPTNTLPATPRDAQ

Query:  SPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSPTPTRSAEHEEALFKDLVDFAKAKSSSSSKSNRPF
        SPS LPPPPSR+NQRQ YFEQQKA TGG+QPHLSNDYSSYD +VGNTKNLSLSPTPTRSAEHEEALFKDLVDFAKAK SSSSKSNRPF
Subjt:  SPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSPTPTRSAEHEEALFKDLVDFAKAKSSSSSKSNRPF

A0A1S3B3S9 target of Myb protein 1 | e value: 7.9e-245 | % identity: 90.98
Query:  MSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKK-PDS
        MSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRL  KNPKIQLLAL+ LEA+SKNCGDTVF+LIVDRNILHEMVKIVKKK PDS
Subjt:  MSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKK-PDS

Query:  TVRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQSDSSGLSLPEIQNAQGLADV
        TVR+KILALVDAWQAAFGGG KGKYPQYYAAYN+LKNAGFQFPPREENV QFFSPPQ QPV E PVSA YDDLAVQ SLQSDSSGLSLPEIQNAQGL DV
Subjt:  TVRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQSDSSGLSLPEIQNAQGLADV

Query:  LLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPVPSVPYINLEDDESEDD
        LLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHD+IAKGTF TEARRAEPPVPSVPYIN EDDESEDD
Subjt:  LLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPVPSVPYINLEDDESEDD

Query:  FTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPSTSSSPFYTRQPLFDEPPPRSMPTNTLPATPRDAQ
        FTPLSRRPTRDH+YERDRKLANGQSSRVSPLPSPSS K+A  EM+DHLSGD+YKPE SP+IV+PPSTSSSPFYTRQPLFDEPPPRSMPT+ L  TPRDAQ
Subjt:  FTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPSTSSSPFYTRQPLFDEPPPRSMPTNTLPATPRDAQ

Query:  SPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSPTPTRSAEHEEALFKDLVDFAKAKSSSSSKSNRPF
        SPS LPPPPSR+NQRQ YFEQQKA TGG QPHLSNDY SYD IVGNTK LSLSPT TRSAEHEEALFKDLVDFAKAK SSSSKSNRPF
Subjt:  SPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSPTPTRSAEHEEALFKDLVDFAKAKSSSSSKSNRPF

A0A5D3DJE5 Target of Myb protein 1 | e value: 2.5e-246 | % identity: 88.89
Query:  EIQGQKGLFGCKRIREMSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRN
        EI+ ++  +  K  R MSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRL  KNPKIQLLAL+ LEA+SKNCGDTVF+LIVDRN
Subjt:  EIQGQKGLFGCKRIREMSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRN

Query:  ILHEMVKIVKKK-PDSTVRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQSDSS
        ILHEMVKIVKKK PDSTVR+KILALVDAWQAAFGGG KGKYPQYYAAYN+LKNAGFQFPPREENV QFFSPPQ QPV E PVSA YDDLAVQ SLQSDSS
Subjt:  ILHEMVKIVKKK-PDSTVRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQSDSS

Query:  GLSLPEIQNAQGLADVLLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPV
        GLSLPEIQNAQGL DVLLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHD+IAKGTF TEARRAEPPV
Subjt:  GLSLPEIQNAQGLADVLLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPV

Query:  PSVPYINLEDDESEDDFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPSTSSSPFYTRQPLFDEPPP
        PSVPYIN EDDESEDDFTPLSRRPTRDH+YERDRKLANGQSSRVSPLPSPSS K+A  EM+DHLSGD+YKPE SP+IV+PPSTSSSPFYTRQPLFDEPPP
Subjt:  PSVPYINLEDDESEDDFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPSTSSSPFYTRQPLFDEPPP

Query:  RSMPTNTLPATPRDAQSPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSPTPTRSAEHEEALFKDLVDFAKAKSSSSSKS
        RSMPT+ L  TPRDAQSPS LPPPPSR+NQRQ YFEQQKA TGG QPHLSNDY SYD IVGNTK LSLSPT TRSAEHEEALFKDLVDFAKAK SSSSKS
Subjt:  RSMPTNTLPATPRDAQSPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSPTPTRSAEHEEALFKDLVDFAKAKSSSSSKS

Query:  NRPF
        NRPF
Subjt:  NRPF

A0A6J1FH21 TOM1-like protein 4 isoform X2 | e value: 1.3e-223 | % identity: 83.9
Query:  MSTN-AAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDS
        MSTN AAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLA KNPK QLLAL+ L+A+SKNCGDTVF+LIVDRNILHEMVKIVKKKPDS
Subjt:  MSTN-AAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDS

Query:  TVRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQSDSSGLSLPEIQNAQGLADV
        TVRDKIL LVDAWQA FGGG KGKYPQYYAAYNELKNAGF+FPPR ENVGQF SPPQI PV E  VS  YDDLA QVSLQSD+SGLSLPEIQNAQGLADV
Subjt:  TVRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQSDSSGLSLPEIQNAQGLADV

Query:  LLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAE-PPVPSVPYINLEDDESED
        LLE+LGALD KTPEALKQEVIVDLVDQCRSY SRVVILVNE+TDEELLCQGLVLNDSLQRVLSYHDDIAKGTF TEARR E PPVPSVPY++ E+DESED
Subjt:  LLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAE-PPVPSVPYINLEDDESED

Query:  DFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPS-------TSSSPFYTRQPLFDEPPPRSMPTNTL
        DFTPL+RR TRDH+Y RDRKLANGQSSRVSPLPSPS  K+ GAEM+DHLSGD+YK E SPR VEPPS       +SSSPFYTRQPLF EPPPRSM TN  
Subjt:  DFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPS-------TSSSPFYTRQPLFDEPPPRSMPTNTL

Query:  PATPRDAQSPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSP-TPTRSAEHEEALFKDLVDFAKAKSSSSSKSNRPF
         AT      PS+LPPPPSR+NQRQ YFEQQKAVTGGT PHLSN Y+SYD IVGNTKNLSL P TP R+AEHEEALFKDL+DF+KA +SSSSKSNRPF
Subjt:  PATPRDAQSPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSP-TPTRSAEHEEALFKDLVDFAKAKSSSSSKSNRPF

A0A6J1FHH3 TOM1-like protein 4 isoform X1 | e value: 3.2e-222 | % identity: 83.73
Query:  MSTN-AAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKK-PD
        MSTN AAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLA KNPK QLLAL+ L+A+SKNCGDTVF+LIVDRNILHEMVKIVKKK PD
Subjt:  MSTN-AAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKK-PD

Query:  STVRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQSDSSGLSLPEIQNAQGLAD
        STVRDKIL LVDAWQA FGGG KGKYPQYYAAYNELKNAGF+FPPR ENVGQF SPPQI PV E  VS  YDDLA QVSLQSD+SGLSLPEIQNAQGLAD
Subjt:  STVRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQSDSSGLSLPEIQNAQGLAD

Query:  VLLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAE-PPVPSVPYINLEDDESE
        VLLE+LGALD KTPEALKQEVIVDLVDQCRSY SRVVILVNE+TDEELLCQGLVLNDSLQRVLSYHDDIAKGTF TEARR E PPVPSVPY++ E+DESE
Subjt:  VLLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAE-PPVPSVPYINLEDDESE

Query:  DDFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPS-------TSSSPFYTRQPLFDEPPPRSMPTNT
        DDFTPL+RR TRDH+Y RDRKLANGQSSRVSPLPSPS  K+ GAEM+DHLSGD+YK E SPR VEPPS       +SSSPFYTRQPLF EPPPRSM TN 
Subjt:  DDFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPS-------TSSSPFYTRQPLFDEPPPRSMPTNT

Query:  LPATPRDAQSPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSP-TPTRSAEHEEALFKDLVDFAKAKSSSSSKSNRPF
          AT      PS+LPPPPSR+NQRQ YFEQQKAVTGGT PHLSN Y+SYD IVGNTKNLSL P TP R+AEHEEALFKDL+DF+KA +SSSSKSNRPF
Subjt:  LPATPRDAQSPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSP-TPTRSAEHEEALFKDLVDFAKAKSSSSSKSNRPF

SwissProt top hits (e value, % identity, alignment)
O80910 TOM1-like protein 6 | e value: 1.5e-59 | % identity: 33.64
Query:  STNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDSTV
        S +A    ++AT+D+L+ PDW  N+E+CD +N    QAKD +K +KKRL  K+ ++QLLAL +LE + KNCGD +   + ++NIL EMVKIVKKK D  V
Subjt:  STNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDSTV

Query:  RDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHP----------------------------------VSA
        RDKIL +VD+WQ AF GGP+GKYPQYY AY+EL+ +G +FP R  +     +PP   P    P                                    A
Subjt:  RDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHP----------------------------------VSA

Query:  SYDDLAVQVSLQSDSS------------GLSLPEIQNAQGLADVLLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLND
         Y    V   + S SS            GLSL  I++ + + D+L +ML A+DP   EA+K EVIVDLV++CRS   +++ ++  T D+ELL +GL LND
Subjt:  SYDDLAVQVSLQSDSS------------GLSLPEIQNAQGLADVLLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLND

Query:  SLQRVLSYHDDIAKGTFVTEARRAEP-----------------------------PVPSV-----PYINLEDDESEDDFTPLSRR-------PTRDHVYE
        SLQ +L+ HD IA G+ +       P                             P+P+        I+ E +E ED+F  L+RR        T D    
Subjt:  SLQRVLSYHDDIAKGTFVTEARRAEP-----------------------------PVPSV-----PYINLEDDESEDDFTPLSRR-------PTRDHVYE

Query:  RDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPSTSSSPFYTRQPLFDEPPPRSMPTNT----LPATPRDAQ-SPSALPPPPS
             A+   +   P P P  N +   +M+D LS  +  P        PP+ SS P          PPP     NT     P    D+  +P A    P 
Subjt:  RDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPEESPRIVEPPSTSSSPFYTRQPLFDEPPPRSMPTNT----LPATPRDAQ-SPSALPPPPS

Query:  RHNQRQLY--FEQQKAVTGGTQPHLSNDYSSYDTI
        +   +Q Y   +Q +   G +QP  S     Y  +
Subjt:  RHNQRQLY--FEQQKAVTGGTQPHLSNDYSSYDTI

Q6NQK0 TOM1-like protein 4 | e value: 6.1e-117 | % identity: 51.59
Query:  MSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDST
        M+ +AAACAERATND+LI PDWAINIELCD+INMDP QAK+A+K+LKKRL  KN K+Q+LAL+ LE +SKNCG+ V++LI+DR +L++MVKIVKKKP+  
Subjt:  MSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDST

Query:  VRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQ-SDSSGLSLPEIQNAQGLADV
        VR+KIL L+D WQ AF GG  G+YPQYY AYN+L++AG +FPPR E+   FF+PPQ QP          +D A+Q SLQ  D+S LSL EIQ+A+G  DV
Subjt:  VRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQ-SDSSGLSLPEIQNAQGLADV

Query:  LLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAK-GTFVTEAR--RAEPPVPSVPYINL--EDD
        L++MLGA DP  PE+LK+EVIVDLV+QCR+Y  RV+ LVN TTDEELLCQGL LND+LQ VL  HDDIA  G+  +  R  RA PPV  V  IN   EDD
Subjt:  LLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAK-GTFVTEAR--RAEPPVPSVPYINL--EDD

Query:  ESEDDFTPLSRR---PTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPE--ESPRIVE-----PPSTSSSPFYTRQPLFDEPPPR
        ES+D+F  L+ R   PTR  V+  D                        + M+D LSGD+YKP+   S + V+     PP TSSS   +  P+FD+  P+
Subjt:  ESEDDFTPLSRR---PTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPE--ESPRIVE-----PPSTSSSPFYTRQPLFDEPPPR

Query:  SMPTNTLPATPRDAQSPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSL-SPTPTRSAEHEEALFKDLVDFAKAKSSSSSKS
                   + ++    LPPPPSRHNQRQ +FE   + +G          SSY+   G T+NLSL S  P +  + E+ LFKDLV+FAK +SS ++ +
Subjt:  SMPTNTLPATPRDAQSPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSL-SPTPTRSAEHEEALFKDLVDFAKAKSSSSSKS

Query:  NR
        NR
Subjt:  NR

Q8L860 TOM1-like protein 9 | e value: 1.0e-76 | % identity: 41.97
Query:  ACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDSTVRDKIL
        A  ERAT+++LI PDWA+N+E+CD++N DP QAKD +K +KKR+  +NPK QLLAL +LE + KNCGD V   + ++ ++HEMV+IVKKKPD  V++KIL
Subjt:  ACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDSTVRDKIL

Query:  ALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPV---SASYDDLAVQVSLQSDSSGLSLPEIQNAQGLADVLLEM
         L+D WQ AF GGP+ +YPQYYA Y EL  AG  FP R E     F+PPQ QP+T +P    +A   +   + S + +   LSL EIQNA+G+ DVL EM
Subjt:  ALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPV---SASYDDLAVQVSLQSDSSGLSLPEIQNAQGLADVLLEM

Query:  LGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPVPS--------VPYINLEDDE
        L AL+P   E LKQEV+VDLV+QCR+Y  RVV LVN T+DE LLCQGL LND LQRVL+ ++ IA G   T ++  +P   +         P I+  D  
Subjt:  LGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPVPS--------VPYINLEDDE

Query:  SEDDFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGD--MYKPEESPRIVEPPSTSSSPFYTRQPLFDE-------PPPRSM
        ++ +    S                NG  ++++ LP+P     +    +D LSGD     P   P+   P ++  +         D          P   
Subjt:  SEDDFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGD--MYKPEESPRIVEPPSTSSSPFYTRQPLFDE-------PPPRSM

Query:  PTNTLPATPRDAQSPSA
        P   +P  P+  Q P++
Subjt:  PTNTLPATPRDAQSPSA

Q9C9Y1 TOM1-like protein 8 | e value: 1.5e-67 | % identity: 46.42
Query:  ERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDSTVRDKILALV
        +RAT+D+LI PDWA+N+E+CD++N +P Q ++ +  +KKRL  +  K+QLLAL +LE +  NCG+ +   + +++ILH+MVK+ K+KP+  V++KIL L+
Subjt:  ERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDSTVRDKILALV

Query:  DAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPV-SASYDDLAVQVSLQSDSSGLSLPEIQNAQGLADVLLEMLGALD
        D WQ +F  GP+G++PQYYAAY EL  AG  FP R +      S  Q  P T +P  S +    A+  S +S+   LSL EIQNA+G+ DVL EM+ A+D
Subjt:  DAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPV-SASYDDLAVQVSLQSDSSGLSLPEIQNAQGLADVLLEMLGALD

Query:  PKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFV---TEARRAEPPVPSVPYINLEDDESED
            E LKQEV+VDLV QCR+Y  RVV LVN T+DE +LCQGL LND LQR+L+ H+ IA G  +    E  + E P  +   I++   E+++
Subjt:  PKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFV---TEARRAEPPVPSVPYINLEDDESED

Q9LPL6 TOM1-like protein 3 | e value: 7.6e-128 | % identity: 52.6
Query:  MSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDST
        M+ NAAACAERATND+LI PDWAINIELCDIINM+P QAK+A+K+LKKRL  KN K+Q+LAL+ LE +SKNCG++V++LIVDR+IL +MVKIVKKKPD T
Subjt:  MSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDST

Query:  VRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQS-DSSGLSLPEIQNAQGLADV
        VR+KIL+L+D WQ AFGG   G++PQYY AYNEL++AG +FPPR E+   FF+PPQ QP+     +AS +D A+Q SLQS D+S LS+ EIQ+AQG  DV
Subjt:  VRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQS-DSSGLSLPEIQNAQGLADV

Query:  LLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPVPSVPYINLEDDESEDD
        L +MLGALDP  PE LK+E+IVDLV+QCR+Y  RV+ LVN T+DEEL+CQGL LND+LQRVL +HDD AKG  V        P+ S+ + + +DDES+DD
Subjt:  LLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPVPSVPYINLEDDESEDD

Query:  FTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAE--MMDHLSGDMYKPEESPRIVEPPSTSSSPFYT-RQPLFDEPPPRSM----------
        F  L+ R  R    E  R    G  + + P P PSS +    +   MD LSGD+YKP+E+   V+PPSTS S  +    P+FDEP P+S           
Subjt:  FTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAE--MMDHLSGDMYKPEESPRIVEPPSTSSSPFYT-RQPLFDEPPPRSM----------

Query:  ---PTNTLPATPRDAQSPSALPPPPS-RHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSPT--------PTRSAEHEEALFKDLVDFAK
            T  LP  P + Q P   PP  S R N+R  YF+         Q   S   SSYD ++G ++NLSL+PT        P +  + E+ LFKDL+DFAK
Subjt:  ---PTNTLPATPRDAQSPSALPPPPS-RHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSPT--------PTRSAEHEEALFKDLVDFAK

Query:  AKSSSSSKS------NRPF
         ++SSSS S      N+PF
Subjt:  AKSSSSSKS------NRPF

Arabidopsis top hits (e value, % identity, alignment)
AT1G21380.1 Target of Myb protein 1 | e value: 5.4e-129 | % identity: 52.6
Query:  MSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDST
        M+ NAAACAERATND+LI PDWAINIELCDIINM+P QAK+A+K+LKKRL  KN K+Q+LAL+ LE +SKNCG++V++LIVDR+IL +MVKIVKKKPD T
Subjt:  MSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDST

Query:  VRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQS-DSSGLSLPEIQNAQGLADV
        VR+KIL+L+D WQ AFGG   G++PQYY AYNEL++AG +FPPR E+   FF+PPQ QP+     +AS +D A+Q SLQS D+S LS+ EIQ+AQG  DV
Subjt:  VRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQS-DSSGLSLPEIQNAQGLADV

Query:  LLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPVPSVPYINLEDDESEDD
        L +MLGALDP  PE LK+E+IVDLV+QCR+Y  RV+ LVN T+DEEL+CQGL LND+LQRVL +HDD AKG  V        P+ S+ + + +DDES+DD
Subjt:  LLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPVPSVPYINLEDDESEDD

Query:  FTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAE--MMDHLSGDMYKPEESPRIVEPPSTSSSPFYT-RQPLFDEPPPRSM----------
        F  L+ R  R    E  R    G  + + P P PSS +    +   MD LSGD+YKP+E+   V+PPSTS S  +    P+FDEP P+S           
Subjt:  FTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAE--MMDHLSGDMYKPEESPRIVEPPSTSSSPFYT-RQPLFDEPPPRSM----------

Query:  ---PTNTLPATPRDAQSPSALPPPPS-RHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSPT--------PTRSAEHEEALFKDLVDFAK
            T  LP  P + Q P   PP  S R N+R  YF+         Q   S   SSYD ++G ++NLSL+PT        P +  + E+ LFKDL+DFAK
Subjt:  ---PTNTLPATPRDAQSPSALPPPPS-RHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSPT--------PTRSAEHEEALFKDLVDFAK

Query:  AKSSSSSKS------NRPF
         ++SSSS S      N+PF
Subjt:  AKSSSSSKS------NRPF

AT1G76970.1 Target of Myb protein 1 | e value: 4.3e-118 | % identity: 51.59
Query:  MSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDST
        M+ +AAACAERATND+LI PDWAINIELCD+INMDP QAK+A+K+LKKRL  KN K+Q+LAL+ LE +SKNCG+ V++LI+DR +L++MVKIVKKKP+  
Subjt:  MSTNAAACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDST

Query:  VRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQ-SDSSGLSLPEIQNAQGLADV
        VR+KIL L+D WQ AF GG  G+YPQYY AYN+L++AG +FPPR E+   FF+PPQ QP          +D A+Q SLQ  D+S LSL EIQ+A+G  DV
Subjt:  VRDKILALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQ-SDSSGLSLPEIQNAQGLADV

Query:  LLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAK-GTFVTEAR--RAEPPVPSVPYINL--EDD
        L++MLGA DP  PE+LK+EVIVDLV+QCR+Y  RV+ LVN TTDEELLCQGL LND+LQ VL  HDDIA  G+  +  R  RA PPV  V  IN   EDD
Subjt:  LLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAK-GTFVTEAR--RAEPPVPSVPYINL--EDD

Query:  ESEDDFTPLSRR---PTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPE--ESPRIVE-----PPSTSSSPFYTRQPLFDEPPPR
        ES+D+F  L+ R   PTR  V+  D                        + M+D LSGD+YKP+   S + V+     PP TSSS   +  P+FD+  P+
Subjt:  ESEDDFTPLSRR---PTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGDMYKPE--ESPRIVE-----PPSTSSSPFYTRQPLFDEPPPR

Query:  SMPTNTLPATPRDAQSPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSL-SPTPTRSAEHEEALFKDLVDFAKAKSSSSSKS
                   + ++    LPPPPSRHNQRQ +FE   + +G          SSY+   G T+NLSL S  P +  + E+ LFKDLV+FAK +SS ++ +
Subjt:  SMPTNTLPATPRDAQSPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSL-SPTPTRSAEHEEALFKDLVDFAKAKSSSSSKS

Query:  NR
        NR
Subjt:  NR

AT3G08790.1 ENTH/VHS/GAT family protein | e value: 1.1e-68 | % identity: 46.42
Query:  ERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDSTVRDKILALV
        +RAT+D+LI PDWA+N+E+CD++N +P Q ++ +  +KKRL  +  K+QLLAL +LE +  NCG+ +   + +++ILH+MVK+ K+KP+  V++KIL L+
Subjt:  ERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDSTVRDKILALV

Query:  DAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPV-SASYDDLAVQVSLQSDSSGLSLPEIQNAQGLADVLLEMLGALD
        D WQ +F  GP+G++PQYYAAY EL  AG  FP R +      S  Q  P T +P  S +    A+  S +S+   LSL EIQNA+G+ DVL EM+ A+D
Subjt:  DAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPV-SASYDDLAVQVSLQSDSSGLSLPEIQNAQGLADVLLEMLGALD

Query:  PKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFV---TEARRAEPPVPSVPYINLEDDESED
            E LKQEV+VDLV QCR+Y  RVV LVN T+DE +LCQGL LND LQR+L+ H+ IA G  +    E  + E P  +   I++   E+++
Subjt:  PKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFV---TEARRAEPPVPSVPYINLEDDESED

AT4G32760.1 ENTH/VHS/GAT family protein (e value: 7.4e-78; %identity: 41.97)
Query:  ACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDSTVRDKIL
        A  ERAT+++LI PDWA+N+E+CD++N DP QAKD +K +KKR+  +NPK QLLAL +LE + KNCGD V   + ++ ++HEMV+IVKKKPD  V++KIL
Subjt:  ACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDSTVRDKIL

Query:  ALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPV---SASYDDLAVQVSLQSDSSGLSLPEIQNAQGLADVLLEM
         L+D WQ AF GGP+ +YPQYYA Y EL  AG  FP R E     F+PPQ QP+T +P    +A   +   + S + +   LSL EIQNA+G+ DVL EM
Subjt:  ALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPV---SASYDDLAVQVSLQSDSSGLSLPEIQNAQGLADVLLEM

Query:  LGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPVPS--------VPYINLEDDE
        L AL+P   E LKQEV+VDLV+QCR+Y  RVV LVN T+DE LLCQGL LND LQRVL+ ++ IA G   T ++  +P   +         P I+  D  
Subjt:  LGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPVPS--------VPYINLEDDE

Query:  SEDDFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGD--MYKPEESPRIVEPPSTSSSPFYTRQPLFDE-------PPPRSM
        ++ +    S                NG  ++++ LP+P     +    +D LSGD     P   P+   P ++  +         D          P   
Subjt:  SEDDFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGD--MYKPEESPRIVEPPSTSSSPFYTRQPLFDE-------PPPRSM

Query:  PTNTLPATPRDAQSPSA
        P   +P  P+  Q P++
Subjt:  PTNTLPATPRDAQSPSA

AT4G32760.2 ENTH/VHS/GAT family protein (e value: 7.4e-78; %identity: 41.97)
Query:  ACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDSTVRDKIL
        A  ERAT+++LI PDWA+N+E+CD++N DP QAKD +K +KKR+  +NPK QLLAL +LE + KNCGD V   + ++ ++HEMV+IVKKKPD  V++KIL
Subjt:  ACAERATNDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDSTVRDKIL

Query:  ALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPV---SASYDDLAVQVSLQSDSSGLSLPEIQNAQGLADVLLEM
         L+D WQ AF GGP+ +YPQYYA Y EL  AG  FP R E     F+PPQ QP+T +P    +A   +   + S + +   LSL EIQNA+G+ DVL EM
Subjt:  ALVDAWQAAFGGGPKGKYPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPV---SASYDDLAVQVSLQSDSSGLSLPEIQNAQGLADVLLEM

Query:  LGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPVPS--------VPYINLEDDE
        L AL+P   E LKQEV+VDLV+QCR+Y  RVV LVN T+DE LLCQGL LND LQRVL+ ++ IA G   T ++  +P   +         P I+  D  
Subjt:  LGALDPKTPEALKQEVIVDLVDQCRSYHSRVVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPVPS--------VPYINLEDDE

Query:  SEDDFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGD--MYKPEESPRIVEPPSTSSSPFYTRQPLFDE-------PPPRSM
        ++ +    S                NG  ++++ LP+P     +    +D LSGD     P   P+   P ++  +         D          P   
Subjt:  SEDDFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEMMDHLSGD--MYKPEESPRIVEPPSTSSSPFYTRQPLFDE-------PPPRSM

Query:  PTNTLPATPRDAQSPSA
        P   +P  P+  Q P++
Subjt:  PTNTLPATPRDAQSPSA


Sequences
CDS sequence
ATGGGCGATATTACGACGAAATTCCCAAGAAAACCTACCCATAGCCCAGATCCTATCCCACGAACAGCCGCTTTCTATTATAAATCTCCAGAACAGGACTCCACTTCCGA
AGATTGCATCATTTTCAGAGGCTGGGACAGTGCTGCTGCCGTCGACGATGACTCTCAATCGGAATCGGGGGTTTGTTCACCCACGCTTTGGGGTTCCAATTCTCGAACCA
GCCCCCAATTTCACCGCCCTCGTAATCGCAGCCTCTCCCCAACTTCCCGAACCCAAGCCATAGCCAGAGGCCAACAGGAGCTCATGGAGATGGTCAGGAACATGCCCGAA
TCATCTTACGAGCTCTCTCTCAAAGATCTCGTTGAACATCACTTGACTAATTCGAAACGTCAACAAGACGGTGACGCTGCTTCCCTTTCAAGAGACGATTCCAGCTCTGA
AACTTCCTTCCGAAGAGACCGTAGCAAGAACCGGAGTGAAACTAGGGCACTTGTTACCAGAAGTAAAAGCGTCGATAACGGTGGATTTTACCTCAAAATGTTCTTCCCAC
TGCCTTTCGGCCAGGTTTCGGCTAAAAAGAAGAGTAATCTTAGAACCGATTCGGGGTTGAGTGGTAGTTCGAGAGTGTCGCCGAAGCCACCACCGGTGGACAGAGACTGG
TGGAGGAAGAGATCGTCGGTGGCTGGCGGCGAGAACGAGGGTAGTATCTCCGGCGGAAGCATGACGAGTAGCGGCAGTAGTAATAGCACTAGCAGCGAAAGAAGCAATAG
CAGATCCGACCCATCGGCCCACTTTGGCTTTCGAGTTGACGTTGTCAACAAGTTTACTATGCCATTCATGTCGTGGACTCGTAAACAAAGTGGGACCCGATGGCCGGTAG
TCTTAGTCCAGCACGAAAACGCGGAGATTCAAGGACAGAAGGGCTTATTTGGGTGCAAAAGAATCAGAGAGATGTCTACCAATGCTGCTGCTTGCGCTGAGAGAGCTACA
AATGATGTGCTTATCGCTCCCGATTGGGCGATAAATATTGAGCTCTGTGATATTATTAACATGGATCCTAGGCAAGCGAAGGATGCATTGAAGATACTCAAGAAGCGTCT
TGCTGGCAAAAATCCTAAAATACAACTTCTAGCGCTCCATGTATTAGAGGCTGTAAGCAAAAATTGTGGCGATACTGTTTTTAGGCTGATTGTAGATCGTAATATCCTGC
ACGAAATGGTTAAAATTGTAAAGAAGAAGCCTGATTCAACTGTACGGGACAAAATATTAGCTTTGGTAGATGCATGGCAAGCAGCCTTTGGTGGTGGCCCCAAGGGAAAG
TATCCACAGTACTATGCAGCCTACAATGAATTGAAGAATGCTGGATTTCAATTTCCACCTAGAGAAGAGAACGTTGGGCAGTTCTTTAGTCCACCTCAGATACAGCCAGT
CACTGAGCACCCTGTTTCAGCCAGTTATGATGATCTTGCTGTTCAGGTTTCTCTCCAGTCTGATTCTTCTGGTTTAAGCTTGCCAGAAATTCAAAATGCACAGGGACTAG
CAGATGTTCTATTGGAAATGCTTGGTGCGCTGGACCCTAAGACTCCCGAGGCTTTGAAGCAAGAAGTGATTGTTGATCTTGTTGATCAATGCCGTTCCTACCACAGCCGT
GTCGTGATACTTGTGAATGAGACGACAGATGAGGAACTATTATGTCAAGGATTGGTATTGAATGACAGTCTGCAGCGTGTACTCAGCTACCACGATGACATTGCTAAAGG
AACCTTTGTGACGGAAGCTAGAAGAGCAGAACCTCCCGTTCCATCGGTTCCATATATCAACCTTGAGGACGATGAGTCGGAAGATGACTTCACTCCGTTATCTCGCAGGC
CAACGAGAGATCACGTTTATGAAAGGGACAGGAAACTGGCAAATGGTCAATCATCTCGAGTAAGTCCACTTCCTTCACCCTCATCAAATAAGTCGGCTGGTGCTGAAATG
ATGGATCATCTTAGCGGCGACATGTACAAACCTGAAGAGTCTCCAAGGATAGTTGAGCCACCATCAACTTCTTCTTCACCTTTCTACACTAGACAGCCTTTGTTCGACGA
ACCACCTCCAAGAAGCATGCCCACCAATACACTCCCGGCAACACCTCGGGACGCTCAATCTCCAAGTGCCCTCCCTCCCCCACCCTCAAGACATAATCAAAGACAACTAT
ATTTTGAGCAACAAAAAGCTGTTACAGGAGGCACTCAGCCCCATTTGAGCAACGATTATAGTTCTTATGACACCATAGTTGGGAATACCAAGAATCTTTCACTCAGTCCT
ACCCCAACCAGATCCGCTGAGCATGAAGAGGCCCTTTTCAAAGATCTGGTGGATTTTGCCAAGGCCAAGTCATCTTCATCCTCCAAATCCAACCGACCATTCTGA
mRNA sequence
ATGGGCGATATTACGACGAAATTCCCAAGAAAACCTACCCATAGCCCAGATCCTATCCCACGAACAGCCGCTTTCTATTATAAATCTCCAGAACAGGACTCCACTTCCGA
AGATTGCATCATTTTCAGAGGCTGGGACAGTGCTGCTGCCGTCGACGATGACTCTCAATCGGAATCGGGGGTTTGTTCACCCACGCTTTGGGGTTCCAATTCTCGAACCA
GCCCCCAATTTCACCGCCCTCGTAATCGCAGCCTCTCCCCAACTTCCCGAACCCAAGCCATAGCCAGAGGCCAACAGGAGCTCATGGAGATGGTCAGGAACATGCCCGAA
TCATCTTACGAGCTCTCTCTCAAAGATCTCGTTGAACATCACTTGACTAATTCGAAACGTCAACAAGACGGTGACGCTGCTTCCCTTTCAAGAGACGATTCCAGCTCTGA
AACTTCCTTCCGAAGAGACCGTAGCAAGAACCGGAGTGAAACTAGGGCACTTGTTACCAGAAGTAAAAGCGTCGATAACGGTGGATTTTACCTCAAAATGTTCTTCCCAC
TGCCTTTCGGCCAGGTTTCGGCTAAAAAGAAGAGTAATCTTAGAACCGATTCGGGGTTGAGTGGTAGTTCGAGAGTGTCGCCGAAGCCACCACCGGTGGACAGAGACTGG
TGGAGGAAGAGATCGTCGGTGGCTGGCGGCGAGAACGAGGGTAGTATCTCCGGCGGAAGCATGACGAGTAGCGGCAGTAGTAATAGCACTAGCAGCGAAAGAAGCAATAG
CAGATCCGACCCATCGGCCCACTTTGGCTTTCGAGTTGACGTTGTCAACAAGTTTACTATGCCATTCATGTCGTGGACTCGTAAACAAAGTGGGACCCGATGGCCGGTAG
TCTTAGTCCAGCACGAAAACGCGGAGATTCAAGGACAGAAGGGCTTATTTGGGTGCAAAAGAATCAGAGAGATGTCTACCAATGCTGCTGCTTGCGCTGAGAGAGCTACA
AATGATGTGCTTATCGCTCCCGATTGGGCGATAAATATTGAGCTCTGTGATATTATTAACATGGATCCTAGGCAAGCGAAGGATGCATTGAAGATACTCAAGAAGCGTCT
TGCTGGCAAAAATCCTAAAATACAACTTCTAGCGCTCCATGTATTAGAGGCTGTAAGCAAAAATTGTGGCGATACTGTTTTTAGGCTGATTGTAGATCGTAATATCCTGC
ACGAAATGGTTAAAATTGTAAAGAAGAAGCCTGATTCAACTGTACGGGACAAAATATTAGCTTTGGTAGATGCATGGCAAGCAGCCTTTGGTGGTGGCCCCAAGGGAAAG
TATCCACAGTACTATGCAGCCTACAATGAATTGAAGAATGCTGGATTTCAATTTCCACCTAGAGAAGAGAACGTTGGGCAGTTCTTTAGTCCACCTCAGATACAGCCAGT
CACTGAGCACCCTGTTTCAGCCAGTTATGATGATCTTGCTGTTCAGGTTTCTCTCCAGTCTGATTCTTCTGGTTTAAGCTTGCCAGAAATTCAAAATGCACAGGGACTAG
CAGATGTTCTATTGGAAATGCTTGGTGCGCTGGACCCTAAGACTCCCGAGGCTTTGAAGCAAGAAGTGATTGTTGATCTTGTTGATCAATGCCGTTCCTACCACAGCCGT
GTCGTGATACTTGTGAATGAGACGACAGATGAGGAACTATTATGTCAAGGATTGGTATTGAATGACAGTCTGCAGCGTGTACTCAGCTACCACGATGACATTGCTAAAGG
AACCTTTGTGACGGAAGCTAGAAGAGCAGAACCTCCCGTTCCATCGGTTCCATATATCAACCTTGAGGACGATGAGTCGGAAGATGACTTCACTCCGTTATCTCGCAGGC
CAACGAGAGATCACGTTTATGAAAGGGACAGGAAACTGGCAAATGGTCAATCATCTCGAGTAAGTCCACTTCCTTCACCCTCATCAAATAAGTCGGCTGGTGCTGAAATG
ATGGATCATCTTAGCGGCGACATGTACAAACCTGAAGAGTCTCCAAGGATAGTTGAGCCACCATCAACTTCTTCTTCACCTTTCTACACTAGACAGCCTTTGTTCGACGA
ACCACCTCCAAGAAGCATGCCCACCAATACACTCCCGGCAACACCTCGGGACGCTCAATCTCCAAGTGCCCTCCCTCCCCCACCCTCAAGACATAATCAAAGACAACTAT
ATTTTGAGCAACAAAAAGCTGTTACAGGAGGCACTCAGCCCCATTTGAGCAACGATTATAGTTCTTATGACACCATAGTTGGGAATACCAAGAATCTTTCACTCAGTCCT
ACCCCAACCAGATCCGCTGAGCATGAAGAGGCCCTTTTCAAAGATCTGGTGGATTTTGCCAAGGCCAAGTCATCTTCATCCTCCAAATCCAACCGACCATTCTGA
Protein sequence
MGDITTKFPRKPTHSPDPIPRTAAFYYKSPEQDSTSEDCIIFRGWDSAAAVDDDSQSESGVCSPTLWGSNSRTSPQFHRPRNRSLSPTSRTQAIARGQQELMEMVRNMPE
SSYELSLKDLVEHHLTNSKRQQDGDAASLSRDDSSSETSFRRDRSKNRSETRALVTRSKSVDNGGFYLKMFFPLPFGQVSAKKKSNLRTDSGLSGSSRVSPKPPPVDRDW
WRKRSSVAGGENEGSISGGSMTSSGSSNSTSSERSNSRSDPSAHFGFRVDVVNKFTMPFMSWTRKQSGTRWPVVLVQHENAEIQGQKGLFGCKRIREMSTNAAACAERAT
NDVLIAPDWAINIELCDIINMDPRQAKDALKILKKRLAGKNPKIQLLALHVLEAVSKNCGDTVFRLIVDRNILHEMVKIVKKKPDSTVRDKILALVDAWQAAFGGGPKGK
YPQYYAAYNELKNAGFQFPPREENVGQFFSPPQIQPVTEHPVSASYDDLAVQVSLQSDSSGLSLPEIQNAQGLADVLLEMLGALDPKTPEALKQEVIVDLVDQCRSYHSR
VVILVNETTDEELLCQGLVLNDSLQRVLSYHDDIAKGTFVTEARRAEPPVPSVPYINLEDDESEDDFTPLSRRPTRDHVYERDRKLANGQSSRVSPLPSPSSNKSAGAEM
MDHLSGDMYKPEESPRIVEPPSTSSSPFYTRQPLFDEPPPRSMPTNTLPATPRDAQSPSALPPPPSRHNQRQLYFEQQKAVTGGTQPHLSNDYSSYDTIVGNTKNLSLSP
TPTRSAEHEEALFKDLVDFAKAKSSSSSKSNRPF
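
The CDS and protein sequences listed above should correspond under the standard genetic code. A minimal Python sketch for checking this is given here; the function name `translate` and the choice to stop at the first stop codon are illustrative assumptions and not part of CuGenDB. The full sequences would need to be pasted in with line breaks removed.

```python
# Standard genetic code, built in TCAG order (a common compact encoding).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Codons containing unexpected characters are rendered as 'X'.
    """
    cds = "".join(cds.split()).upper()  # drop whitespace and line breaks
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last four codons of the CDS shown above.
print(translate("ATGGGCGATATTACGACG"))
print(translate("CGACCATTCTGA"))
```

The first call prints MGDITT and the second prints RPF, matching the start and end of the listed protein sequence. Applying `translate` to the complete CDS and comparing the result with the complete protein sequence would extend the same check to the whole record.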