; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0029154 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0029154
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptiontarget of Myb protein 1
Genome locationchr8:35864441..35867517
RNA-Seq ExpressionLag0029154
SyntenyLag0029154
Gene Ontology termsGO:0043328 - protein transport to vacuole involved in ubiquitin-dependent protein catabolic process via the multivesicular body sorting pathway (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0035091 - phosphatidylinositol binding (molecular function)
GO:0043130 - ubiquitin binding (molecular function)
InterPro domainsIPR002014 - VHS domain
IPR004152 - GAT domain
IPR008942 - ENTH/VHS
IPR014645 - Target of Myb protein 1
IPR038425 - GAT domain superfamily
IPR044836 - TOM1-like protein, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141604.1 TOM1-like protein 4 isoform X1 [Momordica charantia]9.0e-21780.28Show/hide
Query:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKK-PDL
        MSTNAAACAERATND+L APDWAINIELCDI+NMDPRQ KDALKILKKRLAS++PK QLLAL+ LEALSKNCG+ V KLIV+R ILHEM+KIVKKK PD 
Subjt:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKK-PDL

Query:  NVRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVL
        NVRDKIL LVDAWQ AFGG SKGKYPQYYAAY ELKNAGFQFPPREENV QFFS P+IQP +E PVSAYDD +VQ SLQSDAS LSLPEIQNAQGL+DVL
Subjt:  NVRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVL

Query:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSED
        MEML ALDPKTPEALKQEVIVDLVDQC SYHSRVM+LVNETTDEELLCQGLVLND+LQRVLS+HDDI KGT +    R EP V  P+VPYMNPEDD S+D
Subjt:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSED

Query:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFY-TRQPLFDEPPPR
        +FTPLARRST+DHIYERD +L+NGQSSRV PLPS SSKK    +MIDHLS D YKPQ S RT  E PSYPPPVFPPSP TSS SPF+ TRQPLFDEPPPR
Subjt:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFY-TRQPLFDEPPPR

Query:  SISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSNGY-----ENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSS
         ISTNPL    RD QSPGSLPPPP+RYNQRQQ+FEQQKAVT GGS PHL NG      +NIVGQTKNLSL PSTPTRSA+HEEALFK+LVDFA  KSS S
Subjt:  SISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSNGY-----ENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSS

Query:  SKSNRPF
        SK +RPF
Subjt:  SKSNRPF

XP_022141608.1 TOM1-like protein 4 isoform X2 [Momordica charantia]3.6e-21880.43Show/hide
Query:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLN
        MSTNAAACAERATND+L APDWAINIELCDI+NMDPRQ KDALKILKKRLAS++PK QLLAL+ LEALSKNCG+ V KLIV+R ILHEM+KIVKKKPD N
Subjt:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLN

Query:  VRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVLM
        VRDKIL LVDAWQ AFGG SKGKYPQYYAAY ELKNAGFQFPPREENV QFFS P+IQP +E PVSAYDD +VQ SLQSDAS LSLPEIQNAQGL+DVLM
Subjt:  VRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVLM

Query:  EMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSEDE
        EML ALDPKTPEALKQEVIVDLVDQC SYHSRVM+LVNETTDEELLCQGLVLND+LQRVLS+HDDI KGT +    R EP V  P+VPYMNPEDD S+D+
Subjt:  EMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSEDE

Query:  FTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFY-TRQPLFDEPPPRS
        FTPLARRST+DHIYERD +L+NGQSSRV PLPS SSKK    +MIDHLS D YKPQ S RT  E PSYPPPVFPPSP TSS SPF+ TRQPLFDEPPPR 
Subjt:  FTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFY-TRQPLFDEPPPRS

Query:  ISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSNGY-----ENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSS
        ISTNPL    RD QSPGSLPPPP+RYNQRQQ+FEQQKAVT GGS PHL NG      +NIVGQTKNLSL PSTPTRSA+HEEALFK+LVDFA  KSS SS
Subjt:  ISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSNGY-----ENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSS

Query:  KSNRPF
        K +RPF
Subjt:  KSNRPF

XP_031743524.1 TOM1-like protein 4 isoform X2 [Cucumis sativus]5.3e-21781.71Show/hide
Query:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLN
        MSTNAAACAERATNDVL APDWAINIELCDIINMDPRQAKDALKILKKRL S++PKIQLLAL+ LEALSKNCG+TVFKLIV+RNILHEM+KIVKKKPD  
Subjt:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLN

Query:  VRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVLM
        VR+KILALVDAWQAAFGG S+GKYPQYY AYN+LKNAGF+FPPREENV QFFS PQIQPVIE PVSAY+DLAVQ SLQSD+SGLSLPEIQNAQGL DVL+
Subjt:  VRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVLM

Query:  EMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSEDE
        EML ALDPKTPEALKQEVI DLVDQC SYHSRV+ILVNETTDEELLCQGLVLND+LQRVLSYHDDI KGTF M A R EPPV  PSVPY+NPEDDGSED+
Subjt:  EMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSEDE

Query:  FTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPPPRSI
         TPL+RR T+DHIYERD +L NGQSSRV PLPS SSK T   EMIDHLS D YKP+ S R VE          PP    S+ SPFYTRQPLFDEPPPRS+
Subjt:  FTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPPPRSI

Query:  STNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSN---GYENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSSKSN
         TNPL  TPRDAQSP  LPPPP+RYNQRQQYFEQQKA T GGSQPHLSN    Y+N+VG TKNLSLSP TPTRSAEHEEALFKDLVDFAKAK SSSSKSN
Subjt:  STNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSN---GYENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSSKSN

Query:  RPF
        RPF
Subjt:  RPF

XP_038886343.1 TOM1-like protein 4 isoform X1 [Benincasa hispida]2.6e-21681.94Show/hide
Query:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKK-PDL
        MSTNA ACAERATNDVL APDWAINIELCDIINMDPRQAKDALKILKKRLA+++PKIQLLAL+ LEALSKNCG+TVFKLIV+RNILHEM+KIVKKK PD 
Subjt:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKK-PDL

Query:  NVRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVL
         VRDKIL LVDAWQAA GG SKGK+PQYYAAYNELKNAGFQFPPREENV QFFS PQIQPVIEHPVSAYDDLAVQ SLQSD+SGL LPEIQNAQ LA VL
Subjt:  NVRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVL

Query:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSED
        +EML ALDPKTPEALKQEVIVDLVDQC SYHSRV+ILVNETTDEELL QGLVLND+LQRVLS HDDI KGTF M A   EPPV  PSVPY+NPEDD SED
Subjt:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSED

Query:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPPPRS
        +FTPL+RR T+D+IYERD +L NG SSRV PLPS SSKKT + EMIDHLS D YKP+ S R VE          PPS  TSS SPFYTRQPLFDEPPPRS
Subjt:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPPPRS

Query:  ISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSN---GYENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSSKS
        ISTNPL  TPRD QSP +LPPPP+RYNQRQQYFEQQKAVT GGSQPHLSN    Y+NIVG TKNLSLSP TPTRS EHEE LFKDLVDFAKAKSSSSSK 
Subjt:  ISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSN---GYENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSSKS

Query:  NRPF
        NRPF
Subjt:  NRPF

XP_038886345.1 TOM1-like protein 4 isoform X2 [Benincasa hispida]1.1e-21782.11Show/hide
Query:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLN
        MSTNA ACAERATNDVL APDWAINIELCDIINMDPRQAKDALKILKKRLA+++PKIQLLAL+ LEALSKNCG+TVFKLIV+RNILHEM+KIVKKKPD  
Subjt:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLN

Query:  VRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVLM
        VRDKIL LVDAWQAA GG SKGK+PQYYAAYNELKNAGFQFPPREENV QFFS PQIQPVIEHPVSAYDDLAVQ SLQSD+SGL LPEIQNAQ LA VL+
Subjt:  VRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVLM

Query:  EMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSEDE
        EML ALDPKTPEALKQEVIVDLVDQC SYHSRV+ILVNETTDEELL QGLVLND+LQRVLS HDDI KGTF M A   EPPV  PSVPY+NPEDD SED+
Subjt:  EMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSEDE

Query:  FTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPPPRSI
        FTPL+RR T+D+IYERD +L NG SSRV PLPS SSKKT + EMIDHLS D YKP+ S R VE          PPS  TSS SPFYTRQPLFDEPPPRSI
Subjt:  FTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPPPRSI

Query:  STNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSN---GYENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSSKSN
        STNPL  TPRD QSP +LPPPP+RYNQRQQYFEQQKAVT GGSQPHLSN    Y+NIVG TKNLSLSP TPTRS EHEE LFKDLVDFAKAKSSSSSK N
Subjt:  STNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSN---GYENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSSKSN

Query:  RPF
        RPF
Subjt:  RPF

TrEMBL top hitse value%identityAlignment
A0A0A0KIF1 Uncharacterized protein6.3e-21681.55Show/hide
Query:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKK-PDL
        MSTNAAACAERATNDVL APDWAINIELCDIINMDPRQAKDALKILKKRL S++PKIQLLAL+ LEALSKNCG+TVFKLIV+RNILHEM+KIVKKK PD 
Subjt:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKK-PDL

Query:  NVRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVL
         VR+KILALVDAWQAAFGG S+GKYPQYY AYN+LKNAGF+FPPREENV QFFS PQIQPVIE PVSAY+DLAVQ SLQSD+SGLSLPEIQNAQGL DVL
Subjt:  NVRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVL

Query:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSED
        +EML ALDPKTPEALKQEVI DLVDQC SYHSRV+ILVNETTDEELLCQGLVLND+LQRVLSYHDDI KGTF M A R EPPV  PSVPY+NPEDDGSED
Subjt:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSED

Query:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPPPRS
        + TPL+RR T+DHIYERD +L NGQSSRV PLPS SSK T   EMIDHLS D YKP+ S R VE          PP    S+ SPFYTRQPLFDEPPPRS
Subjt:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPPPRS

Query:  ISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSN---GYENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSSKS
        + TNPL  TPRDAQSP  LPPPP+RYNQRQQYFEQQKA T GGSQPHLSN    Y+N+VG TKNLSLSP TPTRSAEHEEALFKDLVDFAKAK SSSSKS
Subjt:  ISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSN---GYENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSSKS

Query:  NRPF
        NRPF
Subjt:  NRPF

A0A1S3B3S9 target of Myb protein 12.8e-21681.55Show/hide
Query:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKK-PDL
        MSTNAAACAERATNDVL APDWAINIELCDIINMDPRQAKDALKILKKRL S++PKIQLLAL+ LEALSKNCG+TVFKLIV+RNILHEM+KIVKKK PD 
Subjt:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKK-PDL

Query:  NVRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVL
         VR+KILALVDAWQAAFGG SKGKYPQYYAAYN+LKNAGFQFPPREENV QFFS PQ QPVIE PVSAYDDLAVQ SLQSD+SGLSLPEIQNAQGL DVL
Subjt:  NVRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVL

Query:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSED
        +EML ALDPKTPEALKQEVIVDLVDQC SYHSRV+ILVNETTDEELLCQGLVLND+LQRVLSYHD+I KGTF   A R EPPV  PSVPY+NPEDD SED
Subjt:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSED

Query:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPPPRS
        +FTPL+RR T+DHIYERD +L NGQSSRV PLPS SSKKT + EMIDHLS D YKP+ S + V+          PPS   +S SPFYTRQPLFDEPPPRS
Subjt:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPPPRS

Query:  ISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSN---GYENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSSKS
        + T+PL  TPRDAQSP  LPPPP+RYNQRQQYFEQQKA TGGG QPHLSN    Y+NIVG TK LSLSP T TRSAEHEEALFKDLVDFAKAK SSSSKS
Subjt:  ISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSN---GYENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSSKS

Query:  NRPF
        NRPF
Subjt:  NRPF

A0A5D3DJE5 Target of Myb protein 12.8e-21681.55Show/hide
Query:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKK-PDL
        MSTNAAACAERATNDVL APDWAINIELCDIINMDPRQAKDALKILKKRL S++PKIQLLAL+ LEALSKNCG+TVFKLIV+RNILHEM+KIVKKK PD 
Subjt:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKK-PDL

Query:  NVRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVL
         VR+KILALVDAWQAAFGG SKGKYPQYYAAYN+LKNAGFQFPPREENV QFFS PQ QPVIE PVSAYDDLAVQ SLQSD+SGLSLPEIQNAQGL DVL
Subjt:  NVRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVL

Query:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSED
        +EML ALDPKTPEALKQEVIVDLVDQC SYHSRV+ILVNETTDEELLCQGLVLND+LQRVLSYHD+I KGTF   A R EPPV  PSVPY+NPEDD SED
Subjt:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSED

Query:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPPPRS
        +FTPL+RR T+DHIYERD +L NGQSSRV PLPS SSKKT + EMIDHLS D YKP+ S + V+          PPS   +S SPFYTRQPLFDEPPPRS
Subjt:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPPPRS

Query:  ISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSN---GYENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSSKS
        + T+PL  TPRDAQSP  LPPPP+RYNQRQQYFEQQKA TGGG QPHLSN    Y+NIVG TK LSLSP T TRSAEHEEALFKDLVDFAKAK SSSSKS
Subjt:  ISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSN---GYENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSSKS

Query:  NRPF
        NRPF
Subjt:  NRPF

A0A6J1CJP5 TOM1-like protein 4 isoform X14.3e-21780.28Show/hide
Query:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKK-PDL
        MSTNAAACAERATND+L APDWAINIELCDI+NMDPRQ KDALKILKKRLAS++PK QLLAL+ LEALSKNCG+ V KLIV+R ILHEM+KIVKKK PD 
Subjt:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKK-PDL

Query:  NVRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVL
        NVRDKIL LVDAWQ AFGG SKGKYPQYYAAY ELKNAGFQFPPREENV QFFS P+IQP +E PVSAYDD +VQ SLQSDAS LSLPEIQNAQGL+DVL
Subjt:  NVRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVL

Query:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSED
        MEML ALDPKTPEALKQEVIVDLVDQC SYHSRVM+LVNETTDEELLCQGLVLND+LQRVLS+HDDI KGT +    R EP V  P+VPYMNPEDD S+D
Subjt:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSED

Query:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFY-TRQPLFDEPPPR
        +FTPLARRST+DHIYERD +L+NGQSSRV PLPS SSKK    +MIDHLS D YKPQ S RT  E PSYPPPVFPPSP TSS SPF+ TRQPLFDEPPPR
Subjt:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFY-TRQPLFDEPPPR

Query:  SISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSNGY-----ENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSS
         ISTNPL    RD QSPGSLPPPP+RYNQRQQ+FEQQKAVT GGS PHL NG      +NIVGQTKNLSL PSTPTRSA+HEEALFK+LVDFA  KSS S
Subjt:  SISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSNGY-----ENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSS

Query:  SKSNRPF
        SK +RPF
Subjt:  SKSNRPF

A0A6J1CL02 TOM1-like protein 4 isoform X21.8e-21880.43Show/hide
Query:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLN
        MSTNAAACAERATND+L APDWAINIELCDI+NMDPRQ KDALKILKKRLAS++PK QLLAL+ LEALSKNCG+ V KLIV+R ILHEM+KIVKKKPD N
Subjt:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLN

Query:  VRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVLM
        VRDKIL LVDAWQ AFGG SKGKYPQYYAAY ELKNAGFQFPPREENV QFFS P+IQP +E PVSAYDD +VQ SLQSDAS LSLPEIQNAQGL+DVLM
Subjt:  VRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVLM

Query:  EMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSEDE
        EML ALDPKTPEALKQEVIVDLVDQC SYHSRVM+LVNETTDEELLCQGLVLND+LQRVLS+HDDI KGT +    R EP V  P+VPYMNPEDD S+D+
Subjt:  EMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSEDE

Query:  FTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFY-TRQPLFDEPPPRS
        FTPLARRST+DHIYERD +L+NGQSSRV PLPS SSKK    +MIDHLS D YKPQ S RT  E PSYPPPVFPPSP TSS SPF+ TRQPLFDEPPPR 
Subjt:  FTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFY-TRQPLFDEPPPRS

Query:  ISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSNGY-----ENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSS
        ISTNPL    RD QSPGSLPPPP+RYNQRQQ+FEQQKAVT GGS PHL NG      +NIVGQTKNLSL PSTPTRSA+HEEALFK+LVDFA  KSS SS
Subjt:  ISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSNGY-----ENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSS

Query:  KSNRPF
        K +RPF
Subjt:  KSNRPF

SwissProt top hitse value%identityAlignment
O80910 TOM1-like protein 62.8e-5632.01Show/hide
Query:  STNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLNV
        S +A    ++AT+D+L  PDW  N+E+CD +N    QAKD +K +KKRL  +S ++QLLAL +LE L KNCG+ +   + E+NIL EM+KIVKKK D+ V
Subjt:  STNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLNV

Query:  RDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAY---------------------------------
        RDKIL +VD+WQ AFGG  +GKYPQYY AY+EL+ +G +FP R  + S   + P   P +  P   Y                                 
Subjt:  RDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAY---------------------------------

Query:  --------------DDLAVQVSLQSDASGLSLPEIQNAQGLADVLMEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLND
                          +  ++ ++  GLSL  I++ + + D+L +ML A+DP   EA+K EVIVDLV++C S   ++M ++  T D+ELL +GL LND
Subjt:  --------------DDLAVQVSLQSDASGLSLPEIQNAQGLADVLMEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLND

Query:  NLQRVLSYHDDITKG--------------------------------TFMMGATRKEPPVPSPSVPYMNPEDDGSEDEFTPLARRSTK-------DHIYE
        +LQ +L+ HD I  G                                + + G++   P   S     ++ E +  EDEF  LARR +K       D    
Subjt:  NLQRVLSYHDDITKG--------------------------------TFMMGATRKEPPVPSPSVPYMNPEDDGSEDEFTPLARRSTK-------DHIYE

Query:  RDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPPPRSISTNPLPATPRDAQSP
              +   +  +P P      T   +MID LS     P  +       PS PPP          P P    QP FD               P   Q  
Subjt:  RDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPPPRSISTNPLPATPRDAQSP

Query:  GSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLS---NGYENIVGQTKNLSLSPSTP
           P     Y+Q QQ+ +QQ     G SQP  S    GY  +         S S P
Subjt:  GSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLS---NGYENIVGQTKNLSLSPSTP

Q6NQK0 TOM1-like protein 48.4e-11750.8Show/hide
Query:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLN
        M+ +AAACAERATND+L  PDWAINIELCD+INMDP QAK+A+K+LKKRL S++ K+Q+LAL+ LE LSKNCGE V++LI++R +L++M+KIVKKKP+LN
Subjt:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLN

Query:  VRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQ-SDASGLSLPEIQNAQGLADVL
        VR+KIL L+D WQ AFGG   G+YPQYY AYN+L++AG +FPPR E+   FF+ PQ QP         +D A+Q SLQ  DAS LSL EIQ+A+G  DVL
Subjt:  VRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQ-SDASGLSLPEIQNAQGLADVL

Query:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITK-GTFMMGA--TRKEPPVPSPSVPYMNPEDDG
        M+ML A DP  PE+LK+EVIVDLV+QC +Y  RVM LVN TTDEELLCQGL LNDNLQ VL  HDDI   G+       TR  PPV    + + + EDD 
Subjt:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITK-GTFMMGA--TRKEPPVPSPSVPYMNPEDDG

Query:  SEDEFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPP
        S+DEF  LA RS+                      P+      +++ M+D LS D YKPQ +S +  +    PP   PP P TSS S      P+FD+  
Subjt:  SEDEFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPP

Query:  PRSISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSNGYENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSSKS
        P+           + ++   +LPPPP+R+NQRQQ+FE   + +G  S       YE   GQT+NLSL+ S P +  + E+ LFKDLV+FAK +SS ++ +
Subjt:  PRSISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSNGYENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSSKS

Query:  NR
        NR
Subjt:  NR

Q8L860 TOM1-like protein 94.2e-6840.47Show/hide
Query:  ACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLNVRDKIL
        A  ERAT+++L  PDWA+N+E+CD++N DP QAKD +K +KKR+ SR+PK QLLAL +LE + KNCG+ V   + E+ ++HEM++IVKKKPD +V++KIL
Subjt:  ACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLNVRDKIL

Query:  ALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYD----DLAVQVSLQSDASGLSLPEIQNAQGLADVLMEM
         L+D WQ AFGG  + +YPQYYA Y EL  AG  FP R E  +  F+ PQ QP+  +P +  +    +   + S + +   LSL EIQNA+G+ DVL EM
Subjt:  ALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYD----DLAVQVSLQSDASGLSLPEIQNAQGLADVLMEM

Query:  LNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSE---D
        L+AL+P   E LKQEV+VDLV+QC +Y  RV+ LVN T+DE LLCQGL LND+LQRVL+ ++ I  G            +P  S     P+ +  +   D
Subjt:  LNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSE---D

Query:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDE----P
           PL       +     T  +       + LP+      +    ID LS D          +   P  PP   P SP+ S  +         D      
Subjt:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDE----P

Query:  PPRSISTNP---LPATPRDAQSPGS
        P  + S NP   +P  P+  Q P S
Subjt:  PPRSISTNP---LPATPRDAQSPGS

Q9C9Y1 TOM1-like protein 81.2e-6541.64Show/hide
Query:  ERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLNVRDKILALV
        +RAT+D+L  PDWA+N+E+CD++N +P Q ++ +  +KKRL SR+ K+QLLAL +LE +  NCGE +   + E++ILH+M+K+ K+KP++ V++KIL L+
Subjt:  ERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLNVRDKILALV

Query:  DAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYD--DLAVQVSLQSDASGLSLPEIQNAQGLADVLMEMLNALD
        D WQ +F G  +G++PQYYAAY EL  AG  FP R +      SS Q  P   +P ++ +    A+  S +S+   LSL EIQNA+G+ DVL EM+NA+D
Subjt:  DAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYD--DLAVQVSLQSDASGLSLPEIQNAQGLADVLMEMLNALD

Query:  PKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMM---GATRKEPPVPSPSVPYMNPEDDGSEDEFTPL
            E LKQEV+VDLV QC +Y  RV+ LVN T+DE +LCQGL LND+LQR+L+ H+ I  G  M+     ++KE P  +  +  +   +  +       
Subjt:  PKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMM---GATRKEPPVPSPSVPYMNPEDDGSEDEFTPL

Query:  ARRSTKDHIYERDTRLTNGQSSRVI-----PLPSHSSKKTTNA-EMIDHLSDD
              D +   D    N  +S  +     P PS    K  N+  +ID LSD+
Subjt:  ARRSTKDHIYERDTRLTNGQSSRVI-----PLPSHSSKKTTNA-EMIDHLSDD

Q9LPL6 TOM1-like protein 33.6e-12049.15Show/hide
Query:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLN
        M+ NAAACAERATND+L  PDWAINIELCDIINM+P QAK+A+K+LKKRL S++ K+Q+LAL+ LE LSKNCGE+V++LIV+R+IL +M+KIVKKKPDL 
Subjt:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLN

Query:  VRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQS-DASGLSLPEIQNAQGLADVL
        VR+KIL+L+D WQ AFGG S G++PQYY AYNEL++AG +FPPR E+   FF+ PQ QP++    ++ +D A+Q SLQS DAS LS+ EIQ+AQG  DVL
Subjt:  VRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQS-DASGLSLPEIQNAQGLADVL

Query:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSED
         +ML ALDP  PE LK+E+IVDLV+QC +Y  RVM LVN T+DEEL+CQGL LNDNLQRVL +HDD  KG  +        P+P  S+ + + +DD S+D
Subjt:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSED

Query:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKT-TNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPPPR
        +F  LA RS +    E       G  + ++P P  S +    ++  +D LS D YKPQE+   V+          PPS  TS  S      P+FDEP P+
Subjt:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKT-TNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPPPR

Query:  SIS-------------TNPLPATPRDAQSPGSLPPP-PARYNQRQQYFEQQKAVTGGGSQPHLSNGYENIVGQTKNLSLSPST-------PTRSAEHEEA
        S S             T  LP  P + Q P   PP   AR N+R +YF+         +     + Y++++GQ++NLSL+P+        P +  + E+ 
Subjt:  SIS-------------TNPLPATPRDAQSPGSLPPP-PARYNQRQQYFEQQKAVTGGGSQPHLSNGYENIVGQTKNLSLSPST-------PTRSAEHEEA

Query:  LFKDLVDFAKAKSSSSSKS------NRPF
        LFKDL+DFAK ++SSSS S      N+PF
Subjt:  LFKDLVDFAKAKSSSSSKS------NRPF

Arabidopsis top hitse value%identityAlignment
AT1G21380.1 Target of Myb protein 12.6e-12149.15Show/hide
Query:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLN
        M+ NAAACAERATND+L  PDWAINIELCDIINM+P QAK+A+K+LKKRL S++ K+Q+LAL+ LE LSKNCGE+V++LIV+R+IL +M+KIVKKKPDL 
Subjt:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLN

Query:  VRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQS-DASGLSLPEIQNAQGLADVL
        VR+KIL+L+D WQ AFGG S G++PQYY AYNEL++AG +FPPR E+   FF+ PQ QP++    ++ +D A+Q SLQS DAS LS+ EIQ+AQG  DVL
Subjt:  VRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQS-DASGLSLPEIQNAQGLADVL

Query:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSED
         +ML ALDP  PE LK+E+IVDLV+QC +Y  RVM LVN T+DEEL+CQGL LNDNLQRVL +HDD  KG  +        P+P  S+ + + +DD S+D
Subjt:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSED

Query:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKT-TNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPPPR
        +F  LA RS +    E       G  + ++P P  S +    ++  +D LS D YKPQE+   V+          PPS  TS  S      P+FDEP P+
Subjt:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKT-TNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPPPR

Query:  SIS-------------TNPLPATPRDAQSPGSLPPP-PARYNQRQQYFEQQKAVTGGGSQPHLSNGYENIVGQTKNLSLSPST-------PTRSAEHEEA
        S S             T  LP  P + Q P   PP   AR N+R +YF+         +     + Y++++GQ++NLSL+P+        P +  + E+ 
Subjt:  SIS-------------TNPLPATPRDAQSPGSLPPP-PARYNQRQQYFEQQKAVTGGGSQPHLSNGYENIVGQTKNLSLSPST-------PTRSAEHEEA

Query:  LFKDLVDFAKAKSSSSSKS------NRPF
        LFKDL+DFAK ++SSSS S      N+PF
Subjt:  LFKDLVDFAKAKSSSSSKS------NRPF

AT1G76970.1 Target of Myb protein 16.0e-11850.8Show/hide
Query:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLN
        M+ +AAACAERATND+L  PDWAINIELCD+INMDP QAK+A+K+LKKRL S++ K+Q+LAL+ LE LSKNCGE V++LI++R +L++M+KIVKKKP+LN
Subjt:  MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLN

Query:  VRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQ-SDASGLSLPEIQNAQGLADVL
        VR+KIL L+D WQ AFGG   G+YPQYY AYN+L++AG +FPPR E+   FF+ PQ QP         +D A+Q SLQ  DAS LSL EIQ+A+G  DVL
Subjt:  VRDKILALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQ-SDASGLSLPEIQNAQGLADVL

Query:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITK-GTFMMGA--TRKEPPVPSPSVPYMNPEDDG
        M+ML A DP  PE+LK+EVIVDLV+QC +Y  RVM LVN TTDEELLCQGL LNDNLQ VL  HDDI   G+       TR  PPV    + + + EDD 
Subjt:  MEMLNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITK-GTFMMGA--TRKEPPVPSPSVPYMNPEDDG

Query:  SEDEFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPP
        S+DEF  LA RS+                      P+      +++ M+D LS D YKPQ +S +  +    PP   PP P TSS S      P+FD+  
Subjt:  SEDEFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPP

Query:  PRSISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSNGYENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSSKS
        P+           + ++   +LPPPP+R+NQRQQ+FE   + +G  S       YE   GQT+NLSL+ S P +  + E+ LFKDLV+FAK +SS ++ +
Subjt:  PRSISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTGGGSQPHLSNGYENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSSKS

Query:  NR
        NR
Subjt:  NR

AT3G08790.1 ENTH/VHS/GAT family protein8.2e-6741.64Show/hide
Query:  ERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLNVRDKILALV
        +RAT+D+L  PDWA+N+E+CD++N +P Q ++ +  +KKRL SR+ K+QLLAL +LE +  NCGE +   + E++ILH+M+K+ K+KP++ V++KIL L+
Subjt:  ERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLNVRDKILALV

Query:  DAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYD--DLAVQVSLQSDASGLSLPEIQNAQGLADVLMEMLNALD
        D WQ +F G  +G++PQYYAAY EL  AG  FP R +      SS Q  P   +P ++ +    A+  S +S+   LSL EIQNA+G+ DVL EM+NA+D
Subjt:  DAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYD--DLAVQVSLQSDASGLSLPEIQNAQGLADVLMEMLNALD

Query:  PKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMM---GATRKEPPVPSPSVPYMNPEDDGSEDEFTPL
            E LKQEV+VDLV QC +Y  RV+ LVN T+DE +LCQGL LND+LQR+L+ H+ I  G  M+     ++KE P  +  +  +   +  +       
Subjt:  PKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMM---GATRKEPPVPSPSVPYMNPEDDGSEDEFTPL

Query:  ARRSTKDHIYERDTRLTNGQSSRVI-----PLPSHSSKKTTNA-EMIDHLSDD
              D +   D    N  +S  +     P PS    K  N+  +ID LSD+
Subjt:  ARRSTKDHIYERDTRLTNGQSSRVI-----PLPSHSSKKTTNA-EMIDHLSDD

AT4G32760.1 ENTH/VHS/GAT family protein3.0e-6940.47Show/hide
Query:  ACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLNVRDKIL
        A  ERAT+++L  PDWA+N+E+CD++N DP QAKD +K +KKR+ SR+PK QLLAL +LE + KNCG+ V   + E+ ++HEM++IVKKKPD +V++KIL
Subjt:  ACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLNVRDKIL

Query:  ALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYD----DLAVQVSLQSDASGLSLPEIQNAQGLADVLMEM
         L+D WQ AFGG  + +YPQYYA Y EL  AG  FP R E  +  F+ PQ QP+  +P +  +    +   + S + +   LSL EIQNA+G+ DVL EM
Subjt:  ALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYD----DLAVQVSLQSDASGLSLPEIQNAQGLADVLMEM

Query:  LNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSE---D
        L+AL+P   E LKQEV+VDLV+QC +Y  RV+ LVN T+DE LLCQGL LND+LQRVL+ ++ I  G            +P  S     P+ +  +   D
Subjt:  LNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSE---D

Query:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDE----P
           PL       +     T  +       + LP+      +    ID LS D          +   P  PP   P SP+ S  +         D      
Subjt:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDE----P

Query:  PPRSISTNP---LPATPRDAQSPGS
        P  + S NP   +P  P+  Q P S
Subjt:  PPRSISTNP---LPATPRDAQSPGS

AT4G32760.2 ENTH/VHS/GAT family protein3.0e-6940.47Show/hide
Query:  ACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLNVRDKIL
        A  ERAT+++L  PDWA+N+E+CD++N DP QAKD +K +KKR+ SR+PK QLLAL +LE + KNCG+ V   + E+ ++HEM++IVKKKPD +V++KIL
Subjt:  ACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLNVRDKIL

Query:  ALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYD----DLAVQVSLQSDASGLSLPEIQNAQGLADVLMEM
         L+D WQ AFGG  + +YPQYYA Y EL  AG  FP R E  +  F+ PQ QP+  +P +  +    +   + S + +   LSL EIQNA+G+ DVL EM
Subjt:  ALVDAWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYD----DLAVQVSLQSDASGLSLPEIQNAQGLADVLMEM

Query:  LNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSE---D
        L+AL+P   E LKQEV+VDLV+QC +Y  RV+ LVN T+DE LLCQGL LND+LQRVL+ ++ I  G            +P  S     P+ +  +   D
Subjt:  LNALDPKTPEALKQEVIVDLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSE---D

Query:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDE----P
           PL       +     T  +       + LP+      +    ID LS D          +   P  PP   P SP+ S  +         D      
Subjt:  EFTPLARRSTKDHIYERDTRLTNGQSSRVIPLPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDE----P

Query:  PPRSISTNP---LPATPRDAQSPGS
        P  + S NP   +P  P+  Q P S
Subjt:  PPRSISTNP---LPATPRDAQSPGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTACCAATGCTGCTGCCTGTGCTGAGAGAGCAACAAATGATGTGCTTAGAGCTCCCGATTGGGCCATAAATATTGAGCTCTGTGATATCATCAACATGGATCCTAG
GCAAGCGAAGGATGCATTAAAGATACTCAAGAAGCGTCTGGCAAGCAGAAGTCCTAAAATACAACTTTTAGCTCTCCATGTATTGGAAGCTCTAAGCAAAAATTGTGGTG
AGACTGTTTTTAAGCTGATCGTGGAACGCAATATTCTGCATGAAATGCTTAAAATTGTAAAGAAGAAGCCTGATTTAAATGTACGGGACAAAATTTTAGCTCTGGTAGAT
GCATGGCAAGCAGCATTTGGTGGTGACTCCAAGGGAAAGTATCCACAGTACTATGCAGCCTACAATGAATTGAAGAATGCTGGATTTCAATTTCCGCCAAGAGAAGAGAA
TGTTAGCCAGTTCTTTAGTTCACCTCAGATACAGCCTGTTATTGAGCACCCTGTTTCAGCTTATGATGATCTTGCTGTTCAGGTTTCTCTCCAGTCTGATGCTTCTGGTT
TAAGCTTGCCAGAAATTCAAAATGCCCAGGGGCTAGCAGACGTTTTAATGGAAATGCTTAATGCGTTGGATCCTAAGACTCCAGAGGCTCTAAAGCAAGAAGTGATTGTT
GATCTTGTCGATCAATGCTGTTCCTACCATAGCCGTGTCATGATACTTGTGAACGAGACCACAGATGAGGAACTGTTATGTCAAGGGTTGGTGTTGAATGACAATCTGCA
GCGCGTACTCAGCTACCATGACGACATTACGAAAGGAACATTCATGATGGGAGCTACGAGAAAAGAACCTCCTGTTCCATCTCCATCGGTTCCGTATATGAACCCTGAGG
ATGATGGTTCGGAAGATGAATTTACACCGTTAGCTCGCAGGTCAACAAAAGATCACATCTATGAAAGGGACACAAGATTGACAAATGGTCAATCATCTCGAGTTATTCCG
CTTCCTTCACACTCATCAAAGAAGACGACTAACGCAGAAATGATCGATCATCTCAGCGACGATGCATACAAGCCTCAAGAGTCTTCAAGGACAGTAGAGGAGTCACCATC
TTATCCACCTCCAGTTTTCCCACCTTCACCAATAACTTCATCTCCCTCACCTTTCTACACTAGACAGCCTCTGTTCGACGAACCACCTCCGAGAAGCATATCCACGAATC
CGCTCCCGGCAACACCTCGGGACGCTCAATCTCCAGGCTCCCTCCCTCCCCCACCAGCGAGATATAATCAAAGACAACAATATTTTGAGCAACAAAAAGCTGTCACTGGA
GGAGGCAGTCAGCCTCATTTGAGCAACGGCTATGAAAACATAGTGGGACAAACTAAGAATCTGTCCCTCAGTCCTTCCACCCCAACCAGATCCGCAGAGCATGAAGAAGC
CCTTTTCAAAGATCTGGTGGATTTTGCCAAGGCTAAGTCATCTTCATCCTCCAAATCCAACCGACCATTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTACCAATGCTGCTGCCTGTGCTGAGAGAGCAACAAATGATGTGCTTAGAGCTCCCGATTGGGCCATAAATATTGAGCTCTGTGATATCATCAACATGGATCCTAG
GCAAGCGAAGGATGCATTAAAGATACTCAAGAAGCGTCTGGCAAGCAGAAGTCCTAAAATACAACTTTTAGCTCTCCATGTATTGGAAGCTCTAAGCAAAAATTGTGGTG
AGACTGTTTTTAAGCTGATCGTGGAACGCAATATTCTGCATGAAATGCTTAAAATTGTAAAGAAGAAGCCTGATTTAAATGTACGGGACAAAATTTTAGCTCTGGTAGAT
GCATGGCAAGCAGCATTTGGTGGTGACTCCAAGGGAAAGTATCCACAGTACTATGCAGCCTACAATGAATTGAAGAATGCTGGATTTCAATTTCCGCCAAGAGAAGAGAA
TGTTAGCCAGTTCTTTAGTTCACCTCAGATACAGCCTGTTATTGAGCACCCTGTTTCAGCTTATGATGATCTTGCTGTTCAGGTTTCTCTCCAGTCTGATGCTTCTGGTT
TAAGCTTGCCAGAAATTCAAAATGCCCAGGGGCTAGCAGACGTTTTAATGGAAATGCTTAATGCGTTGGATCCTAAGACTCCAGAGGCTCTAAAGCAAGAAGTGATTGTT
GATCTTGTCGATCAATGCTGTTCCTACCATAGCCGTGTCATGATACTTGTGAACGAGACCACAGATGAGGAACTGTTATGTCAAGGGTTGGTGTTGAATGACAATCTGCA
GCGCGTACTCAGCTACCATGACGACATTACGAAAGGAACATTCATGATGGGAGCTACGAGAAAAGAACCTCCTGTTCCATCTCCATCGGTTCCGTATATGAACCCTGAGG
ATGATGGTTCGGAAGATGAATTTACACCGTTAGCTCGCAGGTCAACAAAAGATCACATCTATGAAAGGGACACAAGATTGACAAATGGTCAATCATCTCGAGTTATTCCG
CTTCCTTCACACTCATCAAAGAAGACGACTAACGCAGAAATGATCGATCATCTCAGCGACGATGCATACAAGCCTCAAGAGTCTTCAAGGACAGTAGAGGAGTCACCATC
TTATCCACCTCCAGTTTTCCCACCTTCACCAATAACTTCATCTCCCTCACCTTTCTACACTAGACAGCCTCTGTTCGACGAACCACCTCCGAGAAGCATATCCACGAATC
CGCTCCCGGCAACACCTCGGGACGCTCAATCTCCAGGCTCCCTCCCTCCCCCACCAGCGAGATATAATCAAAGACAACAATATTTTGAGCAACAAAAAGCTGTCACTGGA
GGAGGCAGTCAGCCTCATTTGAGCAACGGCTATGAAAACATAGTGGGACAAACTAAGAATCTGTCCCTCAGTCCTTCCACCCCAACCAGATCCGCAGAGCATGAAGAAGC
CCTTTTCAAAGATCTGGTGGATTTTGCCAAGGCTAAGTCATCTTCATCCTCCAAATCCAACCGACCATTCTGA
Protein sequenceShow/hide protein sequence
MSTNAAACAERATNDVLRAPDWAINIELCDIINMDPRQAKDALKILKKRLASRSPKIQLLALHVLEALSKNCGETVFKLIVERNILHEMLKIVKKKPDLNVRDKILALVD
AWQAAFGGDSKGKYPQYYAAYNELKNAGFQFPPREENVSQFFSSPQIQPVIEHPVSAYDDLAVQVSLQSDASGLSLPEIQNAQGLADVLMEMLNALDPKTPEALKQEVIV
DLVDQCCSYHSRVMILVNETTDEELLCQGLVLNDNLQRVLSYHDDITKGTFMMGATRKEPPVPSPSVPYMNPEDDGSEDEFTPLARRSTKDHIYERDTRLTNGQSSRVIP
LPSHSSKKTTNAEMIDHLSDDAYKPQESSRTVEESPSYPPPVFPPSPITSSPSPFYTRQPLFDEPPPRSISTNPLPATPRDAQSPGSLPPPPARYNQRQQYFEQQKAVTG
GGSQPHLSNGYENIVGQTKNLSLSPSTPTRSAEHEEALFKDLVDFAKAKSSSSSKSNRPF