; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS020898 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS020898
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionTarget of Myb protein 1
Genome locationscaffold382:451430..462294
RNA-Seq ExpressionMS020898
SyntenyMS020898
Gene Ontology termsGO:0043328 - protein transport to vacuole involved in ubiquitin-dependent protein catabolic process via the multivesicular body sorting pathway (biological process)
GO:0016020 - membrane (cellular component)
GO:0035091 - phosphatidylinositol binding (molecular function)
GO:0043130 - ubiquitin binding (molecular function)
InterPro domainsIPR002014 - VHS domain
IPR004152 - GAT domain
IPR008942 - ENTH/VHS
IPR014645 - Target of Myb protein 1
IPR038425 - GAT domain superfamily
IPR044836 - TOM1-like protein, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139800.1 TOM1-like protein 3 [Cucumis sativus]2.6e-22483Show/hide
Query:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG
        MASNAAACAERAT+DMLIGPDWAINIELCDIINMDPGQAKEALKILKK+LTNKNPKIQILAL  LET+SKNCGENVFQQI+ERDILHDMVKIVKKKPDL 
Subjt:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG

Query:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME
        VREKIL LIDTWQEAFGGPRGRYPQCYAAYNELKNAG++FPPREE+SVPFFTPPQTQPIV+QPA+ +EDAAIHASLESDASGLSLPEIRNA GLADVL+E
Subjt:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME

Query:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ
        MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLL+NSTGDEELLCQGLALND LQRV+KQHDDIANGTA RE TGA  S LP + +++EDDESEDDFAQ
Subjt:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ

Query:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKS-EASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEP
        LARRSSRDNSQG S+KPAN  TEATRVGPLLPPPP+SKKPV+AGSSMVDYLSGD YKS EA ETSR  P  + +TS PPS SP+S      + +PVYDEP
Subjt:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKS-EASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEP

Query:  NPT---------ASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRS
         PT         A W  Q  QS S LPPPPSK+ +RQQFFDQ + RGSGSSYDSLVGHTQNLSL+PPTPTKQEK EDVLFKDLVDFAKARSSGSS PNRS
Subjt:  NPT---------ASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRS

XP_008447823.1 PREDICTED: TOM1-like protein 2 [Cucumis melo]9.1e-22283Show/hide
Query:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG
        MASNAAACAERAT+DMLIGPDWAINIELCDIINMDPGQAKEALKILKK+LTNKNPKIQILAL  LET+SKNCGENVFQQIVERDILHDMVKIVKKKPDL 
Subjt:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG

Query:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME
        VREKIL LIDTWQEAFGGPRGRYPQCYAAYNELKNAG++FPPREENSVPFFTPPQTQPIV+QPAA +EDAAIHASLESDASGLSLPEIRNA GLADVL+E
Subjt:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME

Query:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ
        MLGALDPKKPEGVKQEVIVDLVDQC+SYQKRVMLL+NSTGDEELLCQGLALND LQRV+KQHDDIANGTA RE TGA  S LP + +++EDDESEDDFAQ
Subjt:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ

Query:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKS-EASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEP
        LARR S   SQG S+KPANG+TEATRVGPLLPPPPSSKKPV+AGSSMVDYLSGD YKS EA ETSR  P  +  TS PPS SP+S      + +PVYDEP
Subjt:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKS-EASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEP

Query:  NPT---------ASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRS
         PT         A W  Q  QS S LPPPPSK+ QRQQFFDQ + RGSGSSYDSLVGHTQNLSL+ PTPTKQEK EDVLFKDLVDFAKARSSGSS PNRS
Subjt:  NPT---------ASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRS

XP_022153688.1 TOM1-like protein 4 isoform X1 [Momordica charantia]3.5e-27499.59Show/hide
Query:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG
        MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG
Subjt:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG

Query:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME
        VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME
Subjt:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME

Query:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ
        MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ
Subjt:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ

Query:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEPN
        LARRSSRDNSQGQSRKPANGSTE+TRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEPN
Subjt:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEPN

Query:  PTASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRSF
        PTASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVG TQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRSF
Subjt:  PTASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRSF

XP_022153689.1 TOM1-like protein 4 isoform X2 [Momordica charantia]3.5e-27499.59Show/hide
Query:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG
        MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG
Subjt:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG

Query:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME
        VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME
Subjt:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME

Query:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ
        MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ
Subjt:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ

Query:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEPN
        LARRSSRDNSQGQSRKPANGSTE+TRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEPN
Subjt:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEPN

Query:  PTASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRSF
        PTASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVG TQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRSF
Subjt:  PTASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRSF

XP_038898391.1 TOM1-like protein 3 isoform X1 [Benincasa hispida]5.9e-22983.79Show/hide
Query:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG
        MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKK+LTNK+PKIQILAL ALET+SKNCGENVFQQI+ERDILHDMVKIVKKKPDL 
Subjt:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG

Query:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME
        VREKIL LIDTWQEAFGGPRGRYPQCYAAYNELKNAG++FPPREENSVPFFTPPQTQPIV+QPAA +EDA IHASLESDASGLSLPEIRNA GLADVL+E
Subjt:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME

Query:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ
        MLGALDPK PEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALND+LQRV+KQHDDIA+GTA RE TGAESSALPI+ +++EDDES+DDFAQ
Subjt:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ

Query:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEAS-ETSRTVPVHSTATSIPPSPSPVSHVEHSISSQP-----
        LARRSSRDNSQG SRK ANG+TE TRV PLLPPPPSSKKPV+AGSSMVDYLSGD YKSEA+ ETSR  P     TS PPS SP+SHVE SI SQP     
Subjt:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEAS-ETSRTVPVHSTATSIPPSPSPVSHVEHSISSQP-----

Query:  -VYDEPNPT---------ASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGS
         VYDEP PT         A WG Q  QS S LPPPPSK  QRQQ+FDQ + RGSGSSYDSLVGHTQ+LSL+PPTPTKQEK EDVLFKDL+DFAK RSSGS
Subjt:  -VYDEPNPT---------ASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGS

Query:  SNPNRS
        S PNRS
Subjt:  SNPNRS

TrEMBL top hitse value%identityAlignment
A0A0A0K8S9 Uncharacterized protein1.2e-22483Show/hide
Query:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG
        MASNAAACAERAT+DMLIGPDWAINIELCDIINMDPGQAKEALKILKK+LTNKNPKIQILAL  LET+SKNCGENVFQQI+ERDILHDMVKIVKKKPDL 
Subjt:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG

Query:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME
        VREKIL LIDTWQEAFGGPRGRYPQCYAAYNELKNAG++FPPREE+SVPFFTPPQTQPIV+QPA+ +EDAAIHASLESDASGLSLPEIRNA GLADVL+E
Subjt:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME

Query:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ
        MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLL+NSTGDEELLCQGLALND LQRV+KQHDDIANGTA RE TGA  S LP + +++EDDESEDDFAQ
Subjt:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ

Query:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKS-EASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEP
        LARRSSRDNSQG S+KPAN  TEATRVGPLLPPPP+SKKPV+AGSSMVDYLSGD YKS EA ETSR  P  + +TS PPS SP+S      + +PVYDEP
Subjt:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKS-EASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEP

Query:  NPT---------ASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRS
         PT         A W  Q  QS S LPPPPSK+ +RQQFFDQ + RGSGSSYDSLVGHTQNLSL+PPTPTKQEK EDVLFKDLVDFAKARSSGSS PNRS
Subjt:  NPT---------ASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRS

A0A1S3BIX5 TOM1-like protein 24.4e-22283Show/hide
Query:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG
        MASNAAACAERAT+DMLIGPDWAINIELCDIINMDPGQAKEALKILKK+LTNKNPKIQILAL  LET+SKNCGENVFQQIVERDILHDMVKIVKKKPDL 
Subjt:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG

Query:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME
        VREKIL LIDTWQEAFGGPRGRYPQCYAAYNELKNAG++FPPREENSVPFFTPPQTQPIV+QPAA +EDAAIHASLESDASGLSLPEIRNA GLADVL+E
Subjt:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME

Query:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ
        MLGALDPKKPEGVKQEVIVDLVDQC+SYQKRVMLL+NSTGDEELLCQGLALND LQRV+KQHDDIANGTA RE TGA  S LP + +++EDDESEDDFAQ
Subjt:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ

Query:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKS-EASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEP
        LARR S   SQG S+KPANG+TEATRVGPLLPPPPSSKKPV+AGSSMVDYLSGD YKS EA ETSR  P  +  TS PPS SP+S      + +PVYDEP
Subjt:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKS-EASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEP

Query:  NPT---------ASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRS
         PT         A W  Q  QS S LPPPPSK+ QRQQFFDQ + RGSGSSYDSLVGHTQNLSL+ PTPTKQEK EDVLFKDLVDFAKARSSGSS PNRS
Subjt:  NPT---------ASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRS

A0A6J1DI53 TOM1-like protein 4 isoform X11.7e-27499.59Show/hide
Query:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG
        MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG
Subjt:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG

Query:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME
        VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME
Subjt:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME

Query:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ
        MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ
Subjt:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ

Query:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEPN
        LARRSSRDNSQGQSRKPANGSTE+TRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEPN
Subjt:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEPN

Query:  PTASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRSF
        PTASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVG TQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRSF
Subjt:  PTASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRSF

A0A6J1DI58 TOM1-like protein 4 isoform X21.7e-27499.59Show/hide
Query:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG
        MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG
Subjt:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG

Query:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME
        VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME
Subjt:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME

Query:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ
        MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ
Subjt:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ

Query:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEPN
        LARRSSRDNSQGQSRKPANGSTE+TRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEPN
Subjt:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEPN

Query:  PTASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRSF
        PTASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVG TQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRSF
Subjt:  PTASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRSF

A0A6J1IM19 TOM1-like protein 31.2e-21480.28Show/hide
Query:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG
        M SNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAK ALKILK +L NKN KIQILALFALET+SKNCGENVFQQI+E DIL DMVKIVKKKP L 
Subjt:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG

Query:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME
        VREKIL LIDTWQEAFGGPRGRYPQCYAAYNELKNAG++FPPREENSVPFFTPPQTQPI++QPAA +EDAAIHASLESD SGLSLPEI+NA GLADVL+E
Subjt:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLME

Query:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ
        MLGALDPKKPEGVKQEVIVDLVDQC+SYQKRVMLLVNSTGDEELLCQGLALND LQRV+KQHDDIANG+A +E T AESSALPI+ +++EDD+  DDFAQ
Subjt:  MLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQ

Query:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKS-EASETSRTVPVHSTATSIPPSPSPVSHVEHSISS--QPVYD
        LARRS      G +RKPA+ +TEATRVGPLLPPPPSSKKPV+AGSSMVDYLSGD YKS EASETSR  P  +  TS PPS SP+ HVE SI S  QPVYD
Subjt:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKS-EASETSRTVPVHSTATSIPPSPSPVSHVEHSISS--QPVYD

Query:  EPNPT---------ASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPN
        EP PT         A WG Q  QS + LP PPS + QR QFFD+ EA GSGSSYDSLVGHTQ+LSL+PPTP KQEK EDVLFKDLVDFAKARSS SS PN
Subjt:  EPNPT---------ASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPN

Query:  RS
        RS
Subjt:  RS

SwissProt top hitse value%identityAlignment
O80910 TOM1-like protein 69.2e-6833.27Show/hide
Query:  ASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLGV
        +++A    ++ATSD+L+GPDW  N+E+CD +N    QAK+ +K +KK+L +K+ ++Q+LAL  LET+ KNCG+ +  Q+ E++IL +MVKIVKKK D+ V
Subjt:  ASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLGV

Query:  REKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAF----------------------------------
        R+KIL ++D+WQ+AFGGP G+YPQ Y AY+EL+ +G++FP R  ++ P  TPP + P + QP   +                                  
Subjt:  REKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAF----------------------------------

Query:  -------------EDAAIHASLESDASGLSLPEIRNAQGLADVLMEMLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDT
                         +  ++ ++  GLSL  I + + + D+L +ML A+DP   E VK EVIVDLV++CRS QK++M ++ STGD+ELL +GL LND+
Subjt:  -------------EDAAIHASLESDASGLSLPEIRNAQGLADVLMEMLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDT

Query:  LQRVIKQHDDIANGT----------------------------ANRETTGAESSALPIVT------INNEDDESEDDFAQLARRSSRD----NSQGQSRK
        LQ ++ +HD IA+G+                             +  +    SS +P         I+ E +E ED+FAQLARR S+      +   S +
Subjt:  LQRVIKQHDDIANGT----------------------------ANRETTGAESSALPIVT------INNEDDESEDDFAQLARRSSRD----NSQGQSRK

Query:  PANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEPNP-----TASWG-LQEP
          N ++ A  +    PPPP +         M+D LS           +   P    A S  PSP P +    S  +  +Y +P P      A W   Q+P
Subjt:  PANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEPNP-----TASWG-LQEP

Query:  QSPSTLPPPPSKFGQRQQFFDQHEARGSGSSY
        Q P           Q QQ + QH+       Y
Subjt:  QSPSTLPPPPSKFGQRQQFFDQHEARGSGSSY

Q6NQK0 TOM1-like protein 41.3e-13858.15Show/hide
Query:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG
        MA++AAACAERAT+DMLIGPDWAINIELCD+INMDP QAKEA+K+LKK+L +KN K+QILAL+ALET+SKNCGENV+Q I++R +L+DMVKIVKKKP+L 
Subjt:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG

Query:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLE-SDASGLSLPEIRNAQGLADVLM
        VREKILTL+DTWQEAFGG  GRYPQ Y AYN+L++AGI+FPPR E+S+ FFTPPQTQP         EDAAI ASL+  DAS LSL EI++A+G  DVLM
Subjt:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLE-SDASGLSLPEIRNAQGLADVLM

Query:  EMLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALP---IVTIN--NEDDES
        +MLGA DP  PE +K+EVIVDLV+QCR+YQ+RVM LVN+T DEELLCQGLALND LQ V+++HDDIAN   +  + G  + A P   IV IN  +EDDES
Subjt:  EMLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALP---IVTIN--NEDDES

Query:  EDDFAQLARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQP
        +D+FA+LA RSS       +R+P +GS                       S MVD LSGD YK + + +S+ V         PP P P  H   S SS P
Subjt:  EDDFAQLARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQP

Query:  VYDEPNPTASWGLQEPQSPSTLPPPPSKFGQRQQFFD-QHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRS
        V+D+ +P  S   +  +    LPPPPS+  QRQQFF+  H + GS SSY+   G T+NLSL    P K+EK ED+LFKDLV+FAK RSS ++N NRS
Subjt:  VYDEPNPTASWGLQEPQSPSTLPPPPSKFGQRQQFFD-QHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRS

Q8L860 TOM1-like protein 96.0e-8345.64Show/hide
Query:  ACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLGVREKIL
        A  ERATS+MLIGPDWA+N+E+CD++N DP QAK+ +K +KK++ ++NPK Q+LAL  LET+ KNCG+ V   + E+ ++H+MV+IVKKKPD  V+EKIL
Subjt:  ACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLGVREKIL

Query:  TLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPA----AAFEDAAIHASLESDASGLSLPEIRNAQGLADVLMEML
         LIDTWQEAFGGPR RYPQ YA Y EL  AG  FP R E S P FTPPQTQP+   P     A   +     S E +   LSL EI+NA+G+ DVL EML
Subjt:  TLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPA----AAFEDAAIHASLESDASGLSLPEIRNAQGLADVLMEML

Query:  GALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQLA
         AL+P   E +KQEV+VDLV+QCR+Y++RV+ LVNST DE LLCQGLALND LQRV+  ++ IA+G             LP  +   E  +SE     L 
Subjt:  GALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQLA

Query:  RRSSRDNSQGQSRKPANGSTEATRVGPL----LPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSP-SPVSHVEHSISSQPVYD
                 G S   ANG+T ++  G L    LP PP +     + +S +D LSGD            VPV       PP P SPV+  +++++   ++ 
Subjt:  RRSSRDNSQGQSRKPANGSTEATRVGPL----LPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSP-SPVSHVEHSISSQPVYD

Query:  E----PNP-TASWG---LQEPQSPSTLPPPPSKFGQ
        +    P+P TA  G      P +P     P S+ G+
Subjt:  E----PNP-TASWG---LQEPQSPSTLPPPPSKFGQ

Q9C9Y1 TOM1-like protein 83.5e-7542.17Show/hide
Query:  ERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLGVREKILTLI
        +RATSDMLIGPDWA+N+E+CD++N +PGQ +E +  +KK+LT++  K+Q+LAL  LET+  NCGE +  Q+ E+DILH MVK+ K+KP++ V+EKIL LI
Subjt:  ERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLGVREKILTLI

Query:  DTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPP--QTQPIVHQP--AAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLMEMLGAL
        DTWQE+F GP+GR+PQ YAAY EL  AGI FP R     P  TP   Q  P    P  +      AI  S ES+   LSL EI+NA+G+ DVL EM+ A+
Subjt:  DTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPP--QTQPIVHQP--AAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLMEMLGAL

Query:  DPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALP-----IVTINNEDDESEDDFAQ
        D    EG+KQEV+VDLV QCR+Y++RV+ LVNST DE +LCQGLALND LQR++ +H+ IA+G +  +        +P     I+ + + + ++    A 
Subjt:  DPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALP-----IVTINNEDDESEDDFAQ

Query:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSS---MVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSI----SSQ
               D   G   +  N       V PL PP PSS  PV    +   ++D LS +  +S ++ TS     H        +     H E S     SS 
Subjt:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSS---MVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSI----SSQ

Query:  PVYD---EPNPTASWGLQEPQSPSTLPPPPSKFG
        PV++      P++     +P SP+  PP    +G
Subjt:  PVYD---EPNPTASWGLQEPQSPSTLPPPPSKFG

Q9LPL6 TOM1-like protein 32.8e-14958.65Show/hide
Query:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG
        MA+NAAACAERAT+DMLIGPDWAINIELCDIINM+P QAKEA+K+LKK+L +KN K+QILAL+ALET+SKNCGE+V+Q IV+RDIL DMVKIVKKKPDL 
Subjt:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG

Query:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLES-DASGLSLPEIRNAQGLADVLM
        VREKIL+L+DTWQEAFGG  GR+PQ Y AYNEL++AGI+FPPR E+SVPFFTPPQTQPIV Q  A+ EDAAI ASL+S DAS LS+ EI++AQG  DVL 
Subjt:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLES-DASGLSLPEIRNAQGLADVLM

Query:  EMLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTIN--NEDDESEDD
        +MLGALDP  PEG+K+E+IVDLV+QCR+YQ+RVM LVN+T DEEL+CQGLALND LQRV++ HDD A G +   T     + +P+V+IN  ++DDES+DD
Subjt:  EMLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTIN--NEDDESEDD

Query:  FAQLARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATS-------------IPPSPSPVS
        F QLA RS R++++G  +   N         P+LPPPPSS +PV   S  +D+LSGD YK +  ET   V   ST+ S             +P S SP  
Subjt:  FAQLARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATS-------------IPPSPSPVS

Query:  HVEHSISSQPVYD--EPNPTASWGLQEPQSPSTLPPPPS-KFGQRQQFFD----QHEARGSGSSYDSLVGHTQNLSLNPPT-------PTKQEKSEDVLF
          EH++ ++PVYD  E  P A W  QEP+     PP  S +  +R ++F     QH +  S SSYD L+G ++NLSLNP         P K +K ED+LF
Subjt:  HVEHSISSQPVYD--EPNPTASWGLQEPQSPSTLPPPPS-KFGQRQQFFD----QHEARGSGSSYDSLVGHTQNLSLNPPT-------PTKQEKSEDVLF

Query:  KDLVDFAKAR--SSGSSNPN
        KDL+DFAK R  SS SS PN
Subjt:  KDLVDFAKAR--SSGSSNPN

Arabidopsis top hitse value%identityAlignment
AT1G21380.1 Target of Myb protein 12.0e-15058.65Show/hide
Query:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG
        MA+NAAACAERAT+DMLIGPDWAINIELCDIINM+P QAKEA+K+LKK+L +KN K+QILAL+ALET+SKNCGE+V+Q IV+RDIL DMVKIVKKKPDL 
Subjt:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG

Query:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLES-DASGLSLPEIRNAQGLADVLM
        VREKIL+L+DTWQEAFGG  GR+PQ Y AYNEL++AGI+FPPR E+SVPFFTPPQTQPIV Q  A+ EDAAI ASL+S DAS LS+ EI++AQG  DVL 
Subjt:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLES-DASGLSLPEIRNAQGLADVLM

Query:  EMLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTIN--NEDDESEDD
        +MLGALDP  PEG+K+E+IVDLV+QCR+YQ+RVM LVN+T DEEL+CQGLALND LQRV++ HDD A G +   T     + +P+V+IN  ++DDES+DD
Subjt:  EMLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTIN--NEDDESEDD

Query:  FAQLARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATS-------------IPPSPSPVS
        F QLA RS R++++G  +   N         P+LPPPPSS +PV   S  +D+LSGD YK +  ET   V   ST+ S             +P S SP  
Subjt:  FAQLARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATS-------------IPPSPSPVS

Query:  HVEHSISSQPVYD--EPNPTASWGLQEPQSPSTLPPPPS-KFGQRQQFFD----QHEARGSGSSYDSLVGHTQNLSLNPPT-------PTKQEKSEDVLF
          EH++ ++PVYD  E  P A W  QEP+     PP  S +  +R ++F     QH +  S SSYD L+G ++NLSLNP         P K +K ED+LF
Subjt:  HVEHSISSQPVYD--EPNPTASWGLQEPQSPSTLPPPPS-KFGQRQQFFD----QHEARGSGSSYDSLVGHTQNLSLNPPT-------PTKQEKSEDVLF

Query:  KDLVDFAKAR--SSGSSNPN
        KDL+DFAK R  SS SS PN
Subjt:  KDLVDFAKAR--SSGSSNPN

AT1G76970.1 Target of Myb protein 19.3e-14058.15Show/hide
Query:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG
        MA++AAACAERAT+DMLIGPDWAINIELCD+INMDP QAKEA+K+LKK+L +KN K+QILAL+ALET+SKNCGENV+Q I++R +L+DMVKIVKKKP+L 
Subjt:  MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLG

Query:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLE-SDASGLSLPEIRNAQGLADVLM
        VREKILTL+DTWQEAFGG  GRYPQ Y AYN+L++AGI+FPPR E+S+ FFTPPQTQP         EDAAI ASL+  DAS LSL EI++A+G  DVLM
Subjt:  VREKILTLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLE-SDASGLSLPEIRNAQGLADVLM

Query:  EMLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALP---IVTIN--NEDDES
        +MLGA DP  PE +K+EVIVDLV+QCR+YQ+RVM LVN+T DEELLCQGLALND LQ V+++HDDIAN   +  + G  + A P   IV IN  +EDDES
Subjt:  EMLGALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALP---IVTIN--NEDDES

Query:  EDDFAQLARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQP
        +D+FA+LA RSS       +R+P +GS                       S MVD LSGD YK + + +S+ V         PP P P  H   S SS P
Subjt:  EDDFAQLARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQP

Query:  VYDEPNPTASWGLQEPQSPSTLPPPPSKFGQRQQFFD-QHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRS
        V+D+ +P  S   +  +    LPPPPS+  QRQQFF+  H + GS SSY+   G T+NLSL    P K+EK ED+LFKDLV+FAK RSS ++N NRS
Subjt:  VYDEPNPTASWGLQEPQSPSTLPPPPSKFGQRQQFFD-QHEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRS

AT3G08790.1 ENTH/VHS/GAT family protein2.5e-7642.17Show/hide
Query:  ERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLGVREKILTLI
        +RATSDMLIGPDWA+N+E+CD++N +PGQ +E +  +KK+LT++  K+Q+LAL  LET+  NCGE +  Q+ E+DILH MVK+ K+KP++ V+EKIL LI
Subjt:  ERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLGVREKILTLI

Query:  DTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPP--QTQPIVHQP--AAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLMEMLGAL
        DTWQE+F GP+GR+PQ YAAY EL  AGI FP R     P  TP   Q  P    P  +      AI  S ES+   LSL EI+NA+G+ DVL EM+ A+
Subjt:  DTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPP--QTQPIVHQP--AAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLMEMLGAL

Query:  DPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALP-----IVTINNEDDESEDDFAQ
        D    EG+KQEV+VDLV QCR+Y++RV+ LVNST DE +LCQGLALND LQR++ +H+ IA+G +  +        +P     I+ + + + ++    A 
Subjt:  DPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALP-----IVTINNEDDESEDDFAQ

Query:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSS---MVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSI----SSQ
               D   G   +  N       V PL PP PSS  PV    +   ++D LS +  +S ++ TS     H        +     H E S     SS 
Subjt:  LARRSSRDNSQGQSRKPANGSTEATRVGPLLPPPPSSKKPVLAGSS---MVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSI----SSQ

Query:  PVYD---EPNPTASWGLQEPQSPSTLPPPPSKFG
        PV++      P++     +P SP+  PP    +G
Subjt:  PVYD---EPNPTASWGLQEPQSPSTLPPPPSKFG

AT4G32760.1 ENTH/VHS/GAT family protein4.2e-8445.64Show/hide
Query:  ACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLGVREKIL
        A  ERATS+MLIGPDWA+N+E+CD++N DP QAK+ +K +KK++ ++NPK Q+LAL  LET+ KNCG+ V   + E+ ++H+MV+IVKKKPD  V+EKIL
Subjt:  ACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLGVREKIL

Query:  TLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPA----AAFEDAAIHASLESDASGLSLPEIRNAQGLADVLMEML
         LIDTWQEAFGGPR RYPQ YA Y EL  AG  FP R E S P FTPPQTQP+   P     A   +     S E +   LSL EI+NA+G+ DVL EML
Subjt:  TLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPA----AAFEDAAIHASLESDASGLSLPEIRNAQGLADVLMEML

Query:  GALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQLA
         AL+P   E +KQEV+VDLV+QCR+Y++RV+ LVNST DE LLCQGLALND LQRV+  ++ IA+G             LP  +   E  +SE     L 
Subjt:  GALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQLA

Query:  RRSSRDNSQGQSRKPANGSTEATRVGPL----LPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSP-SPVSHVEHSISSQPVYD
                 G S   ANG+T ++  G L    LP PP +     + +S +D LSGD            VPV       PP P SPV+  +++++   ++ 
Subjt:  RRSSRDNSQGQSRKPANGSTEATRVGPL----LPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSP-SPVSHVEHSISSQPVYD

Query:  E----PNP-TASWG---LQEPQSPSTLPPPPSKFGQ
        +    P+P TA  G      P +P     P S+ G+
Subjt:  E----PNP-TASWG---LQEPQSPSTLPPPPSKFGQ

AT4G32760.2 ENTH/VHS/GAT family protein4.2e-8445.64Show/hide
Query:  ACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLGVREKIL
        A  ERATS+MLIGPDWA+N+E+CD++N DP QAK+ +K +KK++ ++NPK Q+LAL  LET+ KNCG+ V   + E+ ++H+MV+IVKKKPD  V+EKIL
Subjt:  ACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLGVREKIL

Query:  TLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPA----AAFEDAAIHASLESDASGLSLPEIRNAQGLADVLMEML
         LIDTWQEAFGGPR RYPQ YA Y EL  AG  FP R E S P FTPPQTQP+   P     A   +     S E +   LSL EI+NA+G+ DVL EML
Subjt:  TLIDTWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPA----AAFEDAAIHASLESDASGLSLPEIRNAQGLADVLMEML

Query:  GALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQLA
         AL+P   E +KQEV+VDLV+QCR+Y++RV+ LVNST DE LLCQGLALND LQRV+  ++ IA+G             LP  +   E  +SE     L 
Subjt:  GALDPKKPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQLA

Query:  RRSSRDNSQGQSRKPANGSTEATRVGPL----LPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSP-SPVSHVEHSISSQPVYD
                 G S   ANG+T ++  G L    LP PP +     + +S +D LSGD            VPV       PP P SPV+  +++++   ++ 
Subjt:  RRSSRDNSQGQSRKPANGSTEATRVGPL----LPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSP-SPVSHVEHSISSQPVYD

Query:  E----PNP-TASWG---LQEPQSPSTLPPPPSKFGQ
        +    P+P TA  G      P +P     P S+ G+
Subjt:  E----PNP-TASWG---LQEPQSPSTLPPPPSKFGQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAGTAATGCAGCAGCCTGCGCAGAGAGGGCAACCAGTGATATGCTGATTGGTCCAGACTGGGCCATAAACATTGAGCTATGCGATATCATTAACATGGACCCTGG
GCAAGCAAAGGAGGCATTAAAGATACTCAAGAAGAAGCTAACAAACAAAAATCCCAAGATTCAGATTTTGGCTCTTTTTGCTCTGGAGACTGTCAGCAAAAATTGTGGTG
AGAATGTGTTTCAGCAGATTGTTGAGCGTGACATCCTGCATGACATGGTTAAAATAGTGAAGAAGAAGCCAGACTTGGGTGTGAGGGAGAAAATACTAACTCTGATAGAT
ACATGGCAAGAAGCATTTGGGGGACCAAGGGGAAGGTATCCCCAGTGCTATGCTGCTTATAATGAATTAAAGAATGCTGGAATTGATTTTCCACCAAGAGAAGAGAATAG
TGTTCCGTTTTTTACTCCACCTCAAACCCAGCCCATTGTTCATCAACCTGCTGCAGCTTTTGAGGATGCTGCAATTCATGCCTCTCTTGAATCTGATGCTTCTGGGCTGA
GCTTGCCAGAAATTCGAAATGCACAGGGGCTTGCAGATGTGCTAATGGAAATGCTTGGTGCCCTGGACCCTAAAAAACCAGAGGGTGTAAAGCAAGAAGTGATTGTTGAC
CTTGTTGACCAGTGCCGGTCATATCAAAAGCGTGTCATGCTGCTTGTAAATAGTACCGGAGATGAGGAACTTTTATGTCAGGGATTGGCTTTGAATGATACTCTGCAGCG
AGTGATTAAACAGCATGATGATATTGCGAATGGAACTGCTAATAGAGAAACAACAGGAGCTGAATCATCAGCTCTTCCAATTGTAACTATCAACAATGAGGATGATGAAT
CAGAAGATGATTTTGCTCAGTTAGCTCGGAGATCATCTAGAGACAATTCTCAAGGACAGAGCAGAAAACCAGCCAATGGTAGTACTGAAGCGACCCGAGTTGGACCTCTT
CTACCTCCTCCGCCATCATCAAAGAAGCCAGTTCTTGCAGGTAGCAGCATGGTTGATTACCTTAGTGGTGATACTTACAAATCAGAAGCTTCCGAAACTTCACGTACGGT
TCCAGTTCACTCAACTGCAACTTCAATCCCGCCATCTCCTTCACCCGTCTCCCATGTGGAACACTCCATCTCATCACAACCTGTGTATGATGAACCAAATCCCACGGCAT
CATGGGGCTTGCAGGAGCCCCAAAGCCCCTCCACCCTTCCACCTCCACCATCGAAATTCGGTCAGAGACAGCAATTTTTTGACCAACATGAGGCTCGCGGGTCGGGTTCC
TCATACGACAGCTTAGTTGGACATACCCAGAACCTCTCTCTCAATCCCCCAACTCCAACTAAACAGGAAAAGTCAGAAGATGTGCTATTCAAAGATCTGGTTGATTTCGC
CAAAGCTCGGTCATCTGGATCGTCGAATCCAAACCGGTCTTTC
mRNA sequenceShow/hide mRNA sequence
ATGGCCAGTAATGCAGCAGCCTGCGCAGAGAGGGCAACCAGTGATATGCTGATTGGTCCAGACTGGGCCATAAACATTGAGCTATGCGATATCATTAACATGGACCCTGG
GCAAGCAAAGGAGGCATTAAAGATACTCAAGAAGAAGCTAACAAACAAAAATCCCAAGATTCAGATTTTGGCTCTTTTTGCTCTGGAGACTGTCAGCAAAAATTGTGGTG
AGAATGTGTTTCAGCAGATTGTTGAGCGTGACATCCTGCATGACATGGTTAAAATAGTGAAGAAGAAGCCAGACTTGGGTGTGAGGGAGAAAATACTAACTCTGATAGAT
ACATGGCAAGAAGCATTTGGGGGACCAAGGGGAAGGTATCCCCAGTGCTATGCTGCTTATAATGAATTAAAGAATGCTGGAATTGATTTTCCACCAAGAGAAGAGAATAG
TGTTCCGTTTTTTACTCCACCTCAAACCCAGCCCATTGTTCATCAACCTGCTGCAGCTTTTGAGGATGCTGCAATTCATGCCTCTCTTGAATCTGATGCTTCTGGGCTGA
GCTTGCCAGAAATTCGAAATGCACAGGGGCTTGCAGATGTGCTAATGGAAATGCTTGGTGCCCTGGACCCTAAAAAACCAGAGGGTGTAAAGCAAGAAGTGATTGTTGAC
CTTGTTGACCAGTGCCGGTCATATCAAAAGCGTGTCATGCTGCTTGTAAATAGTACCGGAGATGAGGAACTTTTATGTCAGGGATTGGCTTTGAATGATACTCTGCAGCG
AGTGATTAAACAGCATGATGATATTGCGAATGGAACTGCTAATAGAGAAACAACAGGAGCTGAATCATCAGCTCTTCCAATTGTAACTATCAACAATGAGGATGATGAAT
CAGAAGATGATTTTGCTCAGTTAGCTCGGAGATCATCTAGAGACAATTCTCAAGGACAGAGCAGAAAACCAGCCAATGGTAGTACTGAAGCGACCCGAGTTGGACCTCTT
CTACCTCCTCCGCCATCATCAAAGAAGCCAGTTCTTGCAGGTAGCAGCATGGTTGATTACCTTAGTGGTGATACTTACAAATCAGAAGCTTCCGAAACTTCACGTACGGT
TCCAGTTCACTCAACTGCAACTTCAATCCCGCCATCTCCTTCACCCGTCTCCCATGTGGAACACTCCATCTCATCACAACCTGTGTATGATGAACCAAATCCCACGGCAT
CATGGGGCTTGCAGGAGCCCCAAAGCCCCTCCACCCTTCCACCTCCACCATCGAAATTCGGTCAGAGACAGCAATTTTTTGACCAACATGAGGCTCGCGGGTCGGGTTCC
TCATACGACAGCTTAGTTGGACATACCCAGAACCTCTCTCTCAATCCCCCAACTCCAACTAAACAGGAAAAGTCAGAAGATGTGCTATTCAAAGATCTGGTTGATTTCGC
CAAAGCTCGGTCATCTGGATCGTCGAATCCAAACCGGTCTTTC
Protein sequenceShow/hide protein sequence
MASNAAACAERATSDMLIGPDWAINIELCDIINMDPGQAKEALKILKKKLTNKNPKIQILALFALETVSKNCGENVFQQIVERDILHDMVKIVKKKPDLGVREKILTLID
TWQEAFGGPRGRYPQCYAAYNELKNAGIDFPPREENSVPFFTPPQTQPIVHQPAAAFEDAAIHASLESDASGLSLPEIRNAQGLADVLMEMLGALDPKKPEGVKQEVIVD
LVDQCRSYQKRVMLLVNSTGDEELLCQGLALNDTLQRVIKQHDDIANGTANRETTGAESSALPIVTINNEDDESEDDFAQLARRSSRDNSQGQSRKPANGSTEATRVGPL
LPPPPSSKKPVLAGSSMVDYLSGDTYKSEASETSRTVPVHSTATSIPPSPSPVSHVEHSISSQPVYDEPNPTASWGLQEPQSPSTLPPPPSKFGQRQQFFDQHEARGSGS
SYDSLVGHTQNLSLNPPTPTKQEKSEDVLFKDLVDFAKARSSGSSNPNRSF