; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g1305 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g1305
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionTOM1-like protein 4 isoform X1
Genome locationMC04:21075996..21079650
RNA-Seq ExpressionMC04g1305
SyntenyMC04g1305
Gene Ontology termsGO:0043328 - protein transport to vacuole involved in ubiquitin-dependent protein catabolic process via the multivesicular body sorting pathway (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0035091 - phosphatidylinositol binding (molecular function)
GO:0043130 - ubiquitin binding (molecular function)
InterPro domainsIPR002014 - VHS domain
IPR004152 - GAT domain
IPR008942 - ENTH/VHS
IPR038425 - GAT domain superfamily
IPR044836 - TOM1-like protein, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008441589.1 PREDICTED: target of Myb protein 1 [Cucumis melo]5.95e-27779.56Show/hide
Query:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS
        MSTNAAACAERATND+LIAPDWAINIELCDI+NMDPRQ KDALKILKKRL SKNPK QLLALYALEALSKNCGD V KLIVDR ILHEMVKIVKKKQPDS
Subjt:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS

Query:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV
         VR+KIL LVDAWQ AFGGGSKGKYPQYYAAY +LK  AGFQFPPREENV QFFSPP+ QP +EDPVSAYDD +VQASLQSD+S LSLPEIQNAQGL DV
Subjt:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV

Query:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD
        L+EMLGALDPKTPEALKQEVIVDLVDQCRSYHSRV++LVNETTDEELLCQGLVLNDSLQRVLS+HD+IAKGT  T  RR EP VP+VPY+NPEDDES+DD
Subjt:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD

Query:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI
        FTPL+RR TRDHIYERDRKL+NGQSSRVSPLPSPSSKK   V+MIDHLSGD YKP+GSP+  +PPS            +S+SPF+T RQPLFDEPPPR +
Subjt:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI

Query:  STNPL----RDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKP
         T+PL    RD QSP  LPPPPSRYNQRQQ+FEQQKA TGG  PHL N     S DNIVG TK LSL P T TRSA+HEEALFK+LVDFA  KSS SSK 
Subjt:  STNPL----RDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKP

Query:  SRPF
        +RPF
Subjt:  SRPF

XP_022141604.1 TOM1-like protein 4 isoform X1 [Momordica charantia]0.099.6Show/hide
Query:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS
        MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS
Subjt:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS

Query:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV
        NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELK  AGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV
Subjt:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV

Query:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD
        LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD
Subjt:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD

Query:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI
        FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI
Subjt:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI

Query:  STNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKPSRPF
        STNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKPSRPF
Subjt:  STNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKPSRPF

XP_022141608.1 TOM1-like protein 4 isoform X2 [Momordica charantia]0.099.4Show/hide
Query:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS
        MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKK PDS
Subjt:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS

Query:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV
        NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELK  AGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV
Subjt:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV

Query:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD
        LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD
Subjt:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD

Query:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI
        FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI
Subjt:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI

Query:  STNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKPSRPF
        STNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKPSRPF
Subjt:  STNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKPSRPF

XP_023551153.1 TOM1-like protein 4 isoform X1 [Cucurbita pepo subsp. pepo]8.64e-27779.88Show/hide
Query:  MSTNAAA-CAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPD
        MSTNAAA CAERATND LIAPDWAINIELCDI+NMDPRQ KDALKILKKRLASKNPKTQLLALYAL+ALSKNCGD V KLIVDR ILHEMVKIVKKKQPD
Subjt:  MSTNAAA-CAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPD

Query:  SNVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSD
        S VRDKILTLVDAWQ  FGGGSKGK PQYYAAY ELK  AGF+FPPR ENVGQF SPP+I P +E   S YDD + Q SLQSDAS LSLPEIQN QGL+D
Subjt:  SNVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSD

Query:  VLMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPS-VPTVPYMNPEDDESD
        VL+EML ALDPKTPEALKQEVIVDLVDQCRSY SRV++LVNE+TDEE LCQGLVLNDSLQRVLS+HDDIAKGT  T  RRTEP  VP+VPYM+PE+DES+
Subjt:  VLMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPS-VPTVPYMNPEDDESD

Query:  DDFTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPR
        DDFTPLARRSTRDHIY RDRKL+NGQSSRVSPLPSPS KK    +MIDHLSGD YK +GSPRT EPPSY P   PPSPTTSS+SPF+T RQPLF EPPPR
Subjt:  DDFTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPR

Query:  MISTNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKPSR
         +STNP   T  P SLPPPPSRYNQRQQ+FEQQKAVTGG+ PHL NG   +S D IVG TKNLSLGPSTPTR+A+HEEALFK+LVDF+   +S SSK +R
Subjt:  MISTNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKPSR

Query:  PF
        PF
Subjt:  PF

XP_038886343.1 TOM1-like protein 4 isoform X1 [Benincasa hispida]9.67e-27679.96Show/hide
Query:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS
        MSTNA ACAERATND+L APDWAINIELCDI+NMDPRQ KDALKILKKRLA+KNPK QLLALYALEALSKNCGD V KLIVDR ILHEMVKIVKKKQPDS
Subjt:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS

Query:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV
         VRDKILTLVDAWQ A GGGSKGK+PQYYAAY ELK  AGFQFPPREENV QFFSPP+IQP +E PVSAYDD +VQASLQSD+S L LPEIQNAQ L+ V
Subjt:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV

Query:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD
        L+EMLGALDPKTPEALKQEVIVDLVDQCRSYHSRV++LVNETTDEELL QGLVLNDSLQRVLS HDDIAKGT     R  EP VP+VPY+NPEDD+S+DD
Subjt:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD

Query:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI
        FTPL+RR TRD+IYERDRKL+NG SSRVSPLPSPSSKK   V+MIDHLSGD YKP+GSPR  EPPS           TSS SPF+T RQPLFDEPPPR I
Subjt:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI

Query:  STNPL----RDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKP
        STNPL    RDTQSP +LPPPPSRYNQRQQ+FEQQKAVTGGS PHL N    SS DNIVG TKNLSL P TPTRS +HEE LFK+LVDFA  KSS SSK 
Subjt:  STNPL----RDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKP

Query:  SRPF
        +RPF
Subjt:  SRPF

TrEMBL top hitse value%identityAlignment
A0A1S3B3S9 target of Myb protein 12.88e-27779.56Show/hide
Query:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS
        MSTNAAACAERATND+LIAPDWAINIELCDI+NMDPRQ KDALKILKKRL SKNPK QLLALYALEALSKNCGD V KLIVDR ILHEMVKIVKKKQPDS
Subjt:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS

Query:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV
         VR+KIL LVDAWQ AFGGGSKGKYPQYYAAY +LK  AGFQFPPREENV QFFSPP+ QP +EDPVSAYDD +VQASLQSD+S LSLPEIQNAQGL DV
Subjt:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV

Query:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD
        L+EMLGALDPKTPEALKQEVIVDLVDQCRSYHSRV++LVNETTDEELLCQGLVLNDSLQRVLS+HD+IAKGT  T  RR EP VP+VPY+NPEDDES+DD
Subjt:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD

Query:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI
        FTPL+RR TRDHIYERDRKL+NGQSSRVSPLPSPSSKK   V+MIDHLSGD YKP+GSP+  +PPS            +S+SPF+T RQPLFDEPPPR +
Subjt:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI

Query:  STNPL----RDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKP
         T+PL    RD QSP  LPPPPSRYNQRQQ+FEQQKA TGG  PHL N     S DNIVG TK LSL P T TRSA+HEEALFK+LVDFA  KSS SSK 
Subjt:  STNPL----RDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKP

Query:  SRPF
        +RPF
Subjt:  SRPF

A0A5D3DJE5 Target of Myb protein 11.06e-27579.56Show/hide
Query:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS
        MSTNAAACAERATND+LIAPDWAINIELCDI+NMDPRQ KDALKILKKRL SKNPK QLLALYALEALSKNCGD V KLIVDR ILHEMVKIVKKKQPDS
Subjt:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS

Query:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV
         VR+KIL LVDAWQ AFGGGSKGKYPQYYAAY +LK  AGFQFPPREENV QFFSPP+ QP +EDPVSAYDD +VQASLQSD+S LSLPEIQNAQGL DV
Subjt:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV

Query:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD
        L+EMLGALDPKTPEALKQEVIVDLVDQCRSYHSRV++LVNETTDEELLCQGLVLNDSLQRVLS+HD+IAKGT  T  RR EP VP+VPY+NPEDDES+DD
Subjt:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD

Query:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI
        FTPL+RR TRDHIYERDRKL+NGQSSRVSPLPSPSSKK   V+MIDHLSGD YKP+GSP+  +PPS            +S+SPF+T RQPLFDEPPPR +
Subjt:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI

Query:  STNPL----RDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKP
         T+PL    RD QSP  LPPPPSRYNQRQQ+FEQQKA TGG  PHL N     S DNIVG TK LSL P T TRSA+HEEALFK+LVDFA  KSS SSK 
Subjt:  STNPL----RDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKP

Query:  SRPF
        +RPF
Subjt:  SRPF

A0A6J1CJP5 TOM1-like protein 4 isoform X10.099.6Show/hide
Query:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS
        MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS
Subjt:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS

Query:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV
        NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELK  AGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV
Subjt:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV

Query:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD
        LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD
Subjt:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD

Query:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI
        FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI
Subjt:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI

Query:  STNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKPSRPF
        STNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKPSRPF
Subjt:  STNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKPSRPF

A0A6J1CL02 TOM1-like protein 4 isoform X20.099.4Show/hide
Query:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS
        MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKK PDS
Subjt:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS

Query:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV
        NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELK  AGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV
Subjt:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDV

Query:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD
        LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD
Subjt:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDD

Query:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI
        FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI
Subjt:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMI

Query:  STNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKPSRPF
        STNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKPSRPF
Subjt:  STNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKPSRPF

A0A6J1FHH3 TOM1-like protein 4 isoform X11.64e-27579.68Show/hide
Query:  MSTNAAA-CAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPD
        MSTNAAA CAERATND+LIAPDWAINIELCDI+NMDPRQ KDALKILKKRLASKNPKTQLLALYAL+ALSKNCGD V KLIVDR ILHEMVKIVKKKQPD
Subjt:  MSTNAAA-CAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPD

Query:  SNVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSD
        S VRDKILTLVDAWQ  FGGGSKGKYPQYYAAY ELK  AGF+FPPR ENVGQF SPP+I P +E  VS YDD + Q SLQSDAS LSLPEIQNAQGL+D
Subjt:  SNVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSD

Query:  VLMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPS-VPTVPYMNPEDDESD
        VL+E+LGALD KTPEALKQEVIVDLVDQCRSY SRV++LVNE+TDEELLCQGLVLNDSLQRVLS+HDDIAKGT  T  RRTEP  VP+VPYM+PE+DES+
Subjt:  VLMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPS-VPTVPYMNPEDDESD

Query:  DDFTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPR
        DDFTPLARRSTRDHIY RDRKL+NGQSSRVSPLPSPS KK    +MIDHLSGD YK +GSPRT EPPSY P     SPTTSS+SPF+T RQPLF EPPPR
Subjt:  DDFTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPR

Query:  MISTNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKPSR
         +STNP   T  P SLPPPPSRYNQRQQ+FEQQKAVTGG+ PHL NG   +S D IVG TKNLSLGPSTP R+A+HEEALFK+L+DF+   +S SSK +R
Subjt:  MISTNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKPSR

Query:  PF
        PF
Subjt:  PF

SwissProt top hitse value%identityAlignment
O80910 TOM1-like protein 68.3e-5633.14Show/hide
Query:  STNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDSN
        S +A    ++AT+D+L+ PDW  N+E+CD VN    Q KD +K +KKRL  K+ + QLLAL  LE L KNCGD +   + ++ IL EMVKIVKKK  D  
Subjt:  STNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDSN

Query:  VRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAY-------------------------------
        VRDKIL +VD+WQ+AF GG +GKYPQYY AY EL+ ++G +FP R  +     +PP   P L  P   Y                               
Subjt:  VRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAY-------------------------------

Query:  ----------------DDFSVQASLQSDASDLSLPEIQNAQGLSDVLMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVL
                            +  ++ ++   LSL  I++ + + D+L +ML A+DP   EA+K EVIVDLV++CRS   ++M ++  T D+ELL +GL L
Subjt:  ----------------DDFSVQASLQSDASDLSLPEIQNAQGLSDVLMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVL

Query:  NDSLQRVLSFHDDIAKGTLVTAGRRTEP--------------------------SVPTVPY--------MNPEDDESDDDFTPLARRS-------TRDHI
        NDSLQ +L+ HD IA G+ +       P                          S   +P         ++ E +E +D+F  LARR        T D  
Subjt:  NDSLQRVLSFHDDIAKGTLVTAGRRTEP--------------------------SVPTVPY--------MNPEDDESDDDFTPLARRS-------TRDHI

Query:  YERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMIS-TNPLRDTQSPG
               ++   +   P P P        DMID LS     P   P  +  PS PPP      T             ++ +P PR  S   P    Q P 
Subjt:  YERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMIS-TNPLRDTQSPG

Query:  SLPPPPSRYNQRQQFFEQQ
          P     Y+Q QQ  +QQ
Subjt:  SLPPPPSRYNQRQQFFEQQ

Q6NQK0 TOM1-like protein 45.6e-12152.88Show/hide
Query:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS
        M+ +AAACAERATNDMLI PDWAINIELCD++NMDP Q K+A+K+LKKRL SKN K Q+LALYALE LSKNCG+NV +LI+DR +L++MVKIVKKK P+ 
Subjt:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS

Query:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQ-SDASDLSLPEIQNAQGLSD
        NVR+KILTL+D WQEAFGG   G+YPQYY AY +L++ AG +FPPR E+   FF+PP+ QP         +D ++QASLQ  DAS LSL EIQ+A+G  D
Subjt:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQ-SDASDLSLPEIQNAQGLSD

Query:  VLMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAK-GTLVTAGRRTE--PSVPTVPY-MNPEDD
        VLM+MLGA DP  PE+LK+EVIVDLV+QCR+Y  RVM LVN TTDEELLCQGL LND+LQ VL  HDDIA  G++ + GR T   P V  V    + EDD
Subjt:  VLMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAK-GTLVTAGRRTE--PSVPTVPY-MNPEDD

Query:  ESDDDFTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEP
        ESDD+F  LA RS+        R+  +G  S                 M+D LSGD YKPQG+  +      PPP  PP  ++SS+S       P+FD+ 
Subjt:  ESDDDFTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEP

Query:  PPRMISTNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSK
         P+       + ++   +LPPPPSR+NQRQQFFE   + +G             SD +  GQT+NLSL  S P +    E+ LFK+LV+FA T+SS ++ 
Subjt:  PPRMISTNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSK

Query:  PSR
         +R
Subjt:  PSR

Q8L860 TOM1-like protein 91.5e-7346.45Show/hide
Query:  ACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDSNVRDKI
        A  ERAT++MLI PDWA+N+E+CD++N DP Q KD +K +KKR+ S+NPK QLLAL  LE + KNCGD V   + ++ ++HEMV+IVKKK PD +V++KI
Subjt:  ACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDSNVRDKI

Query:  LTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYD----DFSVQASLQSDASDLSLPEIQNAQGLSDVLM
        L L+D WQEAF GG + +YPQYYA Y EL  +AG  FP R E     F+PP+ QP    P +  +    +   + S + +   LSL EIQNA+G+ DVL 
Subjt:  LTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYD----DFSVQASLQSDASDLSLPEIQNAQGLSDVLM

Query:  EMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDD--D
        EML AL+P   E LKQEV+VDLV+QCR+Y  RV+ LVN T+DE LLCQGL LND LQRVL+ ++ IA G   T+ +  +P   T   +   D    D  D
Subjt:  EMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDD--D

Query:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGD--TYKPQGSPRTAEP
         +  A  +T            NG  ++++ LP+P          ID LSGD     P G P+ A P
Subjt:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGD--TYKPQGSPRTAEP

Q9C9Y1 TOM1-like protein 87.7e-6246.5Show/hide
Query:  ERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDSNVRDKILTL
        +RAT+DMLI PDWA+N+E+CD++N +P Q ++ +  +KKRL S+  K QLLAL  LE +  NCG+ +   + ++ ILH+MVK+ K+K P+  V++KIL L
Subjt:  ERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDSNVRDKILTL

Query:  VDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPT--LEDPVSAYDDFS-------VQASLQSDASDLSLPEIQNAQGLSDV
        +D WQE+F  G +G++PQYYAAY EL  +AG  FP R          P+I P+     P + Y   S       +  S +S+   LSL EIQNA+G+ DV
Subjt:  VDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPT--LEDPVSAYDDFS-------VQASLQSDASDLSLPEIQNAQGLSDV

Query:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKG-TLVTAGRRTEPSVP
        L EM+ A+D    E LKQEV+VDLV QCR+Y  RV+ LVN T+DE +LCQGL LND LQR+L+ H+ IA G +++    +++  VP
Subjt:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKG-TLVTAGRRTEPSVP

Q9LPL6 TOM1-like protein 31.0e-12251.99Show/hide
Query:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS
        M+ NAAACAERATNDMLI PDWAINIELCDI+NM+P Q K+A+K+LKKRL SKN K Q+LALYALE LSKNCG++V +LIVDR IL +MVKIVKKK PD 
Subjt:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS

Query:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQS-DASDLSLPEIQNAQGLSD
         VR+KIL+L+D WQEAF GGS G++PQYY AY EL++ AG +FPPR E+   FF+PP+ QP +    ++ +D ++QASLQS DAS LS+ EIQ+AQG  D
Subjt:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQS-DASDLSLPEIQNAQGLSD

Query:  VLMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDD
        VL +MLGALDP  PE LK+E+IVDLV+QCR+Y  RVM LVN T+DEEL+CQGL LND+LQRVL  HDD AKG  V A   T   + ++ + + +DDESDD
Subjt:  VLMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDD

Query:  DFTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVD--MIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPP
        DF  LA RS R    E  R    G  + + P P PSS + + VD   +D LSGD YKPQ +    +PPS            +S S  H    P+FDEP P
Subjt:  DFTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVD--MIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPP

Query:  RMIS------TNPLRD-----------TQSPGSLPPPPS-RYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPS------TPTRSAD
        +  S      T P+ D           TQ P   PP  S R N+R ++F+        ++P   +    SS D+++GQ++NLSL P+      TP +  D
Subjt:  RMIS------TNPLRD-----------TQSPGSLPPPPS-RYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPS------TPTRSAD

Query:  H-EEALFKELVDFANTK--SSPSSKPS
          E+ LFK+L+DFA T+  SS SSKP+
Subjt:  H-EEALFKELVDFANTK--SSPSSKPS

Arabidopsis top hitse value%identityAlignment
AT1G21380.1 Target of Myb protein 17.3e-12451.99Show/hide
Query:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS
        M+ NAAACAERATNDMLI PDWAINIELCDI+NM+P Q K+A+K+LKKRL SKN K Q+LALYALE LSKNCG++V +LIVDR IL +MVKIVKKK PD 
Subjt:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS

Query:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQS-DASDLSLPEIQNAQGLSD
         VR+KIL+L+D WQEAF GGS G++PQYY AY EL++ AG +FPPR E+   FF+PP+ QP +    ++ +D ++QASLQS DAS LS+ EIQ+AQG  D
Subjt:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQS-DASDLSLPEIQNAQGLSD

Query:  VLMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDD
        VL +MLGALDP  PE LK+E+IVDLV+QCR+Y  RVM LVN T+DEEL+CQGL LND+LQRVL  HDD AKG  V A   T   + ++ + + +DDESDD
Subjt:  VLMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDD

Query:  DFTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVD--MIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPP
        DF  LA RS R    E  R    G  + + P P PSS + + VD   +D LSGD YKPQ +    +PPS            +S S  H    P+FDEP P
Subjt:  DFTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVD--MIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPP

Query:  RMIS------TNPLRD-----------TQSPGSLPPPPS-RYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPS------TPTRSAD
        +  S      T P+ D           TQ P   PP  S R N+R ++F+        ++P   +    SS D+++GQ++NLSL P+      TP +  D
Subjt:  RMIS------TNPLRD-----------TQSPGSLPPPPS-RYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPS------TPTRSAD

Query:  H-EEALFKELVDFANTK--SSPSSKPS
          E+ LFK+L+DFA T+  SS SSKP+
Subjt:  H-EEALFKELVDFANTK--SSPSSKPS

AT1G76970.1 Target of Myb protein 14.0e-12252.88Show/hide
Query:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS
        M+ +AAACAERATNDMLI PDWAINIELCD++NMDP Q K+A+K+LKKRL SKN K Q+LALYALE LSKNCG+NV +LI+DR +L++MVKIVKKK P+ 
Subjt:  MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDS

Query:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQ-SDASDLSLPEIQNAQGLSD
        NVR+KILTL+D WQEAFGG   G+YPQYY AY +L++ AG +FPPR E+   FF+PP+ QP         +D ++QASLQ  DAS LSL EIQ+A+G  D
Subjt:  NVRDKILTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQ-SDASDLSLPEIQNAQGLSD

Query:  VLMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAK-GTLVTAGRRTE--PSVPTVPY-MNPEDD
        VLM+MLGA DP  PE+LK+EVIVDLV+QCR+Y  RVM LVN TTDEELLCQGL LND+LQ VL  HDDIA  G++ + GR T   P V  V    + EDD
Subjt:  VLMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAK-GTLVTAGRRTE--PSVPTVPY-MNPEDD

Query:  ESDDDFTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEP
        ESDD+F  LA RS+        R+  +G  S                 M+D LSGD YKPQG+  +      PPP  PP  ++SS+S       P+FD+ 
Subjt:  ESDDDFTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEP

Query:  PPRMISTNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSK
         P+       + ++   +LPPPPSR+NQRQQFFE   + +G             SD +  GQT+NLSL  S P +    E+ LFK+LV+FA T+SS ++ 
Subjt:  PPRMISTNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLPHLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSK

Query:  PSR
         +R
Subjt:  PSR

AT3G08790.1 ENTH/VHS/GAT family protein5.5e-6346.5Show/hide
Query:  ERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDSNVRDKILTL
        +RAT+DMLI PDWA+N+E+CD++N +P Q ++ +  +KKRL S+  K QLLAL  LE +  NCG+ +   + ++ ILH+MVK+ K+K P+  V++KIL L
Subjt:  ERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDSNVRDKILTL

Query:  VDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPT--LEDPVSAYDDFS-------VQASLQSDASDLSLPEIQNAQGLSDV
        +D WQE+F  G +G++PQYYAAY EL  +AG  FP R          P+I P+     P + Y   S       +  S +S+   LSL EIQNA+G+ DV
Subjt:  VDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPT--LEDPVSAYDDFS-------VQASLQSDASDLSLPEIQNAQGLSDV

Query:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKG-TLVTAGRRTEPSVP
        L EM+ A+D    E LKQEV+VDLV QCR+Y  RV+ LVN T+DE +LCQGL LND LQR+L+ H+ IA G +++    +++  VP
Subjt:  LMEMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKG-TLVTAGRRTEPSVP

AT4G32760.1 ENTH/VHS/GAT family protein1.1e-7446.45Show/hide
Query:  ACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDSNVRDKI
        A  ERAT++MLI PDWA+N+E+CD++N DP Q KD +K +KKR+ S+NPK QLLAL  LE + KNCGD V   + ++ ++HEMV+IVKKK PD +V++KI
Subjt:  ACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDSNVRDKI

Query:  LTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYD----DFSVQASLQSDASDLSLPEIQNAQGLSDVLM
        L L+D WQEAF GG + +YPQYYA Y EL  +AG  FP R E     F+PP+ QP    P +  +    +   + S + +   LSL EIQNA+G+ DVL 
Subjt:  LTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYD----DFSVQASLQSDASDLSLPEIQNAQGLSDVLM

Query:  EMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDD--D
        EML AL+P   E LKQEV+VDLV+QCR+Y  RV+ LVN T+DE LLCQGL LND LQRVL+ ++ IA G   T+ +  +P   T   +   D    D  D
Subjt:  EMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDD--D

Query:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGD--TYKPQGSPRTAEP
         +  A  +T            NG  ++++ LP+P          ID LSGD     P G P+ A P
Subjt:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGD--TYKPQGSPRTAEP

AT4G32760.2 ENTH/VHS/GAT family protein1.1e-7446.45Show/hide
Query:  ACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDSNVRDKI
        A  ERAT++MLI PDWA+N+E+CD++N DP Q KD +K +KKR+ S+NPK QLLAL  LE + KNCGD V   + ++ ++HEMV+IVKKK PD +V++KI
Subjt:  ACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDSNVRDKI

Query:  LTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYD----DFSVQASLQSDASDLSLPEIQNAQGLSDVLM
        L L+D WQEAF GG + +YPQYYA Y EL  +AG  FP R E     F+PP+ QP    P +  +    +   + S + +   LSL EIQNA+G+ DVL 
Subjt:  LTLVDAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYD----DFSVQASLQSDASDLSLPEIQNAQGLSDVLM

Query:  EMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDD--D
        EML AL+P   E LKQEV+VDLV+QCR+Y  RV+ LVN T+DE LLCQGL LND LQRVL+ ++ IA G   T+ +  +P   T   +   D    D  D
Subjt:  EMLGALDPKTPEALKQEVIVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDD--D

Query:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGD--TYKPQGSPRTAEP
         +  A  +T            NG  ++++ LP+P          ID LSGD     P G P+ A P
Subjt:  FTPLARRSTRDHIYERDRKLSNGQSSRVSPLPSPSSKKAIRVDMIDHLSGD--TYKPQGSPRTAEP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTACCAATGCTGCTGCTTGTGCCGAGAGAGCAACAAATGATATGCTTATTGCTCCCGATTGGGCAATAAATATTGAGCTCTGTGATATCGTTAACATGGATCCTAG
GCAAGGGAAAGATGCATTGAAGATACTCAAGAAGCGTCTAGCTAGCAAAAATCCCAAAACACAACTTCTAGCTCTTTATGCATTGGAGGCTCTAAGCAAAAACTGCGGCG
ATAATGTTTTAAAGCTGATTGTAGATCGTAAAATCCTGCACGAAATGGTTAAAATTGTAAAGAAGAAGCAGCCCGATTCAAATGTACGGGACAAGATATTAACTTTGGTA
GATGCATGGCAAGAAGCATTTGGTGGTGGCTCCAAGGGAAAGTATCCACAATACTATGCAGCGTACATTGAGTTGAAGGCGAAAGCTGGCTTTCAATTTCCACCAAGAGA
AGAGAATGTTGGGCAGTTCTTTAGTCCACCTGAGATACAGCCAACTTTAGAGGACCCTGTTTCGGCTTACGATGATTTTTCTGTGCAGGCCTCTCTCCAGTCTGATGCTT
CTGATTTAAGCTTGCCAGAAATTCAAAATGCACAGGGACTATCAGATGTTTTAATGGAAATGCTTGGTGCACTGGACCCTAAGACTCCGGAGGCTTTGAAGCAAGAAGTG
ATTGTTGATCTTGTTGACCAATGCCGTTCCTACCACAGCCGTGTCATGATGCTTGTGAATGAGACCACAGACGAGGAACTTTTATGTCAAGGGTTGGTGTTGAATGATAG
TCTGCAGCGTGTACTTAGCTTCCATGACGACATTGCAAAAGGAACATTAGTGACAGCAGGTAGAAGAACAGAACCTTCTGTTCCAACGGTTCCCTATATGAACCCTGAAG
ATGATGAGTCGGATGATGATTTTACCCCATTAGCTCGCAGGTCAACGAGAGATCACATTTATGAAAGGGACAGGAAACTGTCGAATGGTCAATCATCCCGAGTTAGTCCA
CTTCCTTCACCCTCATCAAAGAAGGCGATCCGCGTGGACATGATTGATCATCTGAGCGGCGACACGTACAAACCTCAAGGATCTCCAAGGACAGCAGAGCCTCCATCCTA
TCCACCTCCAGTTTTCCCACCTTCACCAACAACTTCATCTACCTCACCTTTCCACACTACTAGGCAGCCTCTGTTCGACGAACCACCTCCAAGAATGATATCCACGAATC
CACTCCGGGACACCCAATCTCCGGGCTCGCTCCCTCCCCCACCCTCGAGATATAATCAAAGACAACAATTTTTTGAGCAACAAAAAGCTGTTACAGGAGGCAGTCTGCCC
CATTTGGGCAACGGTACTTATGGTTCATCTGATGACAACATAGTGGGACAAACTAAGAATCTGTCGCTCGGTCCTTCCACTCCAACCAGATCCGCAGACCATGAAGAAGC
CCTTTTCAAAGAACTGGTGGATTTTGCCAATACCAAATCATCTCCATCATCAAAACCCAGCCGACCATTCTGA
mRNA sequenceShow/hide mRNA sequence
TTTTACTTTAGTCTTTCTGCTTTATGGTTCCATTCCTGTAAAATGAACTCACGGGATTTTACTTTGAGTAGATAATGGAACTATTATGGTTTATGTGATGATTTTTATGT
GGGCTGTCGCCTGGATCATCTGTTGTTTGGCTCACATTCGAGTCCGCCCTCTTCTTCTTTATTTGCGAGGAAATTGTTGATGGGGATTCTCCTCGTGTTAATCCTAGTTC
AGATTCATATGCATAATCACTTAATATTTTTCCCCTTTTCAGAATTGCTACTTTTTTAAAGGAGATTCAAGGACGAAATACCTTATTAGGGTGTAATAGATTCAGAGGAA
CATGTCTACCAATGCTGCTGCTTGTGCCGAGAGAGCAACAAATGATATGCTTATTGCTCCCGATTGGGCAATAAATATTGAGCTCTGTGATATCGTTAACATGGATCCTA
GGCAAGGGAAAGATGCATTGAAGATACTCAAGAAGCGTCTAGCTAGCAAAAATCCCAAAACACAACTTCTAGCTCTTTATGCATTGGAGGCTCTAAGCAAAAACTGCGGC
GATAATGTTTTAAAGCTGATTGTAGATCGTAAAATCCTGCACGAAATGGTTAAAATTGTAAAGAAGAAGCAGCCCGATTCAAATGTACGGGACAAGATATTAACTTTGGT
AGATGCATGGCAAGAAGCATTTGGTGGTGGCTCCAAGGGAAAGTATCCACAATACTATGCAGCGTACATTGAGTTGAAGGCGAAAGCTGGCTTTCAATTTCCACCAAGAG
AAGAGAATGTTGGGCAGTTCTTTAGTCCACCTGAGATACAGCCAACTTTAGAGGACCCTGTTTCGGCTTACGATGATTTTTCTGTGCAGGCCTCTCTCCAGTCTGATGCT
TCTGATTTAAGCTTGCCAGAAATTCAAAATGCACAGGGACTATCAGATGTTTTAATGGAAATGCTTGGTGCACTGGACCCTAAGACTCCGGAGGCTTTGAAGCAAGAAGT
GATTGTTGATCTTGTTGACCAATGCCGTTCCTACCACAGCCGTGTCATGATGCTTGTGAATGAGACCACAGACGAGGAACTTTTATGTCAAGGGTTGGTGTTGAATGATA
GTCTGCAGCGTGTACTTAGCTTCCATGACGACATTGCAAAAGGAACATTAGTGACAGCAGGTAGAAGAACAGAACCTTCTGTTCCAACGGTTCCCTATATGAACCCTGAA
GATGATGAGTCGGATGATGATTTTACCCCATTAGCTCGCAGGTCAACGAGAGATCACATTTATGAAAGGGACAGGAAACTGTCGAATGGTCAATCATCCCGAGTTAGTCC
ACTTCCTTCACCCTCATCAAAGAAGGCGATCCGCGTGGACATGATTGATCATCTGAGCGGCGACACGTACAAACCTCAAGGATCTCCAAGGACAGCAGAGCCTCCATCCT
ATCCACCTCCAGTTTTCCCACCTTCACCAACAACTTCATCTACCTCACCTTTCCACACTACTAGGCAGCCTCTGTTCGACGAACCACCTCCAAGAATGATATCCACGAAT
CCACTCCGGGACACCCAATCTCCGGGCTCGCTCCCTCCCCCACCCTCGAGATATAATCAAAGACAACAATTTTTTGAGCAACAAAAAGCTGTTACAGGAGGCAGTCTGCC
CCATTTGGGCAACGGTACTTATGGTTCATCTGATGACAACATAGTGGGACAAACTAAGAATCTGTCGCTCGGTCCTTCCACTCCAACCAGATCCGCAGACCATGAAGAAG
CCCTTTTCAAAGAACTGGTGGATTTTGCCAATACCAAATCATCTCCATCATCAAAACCCAGCCGACCATTCTGAGACACGAGTAGCGAAGACTGATGATGTTTTTTCTTT
GTTTTTGTGGTTGTAGACAGGTGATGTGGTTCCTTTGACGAGGCTGATGTATATTCATGAACAAAGTATCCATTTTGTTTTGTAGTAGAGTTAGAATTTGTTGATGTTGC
TTTGGATGGGTTGCATATAATATATATTCTATTAATTTAGTATATCTGTTTTGTTTAATGAATAATATTTAGGTTTAATTATTAGATAATG
Protein sequenceShow/hide protein sequence
MSTNAAACAERATNDMLIAPDWAINIELCDIVNMDPRQGKDALKILKKRLASKNPKTQLLALYALEALSKNCGDNVLKLIVDRKILHEMVKIVKKKQPDSNVRDKILTLV
DAWQEAFGGGSKGKYPQYYAAYIELKAKAGFQFPPREENVGQFFSPPEIQPTLEDPVSAYDDFSVQASLQSDASDLSLPEIQNAQGLSDVLMEMLGALDPKTPEALKQEV
IVDLVDQCRSYHSRVMMLVNETTDEELLCQGLVLNDSLQRVLSFHDDIAKGTLVTAGRRTEPSVPTVPYMNPEDDESDDDFTPLARRSTRDHIYERDRKLSNGQSSRVSP
LPSPSSKKAIRVDMIDHLSGDTYKPQGSPRTAEPPSYPPPVFPPSPTTSSTSPFHTTRQPLFDEPPPRMISTNPLRDTQSPGSLPPPPSRYNQRQQFFEQQKAVTGGSLP
HLGNGTYGSSDDNIVGQTKNLSLGPSTPTRSADHEEALFKELVDFANTKSSPSSKPSRPF