; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G025220 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G025220
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionTOM1-like protein 4
Genome locationCmo_Chr04:18562749..18571721
RNA-Seq ExpressionCmoCh04G025220
SyntenyCmoCh04G025220
Gene Ontology termsGO:0043328 - protein transport to vacuole involved in ubiquitin-dependent protein catabolic process via the multivesicular body sorting pathway (biological process)
GO:0016020 - membrane (cellular component)
GO:0035091 - phosphatidylinositol binding (molecular function)
GO:0043130 - ubiquitin binding (molecular function)
InterPro domainsIPR002014 - VHS domain
IPR004152 - GAT domain
IPR008942 - ENTH/VHS
IPR038425 - GAT domain superfamily
IPR044836 - TOM1-like protein, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578929.1 TOM1-like protein 3, partial [Cucurbita argyrosperma subsp. sororia]4.7e-24961.7Show/hide
Query:  NPIHTAEDCIIFRGLDSAVDDDDSQSESGVCSPTLWGSDSRMNPQFHRSRNRTLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHL----RQ
        NPIHTAEDCIIFRG DSA  DDDSQSESGVCSPTLWGS+SR N QFHR RNR+LSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHL    RQ
Subjt:  NPIHTAEDCIIFRGLDSAVDDDDSQSESGVCSPTLWGSDSRMNPQFHRSRNRTLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHL----RQ

Query:  QESASIEKYDSASETSFIRDPIKKRSETRALVTRSRSVNSGGFYLKMFFPMPLGKISTKKKSNVRSDSALNGSSRVSPKPPQVDRDWWRKRSSATSGENG
           AS+ + DSASETSF RD  K RSETRALVTRSRSV+SGGFYLKMF P+P G++S KKK N+R+DS LN SSRVSPKPP VDR+WWRKRS      N 
Subjt:  QESASIEKYDSASETSFIRDPIKKRSETRALVTRSRSVNSGGFYLKMFFPMPLGKISTKKKSNVRSDSALNGSSRVSPKPPQVDRDWWRKRSSATSGENG

Query:  GSVSGSSSSNSNSTSSERS-NSRS------------------DPSIRFSIRVEAVKRSTTPRVAGSQTERDPWPVLCSEE--SPSPARKHAVVVWKSRQP
        GSVSG S   SNSTSSERS NS S                  DPS +   RV+ + +ST P +      R  W     +     SPARK           
Subjt:  GSVSGSSSSNSNSTSSERS-NSRS------------------DPSIRFSIRVEAVKRSTTPRVAGSQTERDPWPVLCSEE--SPSPARKHAVVVWKSRQP

Query:  FAEIFPTETSTKSNSDTKGSNQKRNGIKGELLMAMISDRLLFHGSTYFISLLKIRRDMSTN--AAACAERATNDGLRAPDWAINIELCDIINMDPRQAKD
                   + +S T+                            + I + KI+RDMSTN  AAACAERATND L APDWAINIELCDIINMDPRQAKD
Subjt:  FAEIFPTETSTKSNSDTKGSNQKRNGIKGELLMAMISDRLLFHGSTYFISLLKIRRDMSTN--AAACAERATNDGLRAPDWAINIELCDIINMDPRQAKD

Query:  ALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLNVRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK------
        ALKILKKRL SKNPK QLLAL+AL+ALS+NC DT+ KL+V+RNILHEM+KIVKKKPD NVRDKIL LVDAW A FGG SKGK PQYYAAYNELK      
Subjt:  ALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLNVRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK------

Query:  ----------------------------------------------------GLADVLWEMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENT
                                                            GLADVL E+LGALD KTPEALK+EVIVDLVDQCRSYQ RV+ILVNE+T
Subjt:  ----------------------------------------------------GLADVLWEMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENT

Query:  DEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTE-PPIPSVPFMNPEDDGSEDDFTQLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSV
        DEELLCQGLVLND+LQRVLS HDDIAKGTF  EARRTE PP+PSVP+M+PE+D SEDDFT LARRST+D+ + +D KLANGQSSRVSP+ SPS K TT  
Subjt:  DEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTE-PPIPSVPFMNPEDDGSEDDFTQLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSV

Query:  EMIDHLS-DAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTNPLPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGT
        EMIDHLS D +K + SP+ VEPPSY+    PPSP TSS  P  TRQP+F EPP RS+STNP P+A     +PS LPPPPSRYNQRQQYFEQQK+VT GGT
Subjt:  EMIDHLS-DAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTNPLPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGT

Query:  LPHLSNTRGS------SHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRP
         PHLSN   S      +  N SL PSTP R+AEHEEALFKDL+ F++   SSSS+SNRP
Subjt:  LPHLSNTRGS------SHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRP

KAG6602143.1 TOM1-like protein 3, partial [Cucurbita argyrosperma subsp. sororia]1.1e-22986.61Show/hide
Query:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN
        MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN
Subjt:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN

Query:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK----------------------------------------------------------GLADVLW
        VRDKIL+LVDAW AAFGGNSKGKCPQYYAAY ELK                                                          GLADVLW
Subjt:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK----------------------------------------------------------GLADVLW

Query:  EMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDFT
        EMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDFT
Subjt:  EMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDFT

Query:  QLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTNP
        QL+RRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLP NTRQPMFEEPPLRSISTNP
Subjt:  QLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTNP

Query:  LPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRPL
        LP AHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSS DNT LSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRPL
Subjt:  LPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRPL

KAG7016452.1 TOM1-like protein 2 [Cucurbita argyrosperma subsp. argyrosperma]7.3e-24260.35Show/hide
Query:  NPIHTAEDCIIFRGLDSAVDDDDSQSESGVCSPTLWGSDSRMNPQFHRSRNRTLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHL----RQ
        NPIHTAEDCIIFRG DSA  DDDSQSESGVCSPTLWGS+SR N QFHR RNR+LSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHL    RQ
Subjt:  NPIHTAEDCIIFRGLDSAVDDDDSQSESGVCSPTLWGSDSRMNPQFHRSRNRTLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHL----RQ

Query:  QESASIEKYDSASETSFIRDPIKKRSETRALVTRSRSVNSGGFYLKMFFPMPLGKISTKKKSNVRSDSALNGSSRVSPKPPQVDRDWWRKRSSATSGENG
           AS+ + DSASETSF RD  K RSETRALVTRSRSV+SGGFYLKMF P+P G++S KKK N+R+DS LN SSRVSPKPP VDR+WWRKRS      N 
Subjt:  QESASIEKYDSASETSFIRDPIKKRSETRALVTRSRSVNSGGFYLKMFFPMPLGKISTKKKSNVRSDSALNGSSRVSPKPPQVDRDWWRKRSSATSGENG

Query:  GSVSGSSSSNSNSTSSERS-NSRS------------------DPSIRFSIRVEAVKRSTTPRVAGSQTERDPWPVLCSEE--SPSPARKHAVVVWKSRQP
        GSVSG S   SNSTSSERS NS S                  DPS +   RV+ + +ST P +      R  W     +     SPARK           
Subjt:  GSVSGSSSSNSNSTSSERS-NSRS------------------DPSIRFSIRVEAVKRSTTPRVAGSQTERDPWPVLCSEE--SPSPARKHAVVVWKSRQP

Query:  FAEIFPTETSTKSNSDTKGSNQKRNGIKGELLMAMISDRLLFHGSTYFISLLKIRRDMSTN--AAACAERATNDGLRAPDWAINIELCDIINMDPRQAKD
                 ST  N+        R                    + + I + KI+RDMSTN  AAACAERATND L APDWAINIELCDIINMDPRQAKD
Subjt:  FAEIFPTETSTKSNSDTKGSNQKRNGIKGELLMAMISDRLLFHGSTYFISLLKIRRDMSTN--AAACAERATNDGLRAPDWAINIELCDIINMDPRQAKD

Query:  ALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKK-PDLNVRDKILALVDAWHAAFGGNSKGK-----------------
        ALKILKKRL SKNPK QLLAL+AL+ALS+NC DT+ KL+V+RNILHEM+KIVKKK PD NVRDKIL LVDAW A FGG SKG                  
Subjt:  ALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKK-PDLNVRDKILALVDAWHAAFGGNSKGK-----------------

Query:  -------------------------CPQYYAAYNEL---------------------KGLADVLWEMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMI
                                   Q+ + Y++L                     +GLADVL E+L ALD KTPEALK+EVIVDLVDQCRSYQ RV+I
Subjt:  -------------------------CPQYYAAYNEL---------------------KGLADVLWEMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMI

Query:  LVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTE-PPIPSVPFMNPEDDGSEDDFTQLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSS
        LVNE+TDEELLCQGLVLND+LQRVLS HDDIAKGTF  EARRTE PP+PSVP+M+PE+D SEDDFT LARRST+D+ + +D KLANGQSSRVSP+ SPS 
Subjt:  LVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTE-PPIPSVPFMNPEDDGSEDDFTQLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSS

Query:  KSTTSVEMIDHLS-DAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTNPLPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKS
        K TT  EMIDHLS D +K + SP+ VEPPSY+    PPSP TSS  P  TRQP+F EPP RS+STNP P+A     +PS LPPPPSRYNQRQQYFEQQK+
Subjt:  KSTTSVEMIDHLS-DAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTNPLPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKS

Query:  VTGGGTLPHLSNTRGS------SHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRP
        VT GGT PHLSN   S      +  N SL PSTP R+AEHEEALFKDL+ F++   SSSS+SNRP
Subjt:  VTGGGTLPHLSNTRGS------SHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRP

KAG7032833.1 TOM1-like protein 2 [Cucurbita argyrosperma subsp. argyrosperma]4.9e-23086.61Show/hide
Query:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN
        MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN
Subjt:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN

Query:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK----------------------------------------------------------GLADVLW
        VRDKIL+LVDAW AAFGGNSKGKCPQYYAAY ELK                                                          GLADVLW
Subjt:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK----------------------------------------------------------GLADVLW

Query:  EMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDFT
        EMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDFT
Subjt:  EMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDFT

Query:  QLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTNP
        QL+RRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLP NTRQPMFEEPPLRSISTNP
Subjt:  QLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTNP

Query:  LPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRPL
        LP AHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSS+DNT LSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRPL
Subjt:  LPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRPL

XP_022960666.1 TOM1-like protein 4 [Cucurbita moschata]3.9e-23588.24Show/hide
Query:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN
        MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN
Subjt:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN

Query:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK----------------------------------------------------------GLADVLW
        VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK                                                          GLADVLW
Subjt:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK----------------------------------------------------------GLADVLW

Query:  EMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDFT
        EMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDFT
Subjt:  EMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDFT

Query:  QLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTNP
        QLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTNP
Subjt:  QLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTNP

Query:  LPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRPL
        LPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRPL
Subjt:  LPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRPL

TrEMBL top hitse value%identityAlignment
A0A5D3DJE5 Target of Myb protein 11.0e-16467.19Show/hide
Query:  KIRRDMSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKK
        KI R+MSTNAAACAERATND L APDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPK+QLLAL+ALEALSKNC DT+ KL+V+RNILHEM+KIVKK
Subjt:  KIRRDMSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKK

Query:  K-PDLNVRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK----------------------------------------------------------G
        K PD  VR+KILALVDAW AAFGG SKGK PQYYAAYN+LK                                                          G
Subjt:  K-PDLNVRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK----------------------------------------------------------G

Query:  LADVLWEMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDG
        L DVL EMLGALDPKTPEALK+EVIVDLVDQCRSY  RV+ILVNE TDEELLCQGLVLND+LQRVLS HD+IAKGTF  EARR EPP+PSVP++NPEDD 
Subjt:  LADVLWEMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDG

Query:  SEDDFTQLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLS-DAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPL
        SEDDFT L+RR T+D+ +E+D KLANGQSSRVSP+ SPSSK T SVEMIDHLS D +KP+ SPKIV+PPS            +S  P  TRQP+F+EPP 
Subjt:  SEDDFTQLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLS-DAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPL

Query:  RSISTNPLPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHDN-------TSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSS
        RS+ T+PL T  RD QSPS LPPPPSRYNQRQQYFEQQK+ TGGG  PHLSN    S+DN        SLSP T TRSAEHEEALFKDLV FA+ K SSS
Subjt:  RSISTNPLPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHDN-------TSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSS

Query:  SQSNRP
        S+SNRP
Subjt:  SQSNRP

A0A6J1CJP5 TOM1-like protein 4 isoform X12.7e-16566.4Show/hide
Query:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKK-PDL
        MSTNAAACAERATND L APDWAINIELCDI+NMDPRQ KDALKILKKRL SKNPK QLLAL+ALEALSKNC D + KL+V+R ILHEM+KIVKKK PD 
Subjt:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKK-PDL

Query:  NVRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK----------------------------------------------------------GLADVL
        NVRDKIL LVDAW  AFGG SKGK PQYYAAY ELK                                                          GL+DVL
Subjt:  NVRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK----------------------------------------------------------GLADVL

Query:  WEMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDF
         EMLGALDPKTPEALK+EVIVDLVDQCRSY  RVM+LVNE TDEELLCQGLVLND+LQRVLS HDDIAKGT +   RRTEP +P+VP+MNPEDD S+DDF
Subjt:  WEMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDF

Query:  TQLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLS-DAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLN-TRQPMFEEPPLRSIS
        T LARRST+D+ +E+D KL+NGQSSRVSP+ SPSSK    V+MIDHLS D +KPQ SP+  EPPSY  P+FPPSP TSS  P + TRQP+F+EPP R IS
Subjt:  TQLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLS-DAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLN-TRQPMFEEPPLRSIS

Query:  TNPLPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSN-TRGSSHD-------NTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQS
        TNPL    RDTQSP  LPPPPSRYNQRQQ+FEQQK+VT GG+LPHL N T GSS D       N SL PSTPTRSA+HEEALFK+LV FA  K S SS+ 
Subjt:  TNPLPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSN-TRGSSHD-------NTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQS

Query:  NRP
        +RP
Subjt:  NRP

A0A6J1CL02 TOM1-like protein 4 isoform X21.1e-16666.53Show/hide
Query:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN
        MSTNAAACAERATND L APDWAINIELCDI+NMDPRQ KDALKILKKRL SKNPK QLLAL+ALEALSKNC D + KL+V+R ILHEM+KIVKKKPD N
Subjt:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN

Query:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK----------------------------------------------------------GLADVLW
        VRDKIL LVDAW  AFGG SKGK PQYYAAY ELK                                                          GL+DVL 
Subjt:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK----------------------------------------------------------GLADVLW

Query:  EMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDFT
        EMLGALDPKTPEALK+EVIVDLVDQCRSY  RVM+LVNE TDEELLCQGLVLND+LQRVLS HDDIAKGT +   RRTEP +P+VP+MNPEDD S+DDFT
Subjt:  EMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDFT

Query:  QLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLS-DAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLN-TRQPMFEEPPLRSIST
         LARRST+D+ +E+D KL+NGQSSRVSP+ SPSSK    V+MIDHLS D +KPQ SP+  EPPSY  P+FPPSP TSS  P + TRQP+F+EPP R IST
Subjt:  QLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLS-DAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLN-TRQPMFEEPPLRSIST

Query:  NPLPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSN-TRGSSHD-------NTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSN
        NPL    RDTQSP  LPPPPSRYNQRQQ+FEQQK+VT GG+LPHL N T GSS D       N SL PSTPTRSA+HEEALFK+LV FA  K S SS+ +
Subjt:  NPLPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSN-TRGSSHD-------NTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSN

Query:  RP
        RP
Subjt:  RP

A0A6J1H890 TOM1-like protein 41.9e-23588.24Show/hide
Query:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN
        MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN
Subjt:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN

Query:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK----------------------------------------------------------GLADVLW
        VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK                                                          GLADVLW
Subjt:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK----------------------------------------------------------GLADVLW

Query:  EMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDFT
        EMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDFT
Subjt:  EMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDFT

Query:  QLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTNP
        QLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTNP
Subjt:  QLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTNP

Query:  LPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRPL
        LPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRPL
Subjt:  LPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRPL

A0A6J1JL48 TOM1-like protein 47.7e-22986.41Show/hide
Query:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN
        MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN
Subjt:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN

Query:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK----------------------------------------------------------GLADVLW
        VRDKILALVDAW AAFGGNSKGKCPQYYAAYNELK                                                          GLADVLW
Subjt:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK----------------------------------------------------------GLADVLW

Query:  EMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDFT
        EMLGALDPKTPEALKEEVIVDLVDQCRS+QRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTF MEARRTEPPIPSVPFMNPEDDGSEDDFT
Subjt:  EMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDFT

Query:  QLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTNP
        QLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVE+IDHLSDAHKPQLSP+IVEPPSYSLPIFPPSPQTSSPLP NTRQPMFEEPPLRSISTNP
Subjt:  QLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTNP

Query:  LPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRPL
        LP AHRDTQSPS LPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSS+DNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRPL
Subjt:  LPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRPL

SwissProt top hitse value%identityAlignment
O80910 TOM1-like protein 62.4e-4131.09Show/hide
Query:  STNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLNV
        S +A    ++AT+D L  PDW  N+E+CD +N    QAKD +K +KKRL  K+ +VQLLAL  LE L KNC D +   V E+NIL EM+KIVKKK D+ V
Subjt:  STNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLNV

Query:  RDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK------------------------------------------------------------------
        RDKIL +VD+W  AFGG  +GK PQYY AY+EL+                                                                  
Subjt:  RDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK------------------------------------------------------------------

Query:  ----------------------------GLA-----------DVLWEMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLND
                                    GL+           D+L +ML A+DP   EA+K+EVIVDLV++CRS Q+++M ++    D+ELL +GL LND
Subjt:  ----------------------------GLA-----------DVLWEMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLND

Query:  NLQRVLSDHDDIAKGTFM--------MEARRTEP---------------------PIPSVPF-----MNPEDDGSEDDFTQLARRSTK-------DYTFE
        +LQ +L+ HD IA G+ +        +  + ++P                     PIP+        ++ E +  ED+F QLARR +K       D T  
Subjt:  NLQRVLSDHDDIAKGTFM--------MEARRTEP---------------------PIPSVPF-----MNPEDDGSEDDFTQLARRSTK-------DYTFE

Query:  KDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSPKIVEPPSYSLPIFPPSP--QTSSPLPLNTRQPMFEEP-PLRSISTNPLPTAHRDTQS
        +    A+   +   P   P   +T   +MID LS               +   P  PP+P  Q S P P  + Q     P P     +   P A +  Q 
Subjt:  KDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSPKIVEPPSYSLPIFPPSP--QTSSPLPLNTRQPMFEEP-PLRSISTNPLPTAHRDTQS

Query:  PSLLPPPPSRYNQRQQYFEQQ
        P   P     Y+Q QQ+ +QQ
Subjt:  PSLLPPPPSRYNQRQQYFEQQ

Q6NQK0 TOM1-like protein 44.2e-9144.94Show/hide
Query:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN
        M+ +AAACAERATND L  PDWAINIELCD+INMDP QAK+A+K+LKKRL SKN KVQ+LAL+ALE LSKNC + + +L+++R +L++M+KIVKKKP+LN
Subjt:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN

Query:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK---------------------------------------------------GLADVLWEMLGALD
        VR+KIL L+D W  AFGG   G+ PQYY AYN+L+                                                   G  DVL +MLGA D
Subjt:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK---------------------------------------------------GLADVLWEMLGALD

Query:  PKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAK-GTFMMEARRTE--PPIPSVPF-MNPEDDGSEDDFTQLA
        P  PE+LKEEVIVDLV+QCR+YQRRVM LVN  TDEELLCQGL LNDNLQ VL  HDDIA  G+     R T   PP+  V    + EDD S+D+F +LA
Subjt:  PKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAK-GTFMMEARRTE--PPIPSVPF-MNPEDDGSEDDFTQLA

Query:  RRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLS-DAHKPQ---LSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTN
         RS+                   +P   P   S +   M+D LS D +KPQ    S  + +PP       PP P TSS    ++  P+F++         
Subjt:  RRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLS-DAHKPQ---LSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTN

Query:  PLPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRPL
          P   + ++    LPPPPSR+NQRQQ+FE   S +G       S+  G +  N SL+ S P +  + E+ LFKDLV+FA+ + S ++ +NR L
Subjt:  PLPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRPL

Q8L860 TOM1-like protein 91.7e-5544.57Show/hide
Query:  ACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLNVRDKIL
        A  ERAT++ L  PDWA+N+E+CD++N DP QAKD +K +KKR+ S+NPK QLLAL  LE + KNC D +   V E+ ++HEM++IVKKKPD +V++KIL
Subjt:  ACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLNVRDKIL

Query:  ALVDAWHAAFGGNSKGKCPQYYAAYNEL--------------------------------------------------------------KGLADVLWEM
         L+D W  AFGG  + + PQYYA Y EL                                                              KG+ DVL EM
Subjt:  ALVDAWHAAFGGNSKGKCPQYYAAYNEL--------------------------------------------------------------KGLADVLWEM

Query:  LGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKG
        L AL+P   E LK+EV+VDLV+QCR+Y++RV+ LVN  +DE LLCQGL LND+LQRVL++++ IA G
Subjt:  LGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKG

Q9C9Y1 TOM1-like protein 86.2e-5036.87Show/hide
Query:  ERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLNVRDKILALV
        +RAT+D L  PDWA+N+E+CD++N +P Q ++ +  +KKRLTS+  KVQLLAL  LE +  NC + I   V E++ILH+M+K+ K+KP++ V++KIL L+
Subjt:  ERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLNVRDKILALV

Query:  DAWHAAFGGNSKGKCPQYYAAYNEL---------------------------------------------------------KGLADVLWEMLGALDPKT
        D W  +F G  +G+ PQYYAAY EL                                                         +G+ DVL EM+ A+D   
Subjt:  DAWHAAFGGNSKGKCPQYYAAYNEL---------------------------------------------------------KGLADVLWEMLGALDPKT

Query:  PEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMM---EARRTEPPIPSVPFM----NPEDDGSEDDFT---
         E LK+EV+VDLV QCR+Y++RV+ LVN  +DE +LCQGL LND+LQR+L+ H+ IA G  M+   E  + E P  +   +    +   +GS   +T   
Subjt:  PEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMM---EARRTEPPIPSVPFM----NPEDDGSEDDFT---

Query:  -QLARRSTKDY-TFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSP
         ++   S  D+ T   D+ LA        P SSP +K   S+ +ID LSD +    +P
Subjt:  -QLARRSTKDY-TFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSP

Q9LPL6 TOM1-like protein 36.5e-9244.89Show/hide
Query:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN
        M+ NAAACAERATND L  PDWAINIELCDIINM+P QAK+A+K+LKKRL SKN KVQ+LAL+ALE LSKNC +++ +L+V+R+IL +M+KIVKKKPDL 
Subjt:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN

Query:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK-----------------------------------------------------------GLADVL
        VR+KIL+L+D W  AFGG S G+ PQYY AYNEL+                                                           G  DVL
Subjt:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK-----------------------------------------------------------GLADVL

Query:  WEMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDF
         +MLGALDP  PE LKEE+IVDLV+QCR+YQRRVM LVN  +DEEL+CQGL LNDNLQRVL  HDD AKG  +     T  P+ S+   + +DD S+DDF
Subjt:  WEMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDF

Query:  TQLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEM----IDHLS-DAHKPQLSPKIVEPPS--------YSLPIF-PPSPQTSSPLPLNTRQ
         QLA RS      +++S    GQ +  +P+  P   S   V +    +D LS D +KPQ + + V+PPS        YS PIF  P PQ+ SP      +
Subjt:  TQLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEM----IDHLS-DAHKPQLSPKIVEPPS--------YSLPIF-PPSPQTSSPLPLNTRQ

Query:  PMFEEPPLRSISTNPLPTAHRDTQSPSLLPPPPS-RYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHD-------NTSLSPST-------PTRSAEHEEA
        P++++       T  LP A  +TQ P   PP  S R N+R +YF+            H S+   SS+D       N SL+P+        P +  + E+ 
Subjt:  PMFEEPPLRSISTNPLPTAHRDTQSPSLLPPPPS-RYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHD-------NTSLSPST-------PTRSAEHEEA

Query:  LFKDLVKFAQFKLSSSSQS
        LFKDL+ FA+ + SSSS S
Subjt:  LFKDLVKFAQFKLSSSSQS

Arabidopsis top hitse value%identityAlignment
AT1G21380.1 Target of Myb protein 14.6e-9344.89Show/hide
Query:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN
        M+ NAAACAERATND L  PDWAINIELCDIINM+P QAK+A+K+LKKRL SKN KVQ+LAL+ALE LSKNC +++ +L+V+R+IL +M+KIVKKKPDL 
Subjt:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN

Query:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK-----------------------------------------------------------GLADVL
        VR+KIL+L+D W  AFGG S G+ PQYY AYNEL+                                                           G  DVL
Subjt:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK-----------------------------------------------------------GLADVL

Query:  WEMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDF
         +MLGALDP  PE LKEE+IVDLV+QCR+YQRRVM LVN  +DEEL+CQGL LNDNLQRVL  HDD AKG  +     T  P+ S+   + +DD S+DDF
Subjt:  WEMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDF

Query:  TQLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEM----IDHLS-DAHKPQLSPKIVEPPS--------YSLPIF-PPSPQTSSPLPLNTRQ
         QLA RS      +++S    GQ +  +P+  P   S   V +    +D LS D +KPQ + + V+PPS        YS PIF  P PQ+ SP      +
Subjt:  TQLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEM----IDHLS-DAHKPQLSPKIVEPPS--------YSLPIF-PPSPQTSSPLPLNTRQ

Query:  PMFEEPPLRSISTNPLPTAHRDTQSPSLLPPPPS-RYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHD-------NTSLSPST-------PTRSAEHEEA
        P++++       T  LP A  +TQ P   PP  S R N+R +YF+            H S+   SS+D       N SL+P+        P +  + E+ 
Subjt:  PMFEEPPLRSISTNPLPTAHRDTQSPSLLPPPPS-RYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHD-------NTSLSPST-------PTRSAEHEEA

Query:  LFKDLVKFAQFKLSSSSQS
        LFKDL+ FA+ + SSSS S
Subjt:  LFKDLVKFAQFKLSSSSQS

AT1G76970.1 Target of Myb protein 13.0e-9244.94Show/hide
Query:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN
        M+ +AAACAERATND L  PDWAINIELCD+INMDP QAK+A+K+LKKRL SKN KVQ+LAL+ALE LSKNC + + +L+++R +L++M+KIVKKKP+LN
Subjt:  MSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLN

Query:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK---------------------------------------------------GLADVLWEMLGALD
        VR+KIL L+D W  AFGG   G+ PQYY AYN+L+                                                   G  DVL +MLGA D
Subjt:  VRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELK---------------------------------------------------GLADVLWEMLGALD

Query:  PKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAK-GTFMMEARRTE--PPIPSVPF-MNPEDDGSEDDFTQLA
        P  PE+LKEEVIVDLV+QCR+YQRRVM LVN  TDEELLCQGL LNDNLQ VL  HDDIA  G+     R T   PP+  V    + EDD S+D+F +LA
Subjt:  PKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAK-GTFMMEARRTE--PPIPSVPF-MNPEDDGSEDDFTQLA

Query:  RRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLS-DAHKPQ---LSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTN
         RS+                   +P   P   S +   M+D LS D +KPQ    S  + +PP       PP P TSS    ++  P+F++         
Subjt:  RRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLS-DAHKPQ---LSPKIVEPPSYSLPIFPPSPQTSSPLPLNTRQPMFEEPPLRSISTN

Query:  PLPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRPL
          P   + ++    LPPPPSR+NQRQQ+FE   S +G       S+  G +  N SL+ S P +  + E+ LFKDLV+FA+ + S ++ +NR L
Subjt:  PLPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSSSQSNRPL

AT3G08790.1 ENTH/VHS/GAT family protein4.4e-5136.87Show/hide
Query:  ERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLNVRDKILALV
        +RAT+D L  PDWA+N+E+CD++N +P Q ++ +  +KKRLTS+  KVQLLAL  LE +  NC + I   V E++ILH+M+K+ K+KP++ V++KIL L+
Subjt:  ERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLNVRDKILALV

Query:  DAWHAAFGGNSKGKCPQYYAAYNEL---------------------------------------------------------KGLADVLWEMLGALDPKT
        D W  +F G  +G+ PQYYAAY EL                                                         +G+ DVL EM+ A+D   
Subjt:  DAWHAAFGGNSKGKCPQYYAAYNEL---------------------------------------------------------KGLADVLWEMLGALDPKT

Query:  PEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMM---EARRTEPPIPSVPFM----NPEDDGSEDDFT---
         E LK+EV+VDLV QCR+Y++RV+ LVN  +DE +LCQGL LND+LQR+L+ H+ IA G  M+   E  + E P  +   +    +   +GS   +T   
Subjt:  PEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKGTFMM---EARRTEPPIPSVPFM----NPEDDGSEDDFT---

Query:  -QLARRSTKDY-TFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSP
         ++   S  D+ T   D+ LA        P SSP +K   S+ +ID LSD +    +P
Subjt:  -QLARRSTKDY-TFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSP

AT4G32760.1 ENTH/VHS/GAT family protein1.2e-5644.57Show/hide
Query:  ACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLNVRDKIL
        A  ERAT++ L  PDWA+N+E+CD++N DP QAKD +K +KKR+ S+NPK QLLAL  LE + KNC D +   V E+ ++HEM++IVKKKPD +V++KIL
Subjt:  ACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLNVRDKIL

Query:  ALVDAWHAAFGGNSKGKCPQYYAAYNEL--------------------------------------------------------------KGLADVLWEM
         L+D W  AFGG  + + PQYYA Y EL                                                              KG+ DVL EM
Subjt:  ALVDAWHAAFGGNSKGKCPQYYAAYNEL--------------------------------------------------------------KGLADVLWEM

Query:  LGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKG
        L AL+P   E LK+EV+VDLV+QCR+Y++RV+ LVN  +DE LLCQGL LND+LQRVL++++ IA G
Subjt:  LGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKG

AT4G32760.2 ENTH/VHS/GAT family protein1.2e-5644.57Show/hide
Query:  ACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLNVRDKIL
        A  ERAT++ L  PDWA+N+E+CD++N DP QAKD +K +KKR+ S+NPK QLLAL  LE + KNC D +   V E+ ++HEM++IVKKKPD +V++KIL
Subjt:  ACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPDLNVRDKIL

Query:  ALVDAWHAAFGGNSKGKCPQYYAAYNEL--------------------------------------------------------------KGLADVLWEM
         L+D W  AFGG  + + PQYYA Y EL                                                              KG+ DVL EM
Subjt:  ALVDAWHAAFGGNSKGKCPQYYAAYNEL--------------------------------------------------------------KGLADVLWEM

Query:  LGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKG
        L AL+P   E LK+EV+VDLV+QCR+Y++RV+ LVN  +DE LLCQGL LND+LQRVL++++ IA G
Subjt:  LGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDIAKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGATCTTACGATGAAATTCCCAAGGAATCCGATCCATACCGCCGAAGATTGCATAATTTTCAGAGGCTTGGACAGTGCCGTCGACGATGACGACTCGCAATCGGA
ATCTGGAGTCTGTTCACCCACGCTCTGGGGTTCAGATTCTCGGATGAACCCCCAATTTCATCGCTCGCGTAATCGCACTCTTTCCCCGACTTCCCGCACTCAAGCCATCG
CCAGAGGACAGCAGGAGCTCATGGAGATGGTCAGGAACATGCCGGAATCCTCCTACGAGCTTTCTCTCAAAGATCTCGTCGAACATCACTTGCGGCAACAAGAGAGCGCC
TCCATCGAGAAATATGATTCCGCCTCTGAAACTTCCTTCATAAGAGATCCTATCAAGAAGAGGAGTGAAACCAGAGCACTCGTTACCAGAAGTAGAAGCGTCAATAGCGG
TGGATTTTATCTCAAAATGTTCTTCCCGATGCCGTTAGGGAAAATTTCAACGAAAAAGAAGAGTAATGTTAGAAGCGACTCGGCGTTGAATGGAAGTTCGAGAGTATCTC
CTAAGCCACCGCAAGTGGACAGAGACTGGTGGAGGAAGAGATCGTCAGCAACCTCCGGCGAGAATGGCGGCAGTGTCTCCGGCAGTAGTAGTAGCAATAGTAATAGCACA
AGCAGCGAAAGAAGCAACAGCAGATCCGACCCATCGATCCGCTTTTCCATTCGGGTTGAAGCGGTCAAAAGATCTACTACGCCACGTGTCGCAGGCTCGCAAACCGAGCG
GGACCCATGGCCGGTGCTCTGCTCAGAAGAGTCTCCATCTCCAGCAAGAAAACACGCGGTTGTAGTGTGGAAATCGCGTCAGCCCTTTGCCGAAATCTTCCCCACGGAAA
CTTCGACAAAGTCAAACTCAGATACCAAAGGAAGTAACCAGAAGAGAAATGGTATAAAGGGAGAGTTATTGATGGCAATGATTTCTGATAGACTTCTGTTCCACGGCAGC
ACTTACTTTATTTCACTTCTTAAGATTCGGAGGGACATGTCTACCAATGCTGCTGCTTGCGCTGAGAGAGCAACAAATGATGGGCTTAGAGCTCCTGATTGGGCCATAAA
CATTGAGCTCTGTGATATCATCAACATGGATCCTAGGCAAGCAAAGGATGCATTGAAGATACTCAAGAAGCGACTGACTAGCAAGAATCCTAAAGTACAACTTCTAGCTC
TCCATGCATTGGAGGCTCTAAGCAAAAATTGCAGTGATACCATTTGTAAGCTGGTCGTGGAACGTAATATCCTGCACGAAATGCTTAAAATTGTGAAAAAGAAGCCTGAT
TTAAATGTACGGGACAAAATATTAGCTCTGGTAGATGCATGGCATGCAGCATTTGGTGGCAACTCCAAGGGGAAGTGTCCACAGTATTATGCAGCCTACAATGAATTGAA
GGGACTAGCAGATGTTCTATGGGAAATGCTTGGCGCATTGGATCCTAAGACTCCGGAGGCTCTGAAGGAAGAAGTGATTGTAGATCTTGTTGATCAATGCCGTTCATACC
AACGCCGTGTCATGATACTTGTGAATGAGAACACAGATGAGGAACTGTTATGTCAAGGATTGGTGTTGAATGACAATCTGCAGCGTGTACTCAGTGACCACGACGACATT
GCAAAAGGAACATTCATGATGGAAGCTAGAAGAACAGAACCTCCCATTCCATCGGTTCCATTTATGAACCCTGAGGACGATGGTTCGGAAGATGATTTTACACAGTTAGC
TCGCAGGTCAACCAAAGATTACACTTTTGAAAAGGACAGCAAACTGGCAAACGGTCAATCATCTCGAGTTAGTCCAGTTTCTTCACCCTCATCAAAGTCGACAACTAGTG
TAGAAATGATCGATCATCTCAGTGACGCACACAAACCTCAACTGTCTCCAAAGATTGTAGAGCCACCATCTTATTCCCTTCCAATTTTCCCACCCTCGCCTCAAACTTCA
TCTCCCTTACCTTTAAACACTAGACAGCCTATGTTTGAGGAACCACCTCTGAGAAGCATATCCACGAATCCACTTCCCACAGCACATCGGGACACTCAATCTCCAAGCCT
CCTCCCTCCCCCACCCTCGAGATATAACCAAAGACAACAATATTTTGAGCAACAAAAGTCAGTTACAGGAGGAGGCACTCTGCCCCATTTGAGCAATACCCGTGGTTCTT
CGCATGATAACACGTCGCTCAGTCCTTCCACCCCAACCAGATCTGCAGAGCATGAAGAAGCCCTTTTCAAAGATCTGGTGAAGTTTGCTCAGTTTAAGTTATCTTCATCG
TCGCAATCCAACCGACCATTGTGA
mRNA sequenceShow/hide mRNA sequence
ACAAATCATTTCATTTCAACTGCTTCCTCTCCAATTCCTTCCATCCAAGATTCCCCGGAGCAACCGCCATGGGAGATCTTACGATGAAATTCCCAAGGAATCCGATCCAT
ACCGCCGAAGATTGCATAATTTTCAGAGGCTTGGACAGTGCCGTCGACGATGACGACTCGCAATCGGAATCTGGAGTCTGTTCACCCACGCTCTGGGGTTCAGATTCTCG
GATGAACCCCCAATTTCATCGCTCGCGTAATCGCACTCTTTCCCCGACTTCCCGCACTCAAGCCATCGCCAGAGGACAGCAGGAGCTCATGGAGATGGTCAGGAACATGC
CGGAATCCTCCTACGAGCTTTCTCTCAAAGATCTCGTCGAACATCACTTGCGGCAACAAGAGAGCGCCTCCATCGAGAAATATGATTCCGCCTCTGAAACTTCCTTCATA
AGAGATCCTATCAAGAAGAGGAGTGAAACCAGAGCACTCGTTACCAGAAGTAGAAGCGTCAATAGCGGTGGATTTTATCTCAAAATGTTCTTCCCGATGCCGTTAGGGAA
AATTTCAACGAAAAAGAAGAGTAATGTTAGAAGCGACTCGGCGTTGAATGGAAGTTCGAGAGTATCTCCTAAGCCACCGCAAGTGGACAGAGACTGGTGGAGGAAGAGAT
CGTCAGCAACCTCCGGCGAGAATGGCGGCAGTGTCTCCGGCAGTAGTAGTAGCAATAGTAATAGCACAAGCAGCGAAAGAAGCAACAGCAGATCCGACCCATCGATCCGC
TTTTCCATTCGGGTTGAAGCGGTCAAAAGATCTACTACGCCACGTGTCGCAGGCTCGCAAACCGAGCGGGACCCATGGCCGGTGCTCTGCTCAGAAGAGTCTCCATCTCC
AGCAAGAAAACACGCGGTTGTAGTGTGGAAATCGCGTCAGCCCTTTGCCGAAATCTTCCCCACGGAAACTTCGACAAAGTCAAACTCAGATACCAAAGGAAGTAACCAGA
AGAGAAATGGTATAAAGGGAGAGTTATTGATGGCAATGATTTCTGATAGACTTCTGTTCCACGGCAGCACTTACTTTATTTCACTTCTTAAGATTCGGAGGGACATGTCT
ACCAATGCTGCTGCTTGCGCTGAGAGAGCAACAAATGATGGGCTTAGAGCTCCTGATTGGGCCATAAACATTGAGCTCTGTGATATCATCAACATGGATCCTAGGCAAGC
AAAGGATGCATTGAAGATACTCAAGAAGCGACTGACTAGCAAGAATCCTAAAGTACAACTTCTAGCTCTCCATGCATTGGAGGCTCTAAGCAAAAATTGCAGTGATACCA
TTTGTAAGCTGGTCGTGGAACGTAATATCCTGCACGAAATGCTTAAAATTGTGAAAAAGAAGCCTGATTTAAATGTACGGGACAAAATATTAGCTCTGGTAGATGCATGG
CATGCAGCATTTGGTGGCAACTCCAAGGGGAAGTGTCCACAGTATTATGCAGCCTACAATGAATTGAAGGGACTAGCAGATGTTCTATGGGAAATGCTTGGCGCATTGGA
TCCTAAGACTCCGGAGGCTCTGAAGGAAGAAGTGATTGTAGATCTTGTTGATCAATGCCGTTCATACCAACGCCGTGTCATGATACTTGTGAATGAGAACACAGATGAGG
AACTGTTATGTCAAGGATTGGTGTTGAATGACAATCTGCAGCGTGTACTCAGTGACCACGACGACATTGCAAAAGGAACATTCATGATGGAAGCTAGAAGAACAGAACCT
CCCATTCCATCGGTTCCATTTATGAACCCTGAGGACGATGGTTCGGAAGATGATTTTACACAGTTAGCTCGCAGGTCAACCAAAGATTACACTTTTGAAAAGGACAGCAA
ACTGGCAAACGGTCAATCATCTCGAGTTAGTCCAGTTTCTTCACCCTCATCAAAGTCGACAACTAGTGTAGAAATGATCGATCATCTCAGTGACGCACACAAACCTCAAC
TGTCTCCAAAGATTGTAGAGCCACCATCTTATTCCCTTCCAATTTTCCCACCCTCGCCTCAAACTTCATCTCCCTTACCTTTAAACACTAGACAGCCTATGTTTGAGGAA
CCACCTCTGAGAAGCATATCCACGAATCCACTTCCCACAGCACATCGGGACACTCAATCTCCAAGCCTCCTCCCTCCCCCACCCTCGAGATATAACCAAAGACAACAATA
TTTTGAGCAACAAAAGTCAGTTACAGGAGGAGGCACTCTGCCCCATTTGAGCAATACCCGTGGTTCTTCGCATGATAACACGTCGCTCAGTCCTTCCACCCCAACCAGAT
CTGCAGAGCATGAAGAAGCCCTTTTCAAAGATCTGGTGAAGTTTGCTCAGTTTAAGTTATCTTCATCGTCGCAATCCAACCGACCATTGTGAGACTGGATTAGCAAAGAC
TGACGTTTTGGTGTTTTTCTTATTGTTGTTTTAACGTAGGCAGGTGACGAGGTTGTTGTATATTCATGGACAAAGCCATCCATTTTGCTTTGATGGATGTTAGAGTAAGC
CACTGATCTCTGCATATATTATATGATATATTCATTTAGTATCTCTCTTTTGTTTCATAGCAATTATACACCCTGATCTCCTTTCCGCCTCCCCTCGCCAGAGGTTCAAA
TCTTTCTATTGAACTTAAAAAAAAAAAAGAATACTAAAGTTCATGGATTAAATGGTTTAAAAGTATATTGTCATTTACACAAGAATTGCAGGAACAAATCTGCCTTTTTT
TTTTTTTTGGGCCAAAGAATAAACTGACCCACAGAATTACCTCTCCCATACATTCAACAACCCAAGAGTACTCTTTTGCTCCTTCCACAAAGGAATTACATTAAATGGAT
GTTAAAACTTAAAAGATAGCTACTTCTTGGAGATTAGCAGTTTACCAGTTTATGCAATGAAGTTCAGAGAAGGAAGGCAAAGAGAGCTTGAAAAATTGATTTGGGAGTTC
TCAACTTTTTAGGAGAGGACACATCCTGGCTTGCCCTTAGGAGCCCTTGCATATGATGATGCCTCCCCTGAAAAATGCAGGCAAAAGTAAAGGATTGAAACAGAAGGCAT
CAAGGTTTGTAGATCAAGCATTCAATGAAATTTTCATAATATCTTAAGTTTACCTGTACAGCATCCTTGAGTGGCCCCCAGCATTCGAGTCGATTATACTTTTGTGAACT
GGAATGGAAATATACCCTGACTAAGGCATCTAGCAGCTCGGGATTCTCCCTAAATAGAATTGAATCTCCATATAAGACAGATTCAAGTACCTGGTTTTACAAAAACATGT
GATTTTAGCTTTATACCAATCTTAACCATTGCTATGATATGCACTTCCATCATGATAGGTTAGGCCTGAGAGTTCTCCAAACAGCCAAGGGAAAGTTTGTTTGGTTTCAG
GTTTTTGGTTTTTGAAACCTGTGTTTGTAACAATCCAAGTAAATATCGTTTTCTTTGAGCTTTCCCTCGAGATTTTAAAACGTGTCTACTAGAGAGAGGTTTTCACACCC
TTATAAGAAATGCTTCGTTCTCCTCTCCAACCAATGTGGGATCTCACAATAAAGGTTGTGGTCGGAGTAGGGCTAGACCCTCTACATAGTAGATGGATTTAAAAACCTTG
AGGAGAAGCCCGAAAAGAAAATCCCAAAAAGGACAATATCTAATAGCGATAATCTTGGGCTGTTACAAATGGTATCAGAGCCAATTGTCAGACGGTGCGAGCCACTGATT
GGAGTAGGGTTAAGACCCTCTCTTTAGTAGACGCGTTTTAAAATCTTGAGGAAAAATCCAAAAGGAAAAGCCCAAAGAAGACAATATCTACTAACGGTGGGCTTACACTA
AGAGTTTTGGGCTGCATCGAATAATTTTTCCAACAAATTTGGACATGAGTTATAATGGAATGTGGGAAAAGAAAAACTTACAAGAGGCAATTCTTTGGAGAAAATGTGGT
ATCTAAACTCAGCAGTAAGATCAAGCAAAGAATTAGGACCGCTCACATAGCAATGTACATGAAGACACATTTCATCTTTTACTTTCTTCCACTCAGCAACCACATCATCC
TTGTTGTACCAACCTCTCAACTGTATGCATTCACACACCCAAATATAAGAACAGAAACACAATGTGTTGTTCCACACACAGTCACACACATTAAGTAGCAATAAGTTCGT
ACCTGATCAAGGTTGATAACATCAGAGATGGTTAAGGTGAGATTAGCGGTGAAGTCGCAATGCGATAGAATATACGTTCTTGGGGTGATTCCTGCACACTTGTTCACCTC
CCTAGTGAATTCAACTTTTAACTTTGAAGCTTCAAATCTTGCTGGAGGAACCAATATCCTGACAGCCTACCAAAGATCAAAGCATCATAAGACAGAGTAACACAATTTTG
ACGGAACACAGCGTCTACTGTTCAAAATGTTAGAAATCACGGACCTTCACAATGGTATAATATTGTCCATTTTGAGCATAAGCTCTTACGGTCTTGCTTTTGTTGAGATT
GTGTTCTTTACTTATAAACTCATGATCAACCCCTTAATTAGCTGATGTGGGACTTCTCTCCCAACAATTCTCAACAATTTTTCCCGCAAACAAAGTACACCATACAACCT
CCCCGGAGGTCTATGGAGCCCTTGAACAGCCTCTCCTTAATCGAGGCTCGACTCTTTCTCTAGAGCTCTCGAACAAAGTACACCATTTGTTAACAAACTCCCCTAAATCG
AGGCTTAAATCCTTCTTTGGAGCCATCGAACAAAGTGCACCATTTTTTCAACACTCGAGTCACTTTTTACTACACCTTTGAGGCTCACAACTTTTTTGTTCGATATTTGA
GGATTTTATTAACATGGTTAAGTTAAATGCATGACTTTGATACCATGTTAGAAATCATGAACCTTCACAATGGTACGATATTATCCACTTTGAGCATAAACTCCTCTTGA
TTTTCCCAAAAGACCTCGTAGCAATAGAGATAGTGTTTCTTACTTATAAACTCATGATCTAACCCAACCATTGTAGTTCACTTCTTAAGCTTGTTTACAGAAAAAGTACG
TTGCATACAGAAAATTCAGCAACGCATGGAATTCAAGGCAATGAGACAAGAATGTAAGCCAGAACTTAGAATCAACACGAAACCCACAAATTTTGTCAATTAGTTTCTTC
AGCAAAGTTAGACATACAATCTGGATCAAACCCGACATTCATCAACAGAATGCAAACAAGCCTGAGCCCTCAAGTGTAATGATAAGAACAACTCCCAC
Protein sequenceShow/hide protein sequence
MGDLTMKFPRNPIHTAEDCIIFRGLDSAVDDDDSQSESGVCSPTLWGSDSRMNPQFHRSRNRTLSPTSRTQAIARGQQELMEMVRNMPESSYELSLKDLVEHHLRQQESA
SIEKYDSASETSFIRDPIKKRSETRALVTRSRSVNSGGFYLKMFFPMPLGKISTKKKSNVRSDSALNGSSRVSPKPPQVDRDWWRKRSSATSGENGGSVSGSSSSNSNST
SSERSNSRSDPSIRFSIRVEAVKRSTTPRVAGSQTERDPWPVLCSEESPSPARKHAVVVWKSRQPFAEIFPTETSTKSNSDTKGSNQKRNGIKGELLMAMISDRLLFHGS
TYFISLLKIRRDMSTNAAACAERATNDGLRAPDWAINIELCDIINMDPRQAKDALKILKKRLTSKNPKVQLLALHALEALSKNCSDTICKLVVERNILHEMLKIVKKKPD
LNVRDKILALVDAWHAAFGGNSKGKCPQYYAAYNELKGLADVLWEMLGALDPKTPEALKEEVIVDLVDQCRSYQRRVMILVNENTDEELLCQGLVLNDNLQRVLSDHDDI
AKGTFMMEARRTEPPIPSVPFMNPEDDGSEDDFTQLARRSTKDYTFEKDSKLANGQSSRVSPVSSPSSKSTTSVEMIDHLSDAHKPQLSPKIVEPPSYSLPIFPPSPQTS
SPLPLNTRQPMFEEPPLRSISTNPLPTAHRDTQSPSLLPPPPSRYNQRQQYFEQQKSVTGGGTLPHLSNTRGSSHDNTSLSPSTPTRSAEHEEALFKDLVKFAQFKLSSS
SQSNRPL