; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0014958 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0014958
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Descriptiontransportin-3 isoform X1
Genome locationchr06:18827888..18886219
RNA-Seq ExpressionPay0014958
SyntenyPay0014958
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR011989 - Armadillo-like helical
IPR012337 - Ribonuclease H-like superfamily
IPR016024 - Armadillo-type fold
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008449959.1 PREDICTED: transportin-3 isoform X1 [Cucumis melo]0.0e+00100Show/hide
Query:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL
        APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL
Subjt:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL

Query:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV
        IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV
Subjt:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV

Query:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
        GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
Subjt:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA

Query:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA
        RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA
Subjt:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA

Query:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI
        IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI
Subjt:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI

Query:  LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA
        LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA
Subjt:  LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA

Query:  AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDEEANYGHMQGKGGRVLKRLVREFADGHRNIPNMT
        AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDEEANYGHMQGKGGRVLKRLVREFADGHRNIPNMT
Subjt:  AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDEEANYGHMQGKGGRVLKRLVREFADGHRNIPNMT

XP_008449960.1 PREDICTED: transportin-3 isoform X2 [Cucumis melo]0.0e+00100Show/hide
Query:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL
        APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL
Subjt:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL

Query:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV
        IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV
Subjt:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV

Query:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
        GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
Subjt:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA

Query:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA
        RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA
Subjt:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA

Query:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI
        IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI
Subjt:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI

Query:  LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA
        LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA
Subjt:  LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA

Query:  AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDEEANYGHMQGKGGRVLKRLVREFADGHRNIPNMT
        AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDEEANYGHMQGKGGRVLKRLVREFADGHRNIPNMT
Subjt:  AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDEEANYGHMQGKGGRVLKRLVREFADGHRNIPNMT

XP_011651341.1 transportin-3 isoform X1 [Cucumis sativus]0.0e+0097.7Show/hide
Query:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL
        APSLIV+ASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNS NKKHVEDVFLSVFSALLDGLLLRAQV+ESAFNEERGMIDLPDGL
Subjt:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL

Query:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV
        IHFRMNIVELLVD+CQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQ+FDF VITQLVTMLAARPSNEIKG+MCLVYRSLAEVV
Subjt:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV

Query:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
        GSYFRSISAFHTDARPLLLFLATGITESV SHACAFALRKICEDATAVIFE PNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
Subjt:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA

Query:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA
        RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA
Subjt:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA

Query:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI
        IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI
Subjt:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI

Query:  LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA
        LAA+GSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLAS+LEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA
Subjt:  LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA

Query:  AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDE-EANYGHMQGKGGRVLKRLVREFADGHRNI
        AICSVSERTDLKP+LRWESLHGWLLSAVQALPLEYLKPGEVE+LVPLWLKALGDAACDYLESKSCDE +ANYGHMQGKGGRVLKRLVREFADGHRN+
Subjt:  AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDE-EANYGHMQGKGGRVLKRLVREFADGHRNI

XP_011651344.1 transportin-3 isoform X3 [Cucumis sativus]0.0e+0097.7Show/hide
Query:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL
        APSLIV+ASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNS NKKHVEDVFLSVFSALLDGLLLRAQV+ESAFNEERGMIDLPDGL
Subjt:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL

Query:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV
        IHFRMNIVELLVD+CQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQ+FDF VITQLVTMLAARPSNEIKG+MCLVYRSLAEVV
Subjt:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV

Query:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
        GSYFRSISAFHTDARPLLLFLATGITESV SHACAFALRKICEDATAVIFE PNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
Subjt:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA

Query:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA
        RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA
Subjt:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA

Query:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI
        IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI
Subjt:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI

Query:  LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA
        LAA+GSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLAS+LEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA
Subjt:  LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA

Query:  AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDE-EANYGHMQGKGGRVLKRLVREFADGHRNI
        AICSVSERTDLKP+LRWESLHGWLLSAVQALPLEYLKPGEVE+LVPLWLKALGDAACDYLESKSCDE +ANYGHMQGKGGRVLKRLVREFADGHRN+
Subjt:  AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDE-EANYGHMQGKGGRVLKRLVREFADGHRNI

XP_038891351.1 transportin-3 [Benincasa hispida]0.0e+0097.15Show/hide
Query:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL
        AP LIVEASAEAL+LADALLSCVAF SEDWEIADSTLQFWSSLASYILGLDENN AN+KHVEDVFLSVFSALLDGLLLRAQV+ESAFNEERGMIDLPDGL
Subjt:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL

Query:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV
        IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQ+FDF VITQLVTML+ARPSNEIKGVMCLVYRSLAEVV
Subjt:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV

Query:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
        GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFE PNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
Subjt:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA

Query:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA
        RLLSSSYEAIEKLVDEDN  SLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA
Subjt:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA

Query:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI
        IQSSGQHFV LLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI
Subjt:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI

Query:  LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA
        LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVI VLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA
Subjt:  LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA

Query:  AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDE-EANYGHMQGKGGRVLKRLVREFADGHRNIPNM
        AICSVSERTDLKPVLRWESLHGWLL AVQALPLEYLKPGEVETLVPLWLKALGDAA DYLESKSCDE + NYGHMQGKGGRVLKRLVREFADGHRNIPN+
Subjt:  AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDE-EANYGHMQGKGGRVLKRLVREFADGHRNIPNM

Query:  T
        T
Subjt:  T

TrEMBL top hitse value%identityAlignment
A0A1S3BN70 transportin-3 isoform X10.0e+00100Show/hide
Query:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL
        APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL
Subjt:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL

Query:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV
        IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV
Subjt:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV

Query:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
        GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
Subjt:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA

Query:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA
        RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA
Subjt:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA

Query:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI
        IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI
Subjt:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI

Query:  LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA
        LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA
Subjt:  LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA

Query:  AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDEEANYGHMQGKGGRVLKRLVREFADGHRNIPNMT
        AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDEEANYGHMQGKGGRVLKRLVREFADGHRNIPNMT
Subjt:  AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDEEANYGHMQGKGGRVLKRLVREFADGHRNIPNMT

A0A1S3BNV6 transportin-3 isoform X20.0e+00100Show/hide
Query:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL
        APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL
Subjt:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL

Query:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV
        IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV
Subjt:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV

Query:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
        GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
Subjt:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA

Query:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA
        RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA
Subjt:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA

Query:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI
        IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI
Subjt:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI

Query:  LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA
        LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA
Subjt:  LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA

Query:  AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDEEANYGHMQGKGGRVLKRLVREFADGHRNIPNMT
        AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDEEANYGHMQGKGGRVLKRLVREFADGHRNIPNMT
Subjt:  AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDEEANYGHMQGKGGRVLKRLVREFADGHRNIPNMT

A0A6J1D4H9 transportin MOS14 isoform X30.0e+0095.15Show/hide
Query:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL
        APSLIVEA+AEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNS N+KHVEDVFLS+FSALLDGLLLRAQV+ESAFNEERGM+DLPDGL
Subjt:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL

Query:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV
        IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPW+EVE KLFALNVVAEVVLQEGQ+FDF VITQLVT+L+ARPSNEIKG+MCLVYRSLAEVV
Subjt:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV

Query:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
        GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFE PNLEILIW+GESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
Subjt:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA

Query:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA
        RLLSSSYEAIEKLVDEDNALSLRQNPA YTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDP+FSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA
Subjt:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA

Query:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI
        IQSSGQHFV LLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASV AINSSYICDQEPDLVEAYTNFASIF+RCSHKEI
Subjt:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI

Query:  LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA
        LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLE AST SEG F+SMVI V+SHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA
Subjt:  LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA

Query:  AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDE-EANYGHMQGKGGRVLKRLVREFADGHRNIPNM
        AICS+SERTDLK VLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAA DYL+SKSCDE + NYGHMQGKGGRVLKRLVREFADGHRNIPN+
Subjt:  AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDE-EANYGHMQGKGGRVLKRLVREFADGHRNIPNM

Query:  T
        T
Subjt:  T

A0A6J1F1T5 transportin-3 isoform X10.0e+0096.01Show/hide
Query:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL
        APSLIV+ASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILG DENNSAN+K+VEDVFLS+FSALLDGLLLR QV+ESAFNEERGMIDLPDGL
Subjt:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL

Query:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV
        IHFRMN VELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQ+FDF VITQLVTML+ARPSNEIKG+MCLVYRSLAEVV
Subjt:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV

Query:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
        GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFE  NLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
Subjt:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA

Query:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA
        RLLSSSYEAIEKLVDEDNA SLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTE T DDPMFSLLIVFWPMLEKLLRCEHMENGNLS AACRALSLA
Subjt:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA

Query:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYG-HQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKE
        IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYG HQEKFGHLFITTFERFTYAASV+AINSSYICDQEPDLVEAY+NFASIFLRCSHKE
Subjt:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYG-HQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKE

Query:  ILAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQL
        ILAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCAT+LQQL
Subjt:  ILAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQL

Query:  AAICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDE-EANYGHMQGKGGRVLKRLVREFADGHRNIPN
        AAICSVSERTDLK VLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDE + NYGHMQGKGGRVLKRLVREFADGHRNI N
Subjt:  AAICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDE-EANYGHMQGKGGRVLKRLVREFADGHRNIPN

Query:  M
        +
Subjt:  M

A0A6J1J969 transportin-3 isoform X10.0e+0095.72Show/hide
Query:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL
        APSLIV+ASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILG DENNSAN+KHVEDVFLS+FSALLDGLLLR QV+ESAFNEERGMIDLPDGL
Subjt:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL

Query:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV
        IHFRMN VELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQ+FDF VITQLVTML+ARPSNEIKG+MCLVYRSLAEVV
Subjt:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV

Query:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
        GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFE  NLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
Subjt:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA

Query:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA
        RLLSSSYEAIEKLVDEDNA SLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTE + DDPMFSLLIVFWPMLEKLLRCEHMENGNLS AACRALSLA
Subjt:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA

Query:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYG-HQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKE
        IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVI+EEYG HQEKFGHLFITTFERFTYAASV+AINSSYICDQEPDLVEAY NFASIFLRCSHKE
Subjt:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYG-HQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKE

Query:  ILAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQL
        ILAA GSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCAT+LQQL
Subjt:  ILAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQL

Query:  AAICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDE-EANYGHMQGKGGRVLKRLVREFADGHRNIPN
        AAICSVSERTDLK VLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDE + NYGHMQGKGGRVLKRLVREFADGHRNI N
Subjt:  AAICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDE-EANYGHMQGKGGRVLKRLVREFADGHRNIPN

Query:  M
        +
Subjt:  M

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.4e-3036.16Show/hide
Query:  SAKQVAMEFIDKIVRRHGIPKSIISDRDKIFVSNFWKELFYAMNTILKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPNKWHQFIPWAELWYNTTFHS
        +A+Q A  F  +++   G PK II+D D IF S  WK+  +  N ++K S  + PQTDGQTER NQ +E  LRC C+  PN W   I   +  YN   HS
Subjt:  SAKQVAMEFIDKIVRRHGIPKSIISDRDKIFVSNFWKELFYAMNTILKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPNKWHQFIPWAELWYNTTFHS

Query:  STRTTPFQTVY----GRPPPPLISYGDKKTPNDEVEALLKERDLAISALKENLTIAQNRMKKFADSKRREL-KFKVGDEVYLKLRPYRQRSLARKRAEKL
        +T+ TPF+ V+       P  L S+ DK   N +      E       +KE+L     +MKK+ D K +E+ +F+ GD V +K    R ++    ++ KL
Subjt:  STRTTPFQTVY----GRPPPPLISYGDKKTPNDEVEALLKERDLAISALKENLTIAQNRMKKFADSKRREL-KFKVGDEVYLKLRPYRQRSLARKRAEKL

Query:  APKYHGPYRITETIEEVAYRLDLP
        AP + GP+ + +      Y LDLP
Subjt:  APKYHGPYRITETIEEVAYRLDLP

P0CT35 Transposon Tf2-2 polyprotein1.4e-3036.16Show/hide
Query:  SAKQVAMEFIDKIVRRHGIPKSIISDRDKIFVSNFWKELFYAMNTILKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPNKWHQFIPWAELWYNTTFHS
        +A+Q A  F  +++   G PK II+D D IF S  WK+  +  N ++K S  + PQTDGQTER NQ +E  LRC C+  PN W   I   +  YN   HS
Subjt:  SAKQVAMEFIDKIVRRHGIPKSIISDRDKIFVSNFWKELFYAMNTILKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPNKWHQFIPWAELWYNTTFHS

Query:  STRTTPFQTVY----GRPPPPLISYGDKKTPNDEVEALLKERDLAISALKENLTIAQNRMKKFADSKRREL-KFKVGDEVYLKLRPYRQRSLARKRAEKL
        +T+ TPF+ V+       P  L S+ DK   N +      E       +KE+L     +MKK+ D K +E+ +F+ GD V +K    R ++    ++ KL
Subjt:  STRTTPFQTVY----GRPPPPLISYGDKKTPNDEVEALLKERDLAISALKENLTIAQNRMKKFADSKRREL-KFKVGDEVYLKLRPYRQRSLARKRAEKL

Query:  APKYHGPYRITETIEEVAYRLDLP
        AP + GP+ + +      Y LDLP
Subjt:  APKYHGPYRITETIEEVAYRLDLP

P0CT36 Transposon Tf2-3 polyprotein1.4e-3036.16Show/hide
Query:  SAKQVAMEFIDKIVRRHGIPKSIISDRDKIFVSNFWKELFYAMNTILKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPNKWHQFIPWAELWYNTTFHS
        +A+Q A  F  +++   G PK II+D D IF S  WK+  +  N ++K S  + PQTDGQTER NQ +E  LRC C+  PN W   I   +  YN   HS
Subjt:  SAKQVAMEFIDKIVRRHGIPKSIISDRDKIFVSNFWKELFYAMNTILKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPNKWHQFIPWAELWYNTTFHS

Query:  STRTTPFQTVY----GRPPPPLISYGDKKTPNDEVEALLKERDLAISALKENLTIAQNRMKKFADSKRREL-KFKVGDEVYLKLRPYRQRSLARKRAEKL
        +T+ TPF+ V+       P  L S+ DK   N +      E       +KE+L     +MKK+ D K +E+ +F+ GD V +K    R ++    ++ KL
Subjt:  STRTTPFQTVY----GRPPPPLISYGDKKTPNDEVEALLKERDLAISALKENLTIAQNRMKKFADSKRREL-KFKVGDEVYLKLRPYRQRSLARKRAEKL

Query:  APKYHGPYRITETIEEVAYRLDLP
        AP + GP+ + +      Y LDLP
Subjt:  APKYHGPYRITETIEEVAYRLDLP

P0CT41 Transposon Tf2-12 polyprotein1.4e-3036.16Show/hide
Query:  SAKQVAMEFIDKIVRRHGIPKSIISDRDKIFVSNFWKELFYAMNTILKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPNKWHQFIPWAELWYNTTFHS
        +A+Q A  F  +++   G PK II+D D IF S  WK+  +  N ++K S  + PQTDGQTER NQ +E  LRC C+  PN W   I   +  YN   HS
Subjt:  SAKQVAMEFIDKIVRRHGIPKSIISDRDKIFVSNFWKELFYAMNTILKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPNKWHQFIPWAELWYNTTFHS

Query:  STRTTPFQTVY----GRPPPPLISYGDKKTPNDEVEALLKERDLAISALKENLTIAQNRMKKFADSKRREL-KFKVGDEVYLKLRPYRQRSLARKRAEKL
        +T+ TPF+ V+       P  L S+ DK   N +      E       +KE+L     +MKK+ D K +E+ +F+ GD V +K    R ++    ++ KL
Subjt:  STRTTPFQTVY----GRPPPPLISYGDKKTPNDEVEALLKERDLAISALKENLTIAQNRMKKFADSKRREL-KFKVGDEVYLKLRPYRQRSLARKRAEKL

Query:  APKYHGPYRITETIEEVAYRLDLP
        AP + GP+ + +      Y LDLP
Subjt:  APKYHGPYRITETIEEVAYRLDLP

Q9UR07 Transposon Tf2-11 polyprotein1.4e-3036.16Show/hide
Query:  SAKQVAMEFIDKIVRRHGIPKSIISDRDKIFVSNFWKELFYAMNTILKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPNKWHQFIPWAELWYNTTFHS
        +A+Q A  F  +++   G PK II+D D IF S  WK+  +  N ++K S  + PQTDGQTER NQ +E  LRC C+  PN W   I   +  YN   HS
Subjt:  SAKQVAMEFIDKIVRRHGIPKSIISDRDKIFVSNFWKELFYAMNTILKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPNKWHQFIPWAELWYNTTFHS

Query:  STRTTPFQTVY----GRPPPPLISYGDKKTPNDEVEALLKERDLAISALKENLTIAQNRMKKFADSKRREL-KFKVGDEVYLKLRPYRQRSLARKRAEKL
        +T+ TPF+ V+       P  L S+ DK   N +      E       +KE+L     +MKK+ D K +E+ +F+ GD V +K    R ++    ++ KL
Subjt:  STRTTPFQTVY----GRPPPPLISYGDKKTPNDEVEALLKERDLAISALKENLTIAQNRMKKFADSKRREL-KFKVGDEVYLKLRPYRQRSLARKRAEKL

Query:  APKYHGPYRITETIEEVAYRLDLP
        AP + GP+ + +      Y LDLP
Subjt:  APKYHGPYRITETIEEVAYRLDLP

Arabidopsis top hitse value%identityAlignment
AT1G12930.1 ARM repeat superfamily protein3.2e-25965.34Show/hide
Query:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL
        AP LIVEAS+EAL L DA+LSCV FPSEDWEIADST+QFWS+ A+YIL L  N   ++  V+D FL VFSAL+D L+LRAQV E   ++E   +DLPDGL
Subjt:  APSLIVEASAEALALADALLSCVAFPSEDWEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGL

Query:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV
        +HFR N++ELLVDICQ+L  + F+ KLFF G  + +V +P +E+E+KLFAL  V+E++LQEG+ FDF +I QLV+  + RPS+E+KG + +VYRSLA+VV
Subjt:  IHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPIPWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVV

Query:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA
        GSY R IS F ++ARPLLLFLA GI+E + SHACA ALRKICEDA AVI E  NL+IL+WIGE LE+  L LEDEEEV++A+++ILGSV NKEL++ LL 
Subjt:  GSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVIFEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLA

Query:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA
        +LLSSSY  + KLVDED   S RQ+PATYT++L+S  RGLYR+GTVFSHLATSL + P  D P+ SLL VFWP+LEKL R EHME+G+L+AAACRALS+A
Subjt:  RLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPTLDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLA

Query:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI
        +QSSG+HF+ LLP VLDCLS NF+ F   ECYI+TA VI EE+ H+E++G LFITTFERFT A+S+  INSSYICDQEPDLVEAY NFAS  +R  HKE+
Subjt:  IQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAINSSYICDQEPDLVEAYTNFASIFLRCSHKEI

Query:  LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA
        L  SG+LLE+SF KAAICCTAMHRGAALAAMSYLS FL+VSL+SM+E  ++ S+GSF+ + + V+SH GEGL+SN++YALLGV+AMSRVHKC+TILQQLA
Subjt:  LAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYALLGVSAMSRVHKCATILQQLA

Query:  AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDEEANY---GHMQGKGGRVLKRLVREFADGHRNIP
        AICS+ ERT  K +L W+SL GWL SAV ALP EYLK GE E++V  W +ALG A  DYLE+KSC+  +N    GHMQGK GR LKRLVR+FAD HRN P
Subjt:  AICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDEEANY---GHMQGKGGRVLKRLVREFADGHRNIP

Query:  N
        N
Subjt:  N

AT3G29750.1 Eukaryotic aspartyl protease family protein3.1e-0431.46Show/hide
Query:  INQLGEPEETMIEYR-----AITSLTTKGTMKLRGIVKGKEIIVLIDSGATHNFIHHELVKERKIPINRNTQFGITIGDGTSCKGEGIC
        IN+L E E+     R      +  LT    M+  G +   +++V IDSGAT NFI  EL    K+P +   Q  + +G     +  G C
Subjt:  INQLGEPEETMIEYR-----AITSLTTKGTMKLRGIVKGKEIIVLIDSGATHNFIHHELVKERKIPINRNTQFGITIGDGTSCKGEGIC

AT3G30770.1 Eukaryotic aspartyl protease family protein2.4e-0428.57Show/hide
Query:  ELMLFILNEEESTEEGEGSEAPNTEPAEINQLGEPEETMIEYRAITSLTTKGTMKLRGIVKGKEIIVLIDSGATHNFIHHELVKERKIPINRNTQFGITI
        ELM FI+ E  +   G   E    +   I Q        ++ ++ T  T    M+  G +   +++V+IDSGAT+NFI  EL    K+P +   Q  + +
Subjt:  ELMLFILNEEESTEEGEGSEAPNTEPAEINQLGEPEETMIEYRAITSLTTKGTMKLRGIVKGKEIIVLIDSGATHNFIHHELVKERKIPINRNTQFGITI

Query:  GDGTSCKGEGIC
        G     +  G C
Subjt:  GDGTSCKGEGIC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCGGAGAATCCAGATCCTGGTCAACCGGGCAGAGCATTTTTAGATCAAAACTTACCGGAAGTTCAGAAAAGAAGATGGGTATTATCGAGCCAAAAAGAAGCGAATC
AGTTGGTAATAAGGTGCAATGGAACAACGAAAAAGGGATGTTGAGGAAAAATGAGTTCCAAATGAAACAAATTACCATACCTCTAAAAGGAAGCTACCAAAAAGGGGAAC
CACCGGTTAAAAGATTGTCTGATGTGGAATTTAGAGCGCATTTAGATCAAGGTCTCTGTTTTAGGTGTAATGAGAAATATTCTCATGGACATCGATGCAAAATCAAAGAA
AAAAGGGAGCTCATGCTTTTTATCTTGAATGAAGAAGAAAGCACCGAGGAAGGGGAAGGGTCAGAGGCACCAAACACAGAACCAGCAGAGATCAATCAGTTGGGAGAACC
AGAAGAAACTATGATCGAATACCGGGCCATCACCAGTTTGACAACCAAGGGAACTATGAAACTAAGAGGAATAGTCAAAGGGAAAGAGATCATTGTCTTAATTGACAGTG
GGGCAACCCACAATTTCATTCATCATGAGCTGGTCAAGGAAAGGAAAATCCCCATCAATAGAAACACTCAGTTTGGTATTACTATTGGCGATGGCACAAGCTGTAAAGGA
GAAGGCATTTGTAGCAAAGCAGCAGATGCACTTTCAAGGATGGATCATTCAATAGAATTGAAGGCATTGTCAACAACAGGGATTGTAGACATGGCAGTAGTTACAAAGGA
AATCGAGAAAGATGAAGAACTTCAACTCTTAATCCAACAGTTACAGAATAACCCAGCATTGGAGGGCAATTACTCTTTAACAAATGGCACGCTAATGTATAAAGGAAGGC
ATCCGTTTTCAGCAAAGCAAGTTGCCATGGAATTCATTGATAAGATAGTTCGAAGACACGGCATCCCTAAGTCAATTATATCAGATAGGGACAAAATATTTGTAAGCAAC
TTTTGGAAAGAACTATTTTATGCCATGAATACCATCCTTAAACGAAGCACTGCCTTTCATCCTCAAACGGATGGCCAAACAGAACGAGTCAATCAATGTTTAGAAACCTA
CTTACGGTGCTTTTGTAATGAGCAACCAAACAAATGGCATCAGTTCATTCCATGGGCAGAGTTATGGTACAACACCACATTCCATTCATCAACACGCACAACTCCTTTCC
AGACTGTGTATGGTAGACCCCCACCACCCCTGATATCCTATGGAGACAAGAAGACACCTAATGATGAAGTTGAAGCATTGCTGAAGGAAAGAGATTTGGCTATTAGTGCG
CTCAAGGAGAACCTCACGATCGCTCAAAATAGAATGAAGAAATTTGCGGACTCAAAGAGAAGAGAACTTAAGTTTAAAGTAGGAGATGAAGTCTATCTAAAGTTAAGACC
CTACCGGCAGCGCTCCTTAGCAAGAAAAAGAGCAGAAAAGCTAGCTCCTAAATATCATGGGCCGTATCGCATCACTGAGACCATAGAAGAAGTGGCATACCGACTTGATT
TACCACCCGAAGCATCAATTCATAATATCCTAACACTAAAATCACTTTATTACAAGGGTCTCCATGCCAATCAGTCGACTCATTTCAAAAAAAGTCAAACACAGAAATTG
TCTCTCCCTGCAGTGTTAGCAGTGAAGAATCTGCGGGTGCATCAGGGGGTATTCTCACGATGTGGGACAAGAGCAAATTACGGTGGTGGAGCATGGTCATTAGTTTCATG
GGGCAATTTCTCAGTCAAATCTCTTTCGACCCATCTTTCCCCATCTTCTCTGATGGACAAAGCTATCTATAAAGCCCTTGGAAGACAAGCTGCCCAAGGAGAGTCAACAT
TTTGTTTCGGATTATGGCGTTTGGCACCATCCTTAATTGTAGAGGCCAGTGCTGAAGCCCTTGCTCTAGCTGATGCTCTCTTGAGTTGTGTGGCTTTTCCAAGTGAAGAT
TGGGAGATTGCTGACTCAACATTACAATTTTGGTCTTCTCTTGCAAGCTATATTCTTGGCCTTGATGAGAATAATTCAGCGAATAAGAAACACGTGGAAGATGTATTTCT
ATCTGTATTTTCAGCACTACTTGATGGGCTTCTATTACGAGCTCAGGTGATTGAATCTGCTTTCAACGAGGAAAGAGGAATGATAGACCTACCGGATGGTCTTATCCATT
TTAGAATGAATATCGTTGAGCTTCTGGTGGATATTTGTCAAATTTTAAGGTCTTCCAGATTTATGGAAAAGCTCTTCTTTAGTGGTTGGACCAATGGTAATGTACCAATT
CCTTGGAAGGAAGTGGAGAGCAAATTATTTGCCCTTAATGTGGTCGCTGAGGTAGTCCTACAGGAGGGTCAAACCTTTGATTTCGTCGTAATAACGCAACTGGTGACCAT
GTTGGCGGCTAGACCTTCAAATGAGATCAAAGGCGTAATGTGCCTTGTTTATAGATCACTGGCAGAAGTTGTTGGATCTTACTTTAGGTCAATTTCTGCTTTTCACACAG
ATGCCAGACCCTTGCTATTATTTCTTGCTACTGGGATCACAGAATCTGTCTCTTCACATGCTTGTGCCTTTGCCCTCCGTAAAATTTGTGAAGATGCAACTGCTGTAATC
TTCGAACCGCCAAATTTGGAAATTTTGATTTGGATCGGAGAGAGTTTGGAGAAGTTGCATTTACCTTTGGAGGACGAGGAAGAAGTAGTGAGTGCTGTAAGTTTGATTCT
TGGTTCAGTTCCTAATAAAGAACTGAAGAGCAACTTGCTGGCTAGATTGCTTTCGTCAAGCTATGAAGCAATTGAGAAACTAGTCGATGAAGATAATGCACTATCGTTGA
GACAAAATCCGGCTACTTACACAAAAATCTTAACCTCTGCTGTGAGAGGCCTGTATAGGATGGGAACTGTATTTAGCCATCTAGCTACGTCTTTATCAACTGAGCCTACT
CTCGATGATCCTATGTTTTCTTTGTTGATAGTTTTCTGGCCAATGCTAGAGAAACTTTTAAGGTGTGAACACATGGAGAATGGTAATCTCTCTGCAGCTGCTTGTCGTGC
TCTATCTTTAGCCATCCAGTCTTCAGGTCAACATTTTGTTACATTGCTGCCAAAAGTTTTAGATTGCCTATCGACAAATTTTGTTTTGTTCCATGGTCATGAATGTTACA
TCAAAACAGCTTCAGTTATTGTTGAAGAATATGGCCATCAAGAAAAATTTGGGCATTTGTTTATCACCACTTTTGAAAGGTTTACTTATGCAGCTTCCGTAAGTGCTATT
AATTCTTCTTACATATGTGACCAAGAACCTGATCTAGTGGAGGCTTACACAAATTTTGCATCAATTTTTCTCCGATGCTCTCATAAGGAAATATTAGCTGCGTCTGGTTC
TCTTTTGGAGGTTTCATTCCAGAAGGCTGCTATATGTTGCACTGCCATGCATCGTGGGGCAGCGTTAGCAGCAATGTCATACCTATCTTGTTTCTTGGATGTTAGTCTAG
CTTCAATGTTAGAATTTGCAAGTACTAATTCTGAGGGATCATTCAATTCTATGGTTATCCACGTTCTATCCCACAGTGGCGAGGGACTTGTATCGAACATTTTGTATGCT
TTGCTAGGTGTTTCAGCAATGTCACGGGTTCACAAGTGTGCAACAATTCTGCAACAGTTGGCAGCAATTTGCAGTGTCAGTGAAAGAACAGACTTGAAACCTGTCCTTCG
CTGGGAATCTTTGCATGGCTGGCTACTATCAGCGGTGCAGGCTCTCCCACTTGAATATTTAAAACCAGGGGAAGTTGAAACTCTTGTGCCACTATGGTTAAAGGCTCTTG
GAGATGCAGCCTGTGACTACCTTGAAAGTAAAAGTTGTGATGAAGAGGCTAATTATGGACATATGCAAGGGAAGGGTGGAAGAGTCCTGAAGCGTCTAGTCCGTGAATTT
GCTGATGGTCACCGCAATATTCCAAATATGACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTCGGAGAATCCAGATCCTGGTCAACCGGGCAGAGCATTTTTAGATCAAAACTTACCGGAAGTTCAGAAAAGAAGATGGGTATTATCGAGCCAAAAAGAAGCGAATC
AGTTGGTAATAAGGTGCAATGGAACAACGAAAAAGGGATGTTGAGGAAAAATGAGTTCCAAATGAAACAAATTACCATACCTCTAAAAGGAAGCTACCAAAAAGGGGAAC
CACCGGTTAAAAGATTGTCTGATGTGGAATTTAGAGCGCATTTAGATCAAGGTCTCTGTTTTAGGTGTAATGAGAAATATTCTCATGGACATCGATGCAAAATCAAAGAA
AAAAGGGAGCTCATGCTTTTTATCTTGAATGAAGAAGAAAGCACCGAGGAAGGGGAAGGGTCAGAGGCACCAAACACAGAACCAGCAGAGATCAATCAGTTGGGAGAACC
AGAAGAAACTATGATCGAATACCGGGCCATCACCAGTTTGACAACCAAGGGAACTATGAAACTAAGAGGAATAGTCAAAGGGAAAGAGATCATTGTCTTAATTGACAGTG
GGGCAACCCACAATTTCATTCATCATGAGCTGGTCAAGGAAAGGAAAATCCCCATCAATAGAAACACTCAGTTTGGTATTACTATTGGCGATGGCACAAGCTGTAAAGGA
GAAGGCATTTGTAGCAAAGCAGCAGATGCACTTTCAAGGATGGATCATTCAATAGAATTGAAGGCATTGTCAACAACAGGGATTGTAGACATGGCAGTAGTTACAAAGGA
AATCGAGAAAGATGAAGAACTTCAACTCTTAATCCAACAGTTACAGAATAACCCAGCATTGGAGGGCAATTACTCTTTAACAAATGGCACGCTAATGTATAAAGGAAGGC
ATCCGTTTTCAGCAAAGCAAGTTGCCATGGAATTCATTGATAAGATAGTTCGAAGACACGGCATCCCTAAGTCAATTATATCAGATAGGGACAAAATATTTGTAAGCAAC
TTTTGGAAAGAACTATTTTATGCCATGAATACCATCCTTAAACGAAGCACTGCCTTTCATCCTCAAACGGATGGCCAAACAGAACGAGTCAATCAATGTTTAGAAACCTA
CTTACGGTGCTTTTGTAATGAGCAACCAAACAAATGGCATCAGTTCATTCCATGGGCAGAGTTATGGTACAACACCACATTCCATTCATCAACACGCACAACTCCTTTCC
AGACTGTGTATGGTAGACCCCCACCACCCCTGATATCCTATGGAGACAAGAAGACACCTAATGATGAAGTTGAAGCATTGCTGAAGGAAAGAGATTTGGCTATTAGTGCG
CTCAAGGAGAACCTCACGATCGCTCAAAATAGAATGAAGAAATTTGCGGACTCAAAGAGAAGAGAACTTAAGTTTAAAGTAGGAGATGAAGTCTATCTAAAGTTAAGACC
CTACCGGCAGCGCTCCTTAGCAAGAAAAAGAGCAGAAAAGCTAGCTCCTAAATATCATGGGCCGTATCGCATCACTGAGACCATAGAAGAAGTGGCATACCGACTTGATT
TACCACCCGAAGCATCAATTCATAATATCCTAACACTAAAATCACTTTATTACAAGGGTCTCCATGCCAATCAGTCGACTCATTTCAAAAAAAGTCAAACACAGAAATTG
TCTCTCCCTGCAGTGTTAGCAGTGAAGAATCTGCGGGTGCATCAGGGGGTATTCTCACGATGTGGGACAAGAGCAAATTACGGTGGTGGAGCATGGTCATTAGTTTCATG
GGGCAATTTCTCAGTCAAATCTCTTTCGACCCATCTTTCCCCATCTTCTCTGATGGACAAAGCTATCTATAAAGCCCTTGGAAGACAAGCTGCCCAAGGAGAGTCAACAT
TTTGTTTCGGATTATGGCGTTTGGCACCATCCTTAATTGTAGAGGCCAGTGCTGAAGCCCTTGCTCTAGCTGATGCTCTCTTGAGTTGTGTGGCTTTTCCAAGTGAAGAT
TGGGAGATTGCTGACTCAACATTACAATTTTGGTCTTCTCTTGCAAGCTATATTCTTGGCCTTGATGAGAATAATTCAGCGAATAAGAAACACGTGGAAGATGTATTTCT
ATCTGTATTTTCAGCACTACTTGATGGGCTTCTATTACGAGCTCAGGTGATTGAATCTGCTTTCAACGAGGAAAGAGGAATGATAGACCTACCGGATGGTCTTATCCATT
TTAGAATGAATATCGTTGAGCTTCTGGTGGATATTTGTCAAATTTTAAGGTCTTCCAGATTTATGGAAAAGCTCTTCTTTAGTGGTTGGACCAATGGTAATGTACCAATT
CCTTGGAAGGAAGTGGAGAGCAAATTATTTGCCCTTAATGTGGTCGCTGAGGTAGTCCTACAGGAGGGTCAAACCTTTGATTTCGTCGTAATAACGCAACTGGTGACCAT
GTTGGCGGCTAGACCTTCAAATGAGATCAAAGGCGTAATGTGCCTTGTTTATAGATCACTGGCAGAAGTTGTTGGATCTTACTTTAGGTCAATTTCTGCTTTTCACACAG
ATGCCAGACCCTTGCTATTATTTCTTGCTACTGGGATCACAGAATCTGTCTCTTCACATGCTTGTGCCTTTGCCCTCCGTAAAATTTGTGAAGATGCAACTGCTGTAATC
TTCGAACCGCCAAATTTGGAAATTTTGATTTGGATCGGAGAGAGTTTGGAGAAGTTGCATTTACCTTTGGAGGACGAGGAAGAAGTAGTGAGTGCTGTAAGTTTGATTCT
TGGTTCAGTTCCTAATAAAGAACTGAAGAGCAACTTGCTGGCTAGATTGCTTTCGTCAAGCTATGAAGCAATTGAGAAACTAGTCGATGAAGATAATGCACTATCGTTGA
GACAAAATCCGGCTACTTACACAAAAATCTTAACCTCTGCTGTGAGAGGCCTGTATAGGATGGGAACTGTATTTAGCCATCTAGCTACGTCTTTATCAACTGAGCCTACT
CTCGATGATCCTATGTTTTCTTTGTTGATAGTTTTCTGGCCAATGCTAGAGAAACTTTTAAGGTGTGAACACATGGAGAATGGTAATCTCTCTGCAGCTGCTTGTCGTGC
TCTATCTTTAGCCATCCAGTCTTCAGGTCAACATTTTGTTACATTGCTGCCAAAAGTTTTAGATTGCCTATCGACAAATTTTGTTTTGTTCCATGGTCATGAATGTTACA
TCAAAACAGCTTCAGTTATTGTTGAAGAATATGGCCATCAAGAAAAATTTGGGCATTTGTTTATCACCACTTTTGAAAGGTTTACTTATGCAGCTTCCGTAAGTGCTATT
AATTCTTCTTACATATGTGACCAAGAACCTGATCTAGTGGAGGCTTACACAAATTTTGCATCAATTTTTCTCCGATGCTCTCATAAGGAAATATTAGCTGCGTCTGGTTC
TCTTTTGGAGGTTTCATTCCAGAAGGCTGCTATATGTTGCACTGCCATGCATCGTGGGGCAGCGTTAGCAGCAATGTCATACCTATCTTGTTTCTTGGATGTTAGTCTAG
CTTCAATGTTAGAATTTGCAAGTACTAATTCTGAGGGATCATTCAATTCTATGGTTATCCACGTTCTATCCCACAGTGGCGAGGGACTTGTATCGAACATTTTGTATGCT
TTGCTAGGTGTTTCAGCAATGTCACGGGTTCACAAGTGTGCAACAATTCTGCAACAGTTGGCAGCAATTTGCAGTGTCAGTGAAAGAACAGACTTGAAACCTGTCCTTCG
CTGGGAATCTTTGCATGGCTGGCTACTATCAGCGGTGCAGGCTCTCCCACTTGAATATTTAAAACCAGGGGAAGTTGAAACTCTTGTGCCACTATGGTTAAAGGCTCTTG
GAGATGCAGCCTGTGACTACCTTGAAAGTAAAAGTTGTGATGAAGAGGCTAATTATGGACATATGCAAGGGAAGGGTGGAAGAGTCCTGAAGCGTCTAGTCCGTGAATTT
GCTGATGGTCACCGCAATATTCCAAATATGACTTAA
Protein sequenceShow/hide protein sequence
MFGESRSWSTGQSIFRSKLTGSSEKKMGIIEPKRSESVGNKVQWNNEKGMLRKNEFQMKQITIPLKGSYQKGEPPVKRLSDVEFRAHLDQGLCFRCNEKYSHGHRCKIKE
KRELMLFILNEEESTEEGEGSEAPNTEPAEINQLGEPEETMIEYRAITSLTTKGTMKLRGIVKGKEIIVLIDSGATHNFIHHELVKERKIPINRNTQFGITIGDGTSCKG
EGICSKAADALSRMDHSIELKALSTTGIVDMAVVTKEIEKDEELQLLIQQLQNNPALEGNYSLTNGTLMYKGRHPFSAKQVAMEFIDKIVRRHGIPKSIISDRDKIFVSN
FWKELFYAMNTILKRSTAFHPQTDGQTERVNQCLETYLRCFCNEQPNKWHQFIPWAELWYNTTFHSSTRTTPFQTVYGRPPPPLISYGDKKTPNDEVEALLKERDLAISA
LKENLTIAQNRMKKFADSKRRELKFKVGDEVYLKLRPYRQRSLARKRAEKLAPKYHGPYRITETIEEVAYRLDLPPEASIHNILTLKSLYYKGLHANQSTHFKKSQTQKL
SLPAVLAVKNLRVHQGVFSRCGTRANYGGGAWSLVSWGNFSVKSLSTHLSPSSLMDKAIYKALGRQAAQGESTFCFGLWRLAPSLIVEASAEALALADALLSCVAFPSED
WEIADSTLQFWSSLASYILGLDENNSANKKHVEDVFLSVFSALLDGLLLRAQVIESAFNEERGMIDLPDGLIHFRMNIVELLVDICQILRSSRFMEKLFFSGWTNGNVPI
PWKEVESKLFALNVVAEVVLQEGQTFDFVVITQLVTMLAARPSNEIKGVMCLVYRSLAEVVGSYFRSISAFHTDARPLLLFLATGITESVSSHACAFALRKICEDATAVI
FEPPNLEILIWIGESLEKLHLPLEDEEEVVSAVSLILGSVPNKELKSNLLARLLSSSYEAIEKLVDEDNALSLRQNPATYTKILTSAVRGLYRMGTVFSHLATSLSTEPT
LDDPMFSLLIVFWPMLEKLLRCEHMENGNLSAAACRALSLAIQSSGQHFVTLLPKVLDCLSTNFVLFHGHECYIKTASVIVEEYGHQEKFGHLFITTFERFTYAASVSAI
NSSYICDQEPDLVEAYTNFASIFLRCSHKEILAASGSLLEVSFQKAAICCTAMHRGAALAAMSYLSCFLDVSLASMLEFASTNSEGSFNSMVIHVLSHSGEGLVSNILYA
LLGVSAMSRVHKCATILQQLAAICSVSERTDLKPVLRWESLHGWLLSAVQALPLEYLKPGEVETLVPLWLKALGDAACDYLESKSCDEEANYGHMQGKGGRVLKRLVREF
ADGHRNIPNMT