CuGenDBv2

Pay0014737 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0014737
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr05:9312288..9313712
RNA-Seq Expression: Pay0014737
Synteny: Pay0014737
Gene Ontology terms:
  GO:0006355 - regulation of transcription, DNA-templated (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
  GO:0042393 - histone binding (molecular function)
InterPro domains: NA
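
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, the coordinates can be split into their parts and the locus span computed as follows. The names `Region` and `parse_region` are hypothetical helpers introduced here for illustration, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval (hypothetical helper, not part of CuGenDB)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_region(text: str) -> Region:
    """Parse a 'chrom:start..end' string into a Region."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return Region(chrom, int(start), int(end))


region = parse_region("chr05:9312288..9313712")
print(region.chrom, region.start, region.end, region.length)
# chr05 9312288 9313712 1425
```

Under the inclusive-coordinate assumption the locus spans 1425 bp.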


Homology
GenBank top hits | e value | % identity
KAA0038009.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 3.3e-237 | 92.42
Query:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDE-ATTTAELWKKLESLYLTKSLSNK
        MASTRFEVSKFN NGDFALWRKKIRAILVQHKVAKILDEGRLPANITE+EKRDM+EMAYSTIL+YLS EVLRLVDE  TTTAELWKKLESLYLTKSL NK
Subjt:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDE-ATTTAELWKKLESLYLTKSLSNK

Query:  IYIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARG
        IYIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPE YR VKAAIKYGRDSLTMSIV DALKTRNL+IKKERKDGELLMARG
Subjt:  IYIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARG

Query:  RSDKKNWKGKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHM
        RSDKK WKGKEKSSRMNSN EARKCFLCHK GHFKKNCPLNKSREASTSEANVTDGYNSA IT GYDS ETGYESAEVLMVSHRDIQDAWIMDSGCTYHM
Subjt:  RSDKKNWKGKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHM

Query:  TPNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVL
        TPN DFLINFQKSDGGKVLL DNGTCEVKGTG VLIATHDGMIRMLTN+RYVPELKRNLISL ELD+S YTIK+E GIMKVTKGSLVKLRGTL N LYVL
Subjt:  TPNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVL

Query:  EGTAVSGSATKALKQQKQQIVDHVVTNIRIDGVQSSGKGSDISSDQSPLVSQIEATEQSEFDGVQSEQERTLIDE
        EG AVS SATKALKQQKQQIVDHVVT+IRIDGVQSSGKGSDI SDQSPLVSQIEATE+SEFDGVQS+QERTLI+E
Subjt:  EGTAVSGSATKALKQQKQQIVDHVVTNIRIDGVQSSGKGSDISSDQSPLVSQIEATEQSEFDGVQSEQERTLIDE

KAA0046503.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | 2.2e-217 | 88.1
Query:  GNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKIYIKEKFFGYKMD
        GNGDFALWRKKIRAILVQHKVAKILDEGR+PANITESEKRDM+EMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSL NKIYIKEKFFGYK+D
Subjt:  GNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKIYIKEKFFGYKMD

Query:  QSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGRSDKKNWKGKEKS
        QSKSLEENLNEFQKIVVDLNNIGEKMSDENQ IILLNSLP+ YR VKAAIKYG+DSLTMSIVLDALKTRNLEIKKERKDGELLMARGRSDKKNWKGKEKS
Subjt:  QSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGRSDKKNWKGKEKS

Query:  SRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHMTPNWDFLINFQKS
        SRMNSN EARK FLCHK GHFKKNCPLNKSREA TSE NVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWI DSGCTYHMTPN DFLINFQKS
Subjt:  SRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHMTPNWDFLINFQKS

Query:  DGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVLEGTAVSGSATKAL
        DGGKVLL DNGTCEVKGTG VLIATHDGMIRMLTN+RYVPELKRNLISLG+LD+SGYTIKFE GIMKVTKGSLVKLRGTL NGLYVLEGTAV        
Subjt:  DGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVLEGTAVSGSATKAL

Query:  KQQKQQIVDHVVTNIRIDGVQSSGKGSDISSDQSPLVSQIEATEQSEFDGVQSEQERTLIDE
                              SGKGSDISSDQSPLVSQIEATEQSEFDGVQS+ ERTLIDE
Subjt:  KQQKQQIVDHVVTNIRIDGVQSSGKGSDISSDQSPLVSQIEATEQSEFDGVQSEQERTLIDE

KAA0047995.1 retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa] | 2.2e-196 | 87.83
Query:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKI
        MASTRFEVSKFNG+GDFALWRKKIRAILVQHKVAKILDE RLP NITESEKRDM+EMAY TILLYLSDEVLRLVDEATTT ELWKKLESLYLTKSL NKI
Subjt:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKI

Query:  YIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
        YIKEKFFGYKMDQSK LEENL+EFQKIVVDLNNIGEKMSDENQA+ILLNSLPE YR VKAAIKYG DSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
Subjt:  YIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR

Query:  SDKKNWKGKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHMT
        S+KK+WKGKE+S R  S  ++RKCFLCHK GHFKKNCPLNKSREASTSEANVTDGYNSA+ITDG DS ETGYESAEVLMVSHRDIQDAWIMDSGCT+HMT
Subjt:  SDKKNWKGKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHMT

Query:  PNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVLE
        P+ DFL NFQK DGGKVLL DNGTC+VKGTG V IATHDGM+R+LTN+RYVP+LKRNLISLGELD+SG TIK E G+MKVTKGSLVKLRGTL +GLYVLE
Subjt:  PNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVLE

Query:  GTAVSGSATKA
        GT VSGSA  A
Subjt:  GTAVSGSATKA

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] | 2.1e-199 | 88.56
Query:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKI
        MASTRFEVSKFNG+GDF+LWRKKIRAILVQHKVAKILDE RLP NITESEKRDM+EMAYSTILLYLSDEVLRLVDEATTT ELWKKLESLYLTKSL NKI
Subjt:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKI

Query:  YIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
        YIKEKFFGYKMDQSKSLEENL+EFQKIVVDLNNIGEKMSDENQA+ILLNSLPE YR VKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
Subjt:  YIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR

Query:  SDKKNWKGKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHMT
        S+KK+WKGKE+S R  S  ++RKCFLCHK GHFKKNCPLNKSREASTSEANVTDGYNSA+ITDGYDS ETGYESAEVLMVSHRDIQDAWIMDSGCT+HMT
Subjt:  SDKKNWKGKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHMT

Query:  PNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVLE
        P+ DFL NFQK DGGKVLL DNGTC+VKGTG V IATHDGM+R+LTN+RYVP+LKRNLISLGELD+SG TIK E G+MKVTKGSLVKLRGTL +GLYVLE
Subjt:  PNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVLE

Query:  GTAVSGSATKA
        GT VSGSA  A
Subjt:  GTAVSGSATKA

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa] | 7.2e-200 | 88.81
Query:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKI
        MASTRFEVSKFNG+GDFALWRKKIRAILVQHKVAKILDE RLP NITESEKRDM+EMAYSTILLYLSDEVLRLVDEATTT ELWKKLESLYLTKSL NKI
Subjt:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKI

Query:  YIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
        YIKEKFFGYKMDQSKSLEENL+EFQKIVVDLNNIGEKMSDENQA+ILLNSLPE YR VKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
Subjt:  YIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR

Query:  SDKKNWKGKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHMT
        S+KK+WKGKE+S R  S  ++RKCFLCHK GHFKKNCPLNKSREASTSEANVTDGYNSA+ITDGYDS ETGYESAEVLMVSHRDIQDAWIMDSGCT+HMT
Subjt:  SDKKNWKGKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHMT

Query:  PNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVLE
        P+ DFL NFQK DGGKVLL DNGTC+VKGTG V IATHDGM+R+LTN+RYVP+LKRNLISLGELD+SG TIK E G+MKVTKGSLVKLRGTL +GLYVLE
Subjt:  PNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVLE

Query:  GTAVSGSATKA
        GT VSGSA  A
Subjt:  GTAVSGSATKA

TrEMBL top hits | e value | % identity
A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class | 7.7e-06 | 59.68
Query:  KQQKQQIVDHVVTNIRIDGVQSSGKGSDISSDQSPLVSQIEATEQSEFDGVQSEQERTLIDE
        +QQKQQ  DHVVT +RI    S  + S    +Q PLVS+IE T+QSEFDG+QS+QER LIDE
Subjt:  KQQKQQIVDHVVTNIRIDGVQSSGKGSDISSDQSPLVSQIEATEQSEFDGVQSEQERTLIDE

A0A5A7UB25 Putative gag-pol polyprotein | 7.7e-06 | 59.68
Query:  KQQKQQIVDHVVTNIRIDGVQSSGKGSDISSDQSPLVSQIEATEQSEFDGVQSEQERTLIDE
        +QQKQQ  DHVVT +RI    S  + S    +Q PLVS+IE T+QSEFDG+QS+QER LIDE
Subjt:  KQQKQQIVDHVVTNIRIDGVQSSGKGSDISSDQSPLVSQIEATEQSEFDGVQSEQERTLIDE

A0A5A7UB25 Putative gag-pol polyprotein | 1.1e-196 | 87.83
Query:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKI
        MASTRFEVSKFNG+GDFALWRKKIRAILVQHKVAKILDE RLP NITESEKRDM+EMAY TILLYLSDEVLRLVDEATTT ELWKKLESLYLTKSL NKI
Subjt:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKI

Query:  YIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
        YIKEKFFGYKMDQSK LEENL+EFQKIVVDLNNIGEKMSDENQA+ILLNSLPE YR VKAAIKYG DSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
Subjt:  YIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR

Query:  SDKKNWKGKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHMT
        S+KK+WKGKE+S R  S  ++RKCFLCHK GHFKKNCPLNKSREASTSEANVTDGYNSA+ITDG DS ETGYESAEVLMVSHRDIQDAWIMDSGCT+HMT
Subjt:  SDKKNWKGKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHMT

Query:  PNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVLE
        P+ DFL NFQK DGGKVLL DNGTC+VKGTG V IATHDGM+R+LTN+RYVP+LKRNLISLGELD+SG TIK E G+MKVTKGSLVKLRGTL +GLYVLE
Subjt:  PNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVLE

Query:  GTAVSGSATKA
        GT VSGSA  A
Subjt:  GTAVSGSATKA

A0A5D3BRB2 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.6e-237 | 92.42
Query:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDE-ATTTAELWKKLESLYLTKSLSNK
        MASTRFEVSKFN NGDFALWRKKIRAILVQHKVAKILDEGRLPANITE+EKRDM+EMAYSTIL+YLS EVLRLVDE  TTTAELWKKLESLYLTKSL NK
Subjt:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDE-ATTTAELWKKLESLYLTKSLSNK

Query:  IYIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARG
        IYIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPE YR VKAAIKYGRDSLTMSIV DALKTRNL+IKKERKDGELLMARG
Subjt:  IYIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARG

Query:  RSDKKNWKGKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHM
        RSDKK WKGKEKSSRMNSN EARKCFLCHK GHFKKNCPLNKSREASTSEANVTDGYNSA IT GYDS ETGYESAEVLMVSHRDIQDAWIMDSGCTYHM
Subjt:  RSDKKNWKGKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHM

Query:  TPNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVL
        TPN DFLINFQKSDGGKVLL DNGTCEVKGTG VLIATHDGMIRMLTN+RYVPELKRNLISL ELD+S YTIK+E GIMKVTKGSLVKLRGTL N LYVL
Subjt:  TPNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVL

Query:  EGTAVSGSATKALKQQKQQIVDHVVTNIRIDGVQSSGKGSDISSDQSPLVSQIEATEQSEFDGVQSEQERTLIDE
        EG AVS SATKALKQQKQQIVDHVVT+IRIDGVQSSGKGSDI SDQSPLVSQIEATE+SEFDGVQS+QERTLI+E
Subjt:  EGTAVSGSATKALKQQKQQIVDHVVTNIRIDGVQSSGKGSDISSDQSPLVSQIEATEQSEFDGVQSEQERTLIDE

A0A5D3BRV3 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.1e-217 | 88.1
Query:  GNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKIYIKEKFFGYKMD
        GNGDFALWRKKIRAILVQHKVAKILDEGR+PANITESEKRDM+EMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSL NKIYIKEKFFGYK+D
Subjt:  GNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKIYIKEKFFGYKMD

Query:  QSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGRSDKKNWKGKEKS
        QSKSLEENLNEFQKIVVDLNNIGEKMSDENQ IILLNSLP+ YR VKAAIKYG+DSLTMSIVLDALKTRNLEIKKERKDGELLMARGRSDKKNWKGKEKS
Subjt:  QSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGRSDKKNWKGKEKS

Query:  SRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHMTPNWDFLINFQKS
        SRMNSN EARK FLCHK GHFKKNCPLNKSREA TSE NVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWI DSGCTYHMTPN DFLINFQKS
Subjt:  SRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHMTPNWDFLINFQKS

Query:  DGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVLEGTAVSGSATKAL
        DGGKVLL DNGTCEVKGTG VLIATHDGMIRMLTN+RYVPELKRNLISLG+LD+SGYTIKFE GIMKVTKGSLVKLRGTL NGLYVLEGTAV        
Subjt:  DGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVLEGTAVSGSATKAL

Query:  KQQKQQIVDHVVTNIRIDGVQSSGKGSDISSDQSPLVSQIEATEQSEFDGVQSEQERTLIDE
                              SGKGSDISSDQSPLVSQIEATEQSEFDGVQS+ ERTLIDE
Subjt:  KQQKQQIVDHVVTNIRIDGVQSSGKGSDISSDQSPLVSQIEATEQSEFDGVQSEQERTLIDE

A0A5D3DNU1 Putative gag-pol polyprotein | 3.5e-200 | 88.81
Query:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKI
        MASTRFEVSKFNG+GDFALWRKKIRAILVQHKVAKILDE RLP NITESEKRDM+EMAYSTILLYLSDEVLRLVDEATTT ELWKKLESLYLTKSL NKI
Subjt:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKI

Query:  YIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
        YIKEKFFGYKMDQSKSLEENL+EFQKIVVDLNNIGEKMSDENQA+ILLNSLPE YR VKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
Subjt:  YIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR

Query:  SDKKNWKGKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHMT
        S+KK+WKGKE+S R  S  ++RKCFLCHK GHFKKNCPLNKSREASTSEANVTDGYNSA+ITDGYDS ETGYESAEVLMVSHRDIQDAWIMDSGCT+HMT
Subjt:  SDKKNWKGKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHMT

Query:  PNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVLE
        P+ DFL NFQK DGGKVLL DNGTC+VKGTG V IATHDGM+R+LTN+RYVP+LKRNLISLGELD+SG TIK E G+MKVTKGSLVKLRGTL +GLYVLE
Subjt:  PNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVLE

Query:  GTAVSGSATKA
        GT VSGSA  A
Subjt:  GTAVSGSATKA

A0A5D3DNU1 Putative gag-pol polyprotein | 7.7e-06 | 59.68
Query:  KQQKQQIVDHVVTNIRIDGVQSSGKGSDISSDQSPLVSQIEATEQSEFDGVQSEQERTLIDE
        +QQKQQ  DHVVT +RI    S  + S    +Q PLVS+IE T+QSEFDG+QS+QER LIDE
Subjt:  KQQKQQIVDHVVTNIRIDGVQSSGKGSDISSDQSPLVSQIEATEQSEFDGVQSEQERTLIDE

A0A5D3DNU1 Putative gag-pol polyprotein | 1.0e-199 | 88.56
Query:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKI
        MASTRFEVSKFNG+GDF+LWRKKIRAILVQHKVAKILDE RLP NITESEKRDM+EMAYSTILLYLSDEVLRLVDEATTT ELWKKLESLYLTKSL NKI
Subjt:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKI

Query:  YIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
        YIKEKFFGYKMDQSKSLEENL+EFQKIVVDLNNIGEKMSDENQA+ILLNSLPE YR VKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
Subjt:  YIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR

Query:  SDKKNWKGKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHMT
        S+KK+WKGKE+S R  S  ++RKCFLCHK GHFKKNCPLNKSREASTSEANVTDGYNSA+ITDGYDS ETGYESAEVLMVSHRDIQDAWIMDSGCT+HMT
Subjt:  SDKKNWKGKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHMT

Query:  PNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVLE
        P+ DFL NFQK DGGKVLL DNGTC+VKGTG V IATHDGM+R+LTN+RYVP+LKRNLISLGELD+SG TIK E G+MKVTKGSLVKLRGTL +GLYVLE
Subjt:  PNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVLE

Query:  GTAVSGSATKA
        GT VSGSA  A
Subjt:  GTAVSGSATKA

SwissProt top hits | e value | % identity
P04146 Copia protein | 8.2e-21 | 23.99
Query:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKI
        M   +  +  F+G   +A+W+ +IRA+L +  V K++D G +P  + +S K+     A STI+ YLSD  L       T  ++ + L+++Y  KSL++++
Subjt:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKI

Query:  YIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIK-YGRDSLTMSIVLDALKTRNLEIKKERKD--GELLMA
         ++++    K+    SL  + + F +++ +L   G K+ + ++   LL +LP  Y G+  AI+    ++LT++ V + L  + ++IK +  D   +++ A
Subjt:  YIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIK-YGRDSLTMSIVLDALKTRNLEIKKERKD--GELLMA

Query:  RGRSDKKNWKG--------KEKSSRMNSNREARKCFLC-HKGHFKKNC-----PLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRD
           ++   +K         K K     +++   KC  C  +GH KK+C      LN   + +  +      +  A +               V  V++  
Subjt:  RGRSDKKNWKG--------KEKSSRMNSNREARKCFLC-HKGHFKKNC-----PLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRD

Query:  IQD--AWIMDSGCTYHMTPNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRM-------LTNIRYVPELKRNLISLGELDKSGYTIKFEI
        + D   +++DSG +       D LIN +      V +       V   G  + AT  G++R+       L ++ +  E   NL+S+  L ++G +I+F+ 
Subjt:  IQD--AWIMDSGCTYHMTPNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRM-------LTNIRYVPELKRNLISLGELDKSGYTIKFEI

Query:  GIMKVTKGSL--VKLRGTLSN
          + ++K  L  VK  G L+N
Subjt:  GIMKVTKGSL--VKLRGTLSN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 2.8e-53 | 29.78
Query:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILD-EGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNK
        M+  ++EV+KFNG+  F+ W++++R +L+Q  + K+LD + + P  +   +  D++E A S I L+LSD+V+  + +  T   +W +LESLY++K+L+NK
Subjt:  MASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILD-EGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNK

Query:  IYIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARG
        +Y+K++ +   M +  +   +LN F  ++  L N+G K+ +E++AI+LLNSLP  Y  +   I +G+ ++ +  V  AL       KK    G+ L+  G
Subjt:  IYIKEKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARG

Query:  R-------SDKKNWKGKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMD
        R       S+     G    S+  S    R C+ C++ GHFK++CP  +  +  TS     D  N+A +    D+        E  M      +  W++D
Subjt:  R-------SDKKNWKGKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMD

Query:  SGCTYHMTPNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTL
        +  ++H TP  D    +   D G V + +    ++ G G + I T+ G   +L ++R+VP+L+ NLIS   LD+ GY   F     ++TKGSLV  +G  
Subjt:  SGCTYHMTPNWDFLINFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTL

Query:  SNGLYVLEGTAVSGSATKALKQQKQQIVDHVVTNIRIDGVQSSGKGSDIS
           LY        G    A  +    +    + ++   G+Q   K S IS
Subjt:  SNGLYVLEGTAVSGSATKALKQQKQQIVDHVVTNIRIDGVQSSGKGSDIS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.0e-10 | 22.47
Query:  DFALWRKKIRAILVQHKVAKILDEGRL--PANI-TESEKR---------DMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKIYIK
        ++ +W +++ A+   +++A  LD      PA I T++  R           +++ YS +L  +S  V   V  ATT A++W+ L  +Y   S  +   ++
Subjt:  DFALWRKKIRAILVQHKVAKILDEGRL--PANI-TESEKR---------DMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKIYIK

Query:  EKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGV--KAAIKYGRDSLT---------------------MSIVLDALKT
         +   +    +K++++ +         L  +G+ M  + Q   +L +LPE Y+ V  + A K    +LT                     + I  +A+  
Subjt:  EKFFGYKMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGV--KAAIKYGRDSLT---------------------MSIVLDALKT

Query:  RNLEIKKERKDGELLMARGRSDKKN-------WKGKEKSSRMNSNRE---ARKCFLCH-KGHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTET
        RN        +G       R D +N       W+    +   N+N+      KC +C  +GH  K C             +    + S+  +    S  T
Subjt:  RNLEIKKERKDGELLMARGRSDKKN-------WKGKEKSSRMNSNRE---ARKCFLCH-KGHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTET

Query:  GYESAEVLMVSHRDIQDAWIMDSGCTYHMTPNWDFLINFQKSDGG-KVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGEL-DKSG
         ++    L +      + W++DSG T+H+T +++ L   Q   GG  V+++D  T  +  TG   ++T    +  L NI YVP + +NLIS+  L + +G
Subjt:  GYESAEVLMVSHRDIQDAWIMDSGCTYHMTPNWDFLINFQKSDGG-KVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGEL-DKSG

Query:  YTIKF
         +++F
Subjt:  YTIKF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 5.3e-12 | 22.68
Query:  DFALWRKKIRAILVQHKVAKILDEGRLP-------------ANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKIYI
        ++ +W +++ A+   +++A  LD G  P              N   +  R  +++ YS IL  +S  V   V  ATT A++W+ L  +Y   S  +   +
Subjt:  DFALWRKKIRAILVQHKVAKILDEGRLP-------------ANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKIYI

Query:  K-------EKFFGYKMDQSKSLE---ENLNEFQKIVVD----------LNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRN
        +           G  MD  + +E   ENL +  K V+D          L  I E++ +    ++ LNS   +              +T ++V       N
Subjt:  K-------EKFFGYKMDQSKSLE---ENLNEFQKIVVD----------LNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRN

Query:  LEIKKERKDGELLMARGRSDKKN-WKGKEKSSRMNSNREAR----KCFLCH-KGHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEV
           + +   G+       +++ N W+     SR + NR+ +    +C +C  +GH  K CP     +++T++   T  +             T ++    
Subjt:  LEIKKERKDGELLMARGRSDKKN-WKGKEKSSRMNSNREAR----KCFLCH-KGHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEV

Query:  LMVSHRDIQDAWIMDSGCTYHMTPNWDFLINFQKSDGG-KVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGEL
        L V+     + W++DSG T+H+T +++ L   Q   GG  V+++D  T  +  TG   + T    +  L  + YVP + +NLIS+  L
Subjt:  LMVSHRDIQDAWIMDSGCTYHMTPNWDFLINFQKSDGG-KVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGEL

Arabidopsis top hits | e value | % identity
AT3G21000.1 Gag-Pol-related retrotransposon family protein | 5.5e-04 | 24.39
Query:  GKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHMTPNWDFLI
        G  K  R+ S  E + C LC+K  H +++C      +    E  +   Y                E+   L     D  D WI+      +MTP   +  
Subjt:  GKEKSSRMNSNREARKCFLCHK-GHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHMTPNWDFLI

Query:  NFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTI
           ++    V   D     V+G G V I   +G  + + N+ +VP L RN++S G++    Y+I
Subjt:  NFQKSDGGKVLLSDNGTCEVKGTGLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTI
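
The alignments above follow the usual BLAST-style pairwise layout, in which the unlabeled middle line carries a letter where the two residues are identical, `+` for a conservative substitution, and a blank for a mismatch or gap. As a rough sketch of how a % identity figure relates to that middle line, the hypothetical function `percent_identity` below counts the letters in the midline and divides by the number of alignment columns. It is applied here to a single short segment only, so its result is not expected to reproduce the whole-alignment values in the tables.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline shows an identical residue.

    `query` is the aligned query segment (gaps included) and `midline` is the
    line printed between query and subject. Hypothetical helper for
    illustration; not part of any BLAST library.
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment segment")
    # Letters mark identities; '+' and blanks are not counted.
    identical = sum(ch.isalpha() for ch in midline[:columns])
    return 100.0 * identical / columns


# Final segment of the KAA0047995.1 alignment shown above.
segment_identity = percent_identity("GTAVSGSATKA", "GT VSGSA  A")
print(round(segment_identity, 2))
# 72.73
```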


Sequences
CDS sequence
ATGATGGCTTCAACACGATTTGAAGTGTCTAAGTTTAATGGGAATGGAGATTTCGCCCTTTGGAGAAAAAAGATTAGAGCTATTTTGGTTCAACATAAAGTAGCAAAGAT
CTTAGATGAAGGGAGACTTCCAGCAAATATTACAGAAAGTGAAAAACGAGATATGAATGAAATGGCCTATTCAACTATTCTTCTGTATCTGTCAGATGAAGTTCTTAGGC
TAGTAGATGAGGCTACTACTACAGCGGAGTTGTGGAAAAAGCTAGAGAGCCTTTACTTGACAAAGTCCTTGTCAAATAAGATATATATAAAGGAGAAATTCTTTGGATAT
AAAATGGATCAAAGTAAAAGTTTAGAAGAGAACCTGAATGAATTTCAGAAGATTGTAGTTGATCTTAATAACATCGGTGAGAAGATGTCGGATGAGAATCAAGCAATTAT
TCTTCTAAATTCATTACCAGAAATATATCGAGGGGTTAAGGCGGCTATTAAATATGGTCGAGATTCATTGACCATGAGTATAGTGTTGGATGCCTTGAAGACTAGAAATC
TCGAAATTAAGAAAGAACGCAAGGATGGAGAGTTACTTATGGCTAGAGGAAGAAGTGACAAAAAGAACTGGAAAGGCAAAGAGAAGAGTTCCAGGATGAATTCAAATAGG
GAAGCTAGAAAGTGTTTCCTTTGCCATAAAGGACACTTTAAGAAAAATTGCCCCTTGAATAAGAGCAGAGAAGCATCAACTAGTGAAGCTAATGTTACTGATGGGTATAA
TTCAGCAAAGATTACTGATGGGTATGATTCAACAGAGACTGGGTATGAGTCTGCAGAGGTCTTGATGGTGTCTCATAGAGATATACAGGATGCTTGGATCATGGATTCAG
GGTGCACGTATCACATGACTCCTAATTGGGATTTTTTGATTAACTTTCAGAAAAGTGATGGGGGAAAAGTCTTATTGAGTGATAATGGTACCTGTGAAGTAAAGGGAACT
GGTTTAGTTCTAATTGCAACACATGACGGGATGATCAGAATGCTTACAAATATAAGGTATGTTCCAGAACTCAAACGTAATCTAATATCTCTAGGTGAATTAGATAAATC
AGGTTATACCATAAAATTTGAGATTGGAATTATGAAAGTTACCAAGGGTTCTTTGGTTAAATTGAGGGGAACATTGAGCAATGGTCTGTATGTGTTGGAGGGTACTGCAG
TTTCAGGTAGTGCCACTAAGGCATTAAAGCAACAGAAACAACAGATAGTTGATCATGTTGTGACAAATATTAGAATTGATGGTGTACAGTCATCAGGCAAAGGTTCAGAT
ATATCTAGTGATCAATCACCACTAGTTTCACAAATAGAGGCTACAGAGCAGTCTGAATTTGATGGTGTACAGTCTGAACAGGAGAGGACTTTGATTGATGAGTGA
mRNA sequence
ATGATGGCTTCAACACGATTTGAAGTGTCTAAGTTTAATGGGAATGGAGATTTCGCCCTTTGGAGAAAAAAGATTAGAGCTATTTTGGTTCAACATAAAGTAGCAAAGAT
CTTAGATGAAGGGAGACTTCCAGCAAATATTACAGAAAGTGAAAAACGAGATATGAATGAAATGGCCTATTCAACTATTCTTCTGTATCTGTCAGATGAAGTTCTTAGGC
TAGTAGATGAGGCTACTACTACAGCGGAGTTGTGGAAAAAGCTAGAGAGCCTTTACTTGACAAAGTCCTTGTCAAATAAGATATATATAAAGGAGAAATTCTTTGGATAT
AAAATGGATCAAAGTAAAAGTTTAGAAGAGAACCTGAATGAATTTCAGAAGATTGTAGTTGATCTTAATAACATCGGTGAGAAGATGTCGGATGAGAATCAAGCAATTAT
TCTTCTAAATTCATTACCAGAAATATATCGAGGGGTTAAGGCGGCTATTAAATATGGTCGAGATTCATTGACCATGAGTATAGTGTTGGATGCCTTGAAGACTAGAAATC
TCGAAATTAAGAAAGAACGCAAGGATGGAGAGTTACTTATGGCTAGAGGAAGAAGTGACAAAAAGAACTGGAAAGGCAAAGAGAAGAGTTCCAGGATGAATTCAAATAGG
GAAGCTAGAAAGTGTTTCCTTTGCCATAAAGGACACTTTAAGAAAAATTGCCCCTTGAATAAGAGCAGAGAAGCATCAACTAGTGAAGCTAATGTTACTGATGGGTATAA
TTCAGCAAAGATTACTGATGGGTATGATTCAACAGAGACTGGGTATGAGTCTGCAGAGGTCTTGATGGTGTCTCATAGAGATATACAGGATGCTTGGATCATGGATTCAG
GGTGCACGTATCACATGACTCCTAATTGGGATTTTTTGATTAACTTTCAGAAAAGTGATGGGGGAAAAGTCTTATTGAGTGATAATGGTACCTGTGAAGTAAAGGGAACT
GGTTTAGTTCTAATTGCAACACATGACGGGATGATCAGAATGCTTACAAATATAAGGTATGTTCCAGAACTCAAACGTAATCTAATATCTCTAGGTGAATTAGATAAATC
AGGTTATACCATAAAATTTGAGATTGGAATTATGAAAGTTACCAAGGGTTCTTTGGTTAAATTGAGGGGAACATTGAGCAATGGTCTGTATGTGTTGGAGGGTACTGCAG
TTTCAGGTAGTGCCACTAAGGCATTAAAGCAACAGAAACAACAGATAGTTGATCATGTTGTGACAAATATTAGAATTGATGGTGTACAGTCATCAGGCAAAGGTTCAGAT
ATATCTAGTGATCAATCACCACTAGTTTCACAAATAGAGGCTACAGAGCAGTCTGAATTTGATGGTGTACAGTCTGAACAGGAGAGGACTTTGATTGATGAGTGA
Protein sequence
MMASTRFEVSKFNGNGDFALWRKKIRAILVQHKVAKILDEGRLPANITESEKRDMNEMAYSTILLYLSDEVLRLVDEATTTAELWKKLESLYLTKSLSNKIYIKEKFFGY
KMDQSKSLEENLNEFQKIVVDLNNIGEKMSDENQAIILLNSLPEIYRGVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGRSDKKNWKGKEKSSRMNSNR
EARKCFLCHKGHFKKNCPLNKSREASTSEANVTDGYNSAKITDGYDSTETGYESAEVLMVSHRDIQDAWIMDSGCTYHMTPNWDFLINFQKSDGGKVLLSDNGTCEVKGT
GLVLIATHDGMIRMLTNIRYVPELKRNLISLGELDKSGYTIKFEIGIMKVTKGSLVKLRGTLSNGLYVLEGTAVSGSATKALKQQKQQIVDHVVTNIRIDGVQSSGKGSD
ISSDQSPLVSQIEATEQSEFDGVQSEQERTLIDE
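
The protein sequence corresponds to the CDS read codon by codon. A minimal sketch of that relationship is given below, assuming the standard genetic code (NCBI translation table 1); the page does not say which table was used, and the names `CODON_TABLE` and `translate` are hypothetical. The example translates only the first 27 and last 18 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGATGGCTTCAACACGATTTGAAGTG"))  # start of the CDS
# MMASTRFEV
print(translate("ACTTTGATTGATGAGTGA"))           # end of the CDS, incl. stop
# TLIDE
```

Both fragments agree with the start (`MMASTRFEV`) and end (`TLIDE`) of the protein sequence listed above.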