; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0027403 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0027403
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr01:5525357..5529950
RNA-Seq ExpressionPI0027403
SyntenyPI0027403
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]3.0e-20796.13Show/hide
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRTNTSSASTVPLQSNPLYEDWIAKDQALMTVIN
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGF+DGTNPC P+T+ S+ STVP Q+NPLYEDWIAKDQALMTVIN
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRTNTSSASTVPLQSNPLYEDWIAKDQALMTVIN

Query:  ATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSMR
        ATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVST INEEDLLIYALNGLPNEYNTFRTSMR
Subjt:  ATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSMR

Query:  TRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQSVHDNHTTCQICS
        TRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHG SQEQ+ VHDNH TCQICS
Subjt:  TRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQSVHDNHTTCQICS

Query:  RRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSG
        RRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVG+GQTRPISHSG
Subjt:  RRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSG

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]1.3e-20595.63Show/hide
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRT-NTSSASTVPLQSNPLYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GFIDGTNPCPPRT N+SS STVP QSNP YEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRT-NTSSASTVPLQSNPLYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVST INEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQSVHDNHTTCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLLSSSQSLLSCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHGSS EQ+SVHDNH TCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQSVHDNHTTCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSG
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDMNYVSLAPEYNGEEQVG+G+GQTRP+SHSG
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSG

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]1.5e-21195.5Show/hide
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRT-NTSSASTVPLQSNPLYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GFIDGTNPCPPRT N+SS STVP QSNP YEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRT-NTSSASTVPLQSNPLYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVST INEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQSVHDNHTTCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLLSSSQSLLSCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHGSS EQ+SVHDNH TCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQSVHDNHTTCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSGQVFGKNFVPKA
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDMNYVSLAPEYNGEEQVG+G+GQTRP+SHSGQVFGK FVPKA
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSGQVFGKNFVPKA

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]4.8e-21395.99Show/hide
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRTNTSSASTVPLQSNPLYEDWIAKDQALMTVIN
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGF+DGTNPC P+T+ S+ STVP Q+NPLYEDWIAKDQALMTVIN
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRTNTSSASTVPLQSNPLYEDWIAKDQALMTVIN

Query:  ATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSMR
        ATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVST INEEDLLIYALNGLPNEYNTFRTSMR
Subjt:  ATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSMR

Query:  TRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQSVHDNHTTCQICS
        TRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHG SQEQ+ VHDNH TCQICS
Subjt:  TRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQSVHDNHTTCQICS

Query:  RRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSGQVFGKNFVPKA
        RRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVG+GQTRPISHSGQVFGK FVPKA
Subjt:  RRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSGQVFGKNFVPKA

XP_016900446.1 PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo]1.3e-20595.63Show/hide
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRT-NTSSASTVPLQSNPLYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GFIDGTNPCPPRT N+SS STVP QSNP YEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRT-NTSSASTVPLQSNPLYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVST INEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQSVHDNHTTCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLLSSSQSLLSCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHGSS EQ+SVHDNH TCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQSVHDNHTTCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSG
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDMNYVSLAPEYNGEEQVG+G+GQTRP+SHSG
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSG

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X26.1e-20695.63Show/hide
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRT-NTSSASTVPLQSNPLYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GFIDGTNPCPPRT N+SS STVP QSNP YEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRT-NTSSASTVPLQSNPLYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVST INEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQSVHDNHTTCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLLSSSQSLLSCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHGSS EQ+SVHDNH TCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQSVHDNHTTCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSG
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDMNYVSLAPEYNGEEQVG+G+GQTRP+SHSG
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSG

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X37.5e-21295.5Show/hide
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRT-NTSSASTVPLQSNPLYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GFIDGTNPCPPRT N+SS STVP QSNP YEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRT-NTSSASTVPLQSNPLYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVST INEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQSVHDNHTTCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLLSSSQSLLSCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHGSS EQ+SVHDNH TCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQSVHDNHTTCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSGQVFGKNFVPKA
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDMNYVSLAPEYNGEEQVG+G+GQTRP+SHSGQVFGK FVPKA
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSGQVFGKNFVPKA

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X16.1e-20695.63Show/hide
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRT-NTSSASTVPLQSNPLYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GFIDGTNPCPPRT N+SS STVP QSNP YEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRT-NTSSASTVPLQSNPLYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVST INEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQSVHDNHTTCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLLSSSQSLLSCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHGSS EQ+SVHDNH TCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQSVHDNHTTCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSG
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDMNYVSLAPEYNGEEQVG+G+GQTRP+SHSG
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSG

A0A5D3CLI6 T4.52.3e-20595.62Show/hide
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRT-NTSSASTVPLQSNPLYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GFIDGTNPCPPRT N+SS STVP QSNP YEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRT-NTSSASTVPLQSNPLYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVST INEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQSVHDNHTTCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLLSSSQSLLSCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHGSS EQ+SVHDNH TCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQSVHDNHTTCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHS
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDMNYVSLAPEYNGEEQVG+G+GQTRP+SHS
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHS

A0A6J1D9L6 uncharacterized protein LOC1110188922.1e-12160Show/hide
Query:  TLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRTNTSSASTVPLQS--------NPLYEDWIAKDQALMT
        T  S++ +KD  SPIFLLSNICNL+S+RLDST+F+LWKFQLTAILKAHKLFGFIDG+   P +   SS+ T    +        NP +EDWIAKDQALMT
Subjt:  TLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRTNTSSASTVPLQS--------NPLYEDWIAKDQALMT

Query:  VINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRT
        +INATLS EALAYVV S +SKQVW+VL K YSS SR+NVVNLKSDLQ+I KK +ESIDAY+KRIKEIKDK ANVS  IN+E LLIYALNGL  EYNT  T
Subjt:  VINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRT

Query:  SMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRF----SFDAQTRGHGSSQEQQSVH-DN
        SMRTR+Q V+FEELHV +++EESA+ KQ K +D   QP  L +SS    +    F+ N     G GKN G G+     +F  Q RG  S     S   DN
Subjt:  SMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRF----SFDAQTRGHGSSQEQQSVH-DN

Query:  HTTCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSS---LTDSGCNTHITSD---MNYVSLAPEYNGEEQVGVGSGQTRPISH
         + CQIC + GHTALDC+NRMN++FQGRHPP QLAAMVA QNN++L++ NSS    L DS CNTH+T+D   ++  S+A +YNGEE + VGSGQ+ PI+H
Subjt:  HTTCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSS---LTDSGCNTHITSD---MNYVSLAPEYNGEEQVGVGSGQTRPISH

Query:  --SGQVFGKNFVPKA
           GQVFG N+VP+A
Subjt:  --SGQVFGKNFVPKA

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.6e-3028.18Show/hide
Query:  RLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRTNTSSASTVPLQSNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGS
        +L STN+++W  Q+ A+   ++L GF+DG+   PP T  + A+    + NP Y  W  +D+ + + +   +S      V  +T++ Q+W+ L K+Y++ S
Subjt:  RLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRTNTSSASTVPLQSNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGS

Query:  RSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSY
          +V  L++ L+  + K  ++ID Y++ +    D+LA +   ++ ++ +   L  LP EY      +  +  P T  E+H  L   ES +   S      
Subjt:  RSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSY

Query:  NQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQS---VHDNHT-------TCQICSRRGHTALDCFNRMNY--NFQGR
           TV+  ++ ++     T  NN   GN + +        +D +   + S   QQS    H N+         CQIC  +GH+A  C    ++  +   +
Subjt:  NQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQS---VHDNHT-------TCQICSRRGHTALDCFNRMNY--NFQGR

Query:  HPPQQLAAMVASQNNAFLSIVNSSS-LTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSG
         PP          N A  S  +S++ L DSG   HITSD N +SL   Y G + V V  G T PISH+G
Subjt:  HPPQQLAAMVASQNNAFLSIVNSSS-LTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.9e-2426.8Show/hide
Query:  RLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRTNTSSASTVPLQSNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGS
        +L STN+++W  Q+ A+   ++L GF+DG+ P PP T  + A  VP + NP Y  W  +D+ + + I   +S      V  +T++ Q+W+ L K+Y++ S
Subjt:  RLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRTNTSSASTVPLQSNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGS

Query:  RSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSY
          +V  L+                +I R     D+LA +   ++ ++ +   L  LP++Y      +  +  P +  E+H  L   ES L   +  +   
Subjt:  RSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSY

Query:  NQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDA-QTRGHGSSQEQQSVHDNHTTCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAMVA
             ++  + ++++   T  N      G  +NY +     ++ Q    GS  + +        CQICS +GH+A  C     + FQ     QQ  +   
Subjt:  NQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDA-QTRGHGSSQEQQSVHDNHTTCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAMVA

Query:  ----SQNNAFLSIVNSSS-LTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSG
              N A  S  N+++ L DSG   HITSD N +S    Y G + V +  G T PI+H+G
Subjt:  ----SQNNAFLSIVNSSS-LTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSG

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.7e-0622.41Show/hide
Query:  TLPSSSAEKDSLSPIFLLSNI-----CNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRTNTSSASTVPLQSNPLYEDWIAKDQALMTVIN
        T+ S S   D  SP +L  +I      ++  +  D  N+V WK +  + L+  K FGFIDGT P P               +PLY+ W   +  +M  + 
Subjt:  TLPSSSAEKDSLSPIFLLSNI-----CNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRTNTSSASTVPLQSNPLYEDWIAKDQALMTVIN

Query:  ATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTL
         +++ + L  V+ + ++ ++W+ L +++       +  L+  L T+ ++  +S++ Y  ++ ++  +L+  + +
Subjt:  ATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTL

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)6.6e-1125.29Show/hide
Query:  IFLLSNICNLISMRLD--STNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRTNTSSASTVPLQSNPLYEDWIAKDQALMTVINATLSPEAL-AYVVGSTS
        I+ +SNI + I + LD   +N+  W+        +  + G IDGT              +P  +N +  +W  +D  +   +  TL+P+      V S++
Subjt:  IFLLSNICNLISMRLD--STNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRTNTSSASTVPLQSNPLYEDWIAKDQALMTVINATLSPEAL-AYVVGSTS

Query:  SKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLR
        S+ +W  +   + +   +  + L S+L+T     D  +  Y +++K++ D L NV   + + +L++Y LNGL  +++     ++ R    +F++   +L+
Subjt:  SKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLR

Query:  AEESALAKQSKCDDSYNQPTVLLSSSQSLLSC--APTFNNNFVRGNGHGKNY-GHGR
         EE  L +  K + ++    V  SSS ++L+C  AP    NF R  G+   Y G GR
Subjt:  AEESALAKQSKCDDSYNQPTVLLSSSQSLLSC--APTFNNNFVRGNGHGKNY-GHGR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCCTCAACTACCTTGCCTTCTTCTTCGGCAGAGAAAGACTCACTTTCACCAATTTTTCTACTGTCCAACATTTGTAACCTGATTTCAATGAGGCTTGACTCTAC
AAATTTTGTCCTTTGGAAGTTCCAACTGACAGCGATTTTGAAAGCTCATAAGCTTTTTGGCTTTATTGATGGTACTAATCCATGTCCTCCTCGGACTAATACCTCTTCTG
CCTCGACCGTTCCACTTCAATCAAATCCTTTATATGAAGATTGGATTGCTAAGGATCAGGCTCTTATGACAGTCATAAATGCTACACTTTCACCTGAGGCTTTGGCATAT
GTTGTTGGAAGCACTTCTTCCAAACAGGTTTGGGATGTTCTTGCAAAGCTTTATTCTTCTGGTTCCCGGTCTAATGTGGTGAATTTGAAGTCCGATTTGCAAACTATTTA
CAAGAAGCCTGATGAATCGATTGATGCCTATATTAAACGGATTAAGGAGATCAAGGATAAACTTGCTAATGTTTCTACTTTGATCAATGAAGAGGATCTTCTTATCTATG
CTTTAAATGGCCTTCCAAATGAGTATAACACTTTTCGAACGTCAATGCGTACACGTTCTCAACCTGTTACTTTTGAAGAACTTCATGTTCTTCTAAGAGCTGAGGAATCG
GCTCTTGCGAAACAATCTAAGTGTGATGATTCGTATAATCAACCAACTGTTTTACTCTCTTCTTCCCAATCTCTCCTGTCTTGTGCTCCTACTTTCAATAACAACTTTGT
TCGAGGCAATGGACATGGTAAAAATTATGGACATGGGCGTTTTTCTTTTGATGCTCAAACTCGTGGTCATGGTTCTTCCCAAGAACAACAGTCTGTTCATGATAATCATA
CAACTTGTCAGATTTGTTCACGTCGTGGCCACACTGCACTCGATTGTTTCAATCGCATGAACTATAATTTTCAAGGACGTCATCCTCCACAACAGCTTGCTGCCATGGTT
GCATCACAGAATAATGCATTTCTATCTATTGTGAATTCGTCTTCTTTGACCGACTCGGGTTGCAACACTCATATTACTTCAGATATGAATTATGTTTCTCTTGCACCGGA
ATATAATGGTGAAGAACAAGTTGGTGTTGGTAGTGGACAGACTCGGCCTATTTCTCACTCAGGGCAAGTTTTCGGGAAAAATTTTGTTCCAAAGGCCTAG
mRNA sequenceShow/hide mRNA sequence
TCAGAGCCTAACATTTGTTTCTTTTTTTCTTCACTCCTCGTTTCTTTGTTCGAAGTGTTTGAACTTCTTTCTTTGTGAAACTGAAGTTTCAAGTCTCTCTTGATCGGAGA
GCTTCTTTTGCCAAGTGTATCCGATGAGTTCCTCAACTACCTTGCCTTCTTCTTCGGCAGAGAAAGACTCACTTTCACCAATTTTTCTACTGTCCAACATTTGTAACCTG
ATTTCAATGAGGCTTGACTCTACAAATTTTGTCCTTTGGAAGTTCCAACTGACAGCGATTTTGAAAGCTCATAAGCTTTTTGGCTTTATTGATGGTACTAATCCATGTCC
TCCTCGGACTAATACCTCTTCTGCCTCGACCGTTCCACTTCAATCAAATCCTTTATATGAAGATTGGATTGCTAAGGATCAGGCTCTTATGACAGTCATAAATGCTACAC
TTTCACCTGAGGCTTTGGCATATGTTGTTGGAAGCACTTCTTCCAAACAGGTTTGGGATGTTCTTGCAAAGCTTTATTCTTCTGGTTCCCGGTCTAATGTGGTGAATTTG
AAGTCCGATTTGCAAACTATTTACAAGAAGCCTGATGAATCGATTGATGCCTATATTAAACGGATTAAGGAGATCAAGGATAAACTTGCTAATGTTTCTACTTTGATCAA
TGAAGAGGATCTTCTTATCTATGCTTTAAATGGCCTTCCAAATGAGTATAACACTTTTCGAACGTCAATGCGTACACGTTCTCAACCTGTTACTTTTGAAGAACTTCATG
TTCTTCTAAGAGCTGAGGAATCGGCTCTTGCGAAACAATCTAAGTGTGATGATTCGTATAATCAACCAACTGTTTTACTCTCTTCTTCCCAATCTCTCCTGTCTTGTGCT
CCTACTTTCAATAACAACTTTGTTCGAGGCAATGGACATGGTAAAAATTATGGACATGGGCGTTTTTCTTTTGATGCTCAAACTCGTGGTCATGGTTCTTCCCAAGAACA
ACAGTCTGTTCATGATAATCATACAACTTGTCAGATTTGTTCACGTCGTGGCCACACTGCACTCGATTGTTTCAATCGCATGAACTATAATTTTCAAGGACGTCATCCTC
CACAACAGCTTGCTGCCATGGTTGCATCACAGAATAATGCATTTCTATCTATTGTGAATTCGTCTTCTTTGACCGACTCGGGTTGCAACACTCATATTACTTCAGATATG
AATTATGTTTCTCTTGCACCGGAATATAATGGTGAAGAACAAGTTGGTGTTGGTAGTGGACAGACTCGGCCTATTTCTCACTCAGGGCAAGTTTTCGGGAAAAATTTTGT
TCCAAAGGCCTAGCATTGATGATCTGATCACTTCTAAGGCTGTGGCTGCTTCTAGTTTAGCTTCCACCAGTCTGTTGTTCGACTGTTGCTTATGTTGCTGACAAGTCCTC
TTTTGCTCGTATTGCTGTTCTTCTTAAGTTTGTTGTTTTCTTGTTGCATTTGCCATATACTTCAAATAAAGGAATTGTGATGCCACCCATTGGATAATCATATCCAAATT
CTTCTTCAGCTTGACTCAACAAATCTTGAAACAAAGGTTGGTTCAAGCAAGATAGCGGGATGACAAAACGCTTCTTCTGTTCTTCTCCCACATAGACTGTAAAGCATCCT
TTCGGAACATCAAGAGACTTTGGAGTGGCTCTATTTCCTGACGATGTGGATCGTCGAAGACTTGGCTTAGAGTGAACAATACTAGGCAAACGAAACCCCATGGTATAGTT
TTTTTCTTTCCAAGGAATTAATGAAACTTTTAACTTGAAGGATTGTCAAAGGAGATTTCTTGTGATGTGGGAATGTG
Protein sequenceShow/hide protein sequence
MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFIDGTNPCPPRTNTSSASTVPLQSNPLYEDWIAKDQALMTVINATLSPEALAY
VVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTLINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEES
ALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGSSQEQQSVHDNHTTCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAMV
ASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGSGQTRPISHSGQVFGKNFVPKA