CuGenDBv2

Cucsat.G10935 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G10935
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Retrotran_gag_3 domain-containing protein
Genome location: ctg1681:2302744..2305242
RNA-Seq Expression: Cucsat.G10935
Synteny: Cucsat.G10935
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: IPR029472 - Retrotransposon Copia-like, N-terminal


Homology
GenBank top hits | e value | %identity | Alignment
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus] | 2.39e-277 | 98.23
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINA
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINA
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINA

Query:  TLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRT
        TLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRT
Subjt:  TLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRT

Query:  RSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR
        RSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSL+SCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR
Subjt:  RSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR

Query:  RGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSGSDTFEPSS
        RGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSG    E +S
Subjt:  RGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSGSDTFEPSS

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo] | 7.33e-262 | 93.45
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+DGTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLLSSSQSL+SCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSGSDTFEPSS
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDMNYVSLAPEYNGEEQVG+GNGQTRP+SHSG   FE +S
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSGSDTFEPSS

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo] | 6.66e-261 | 94.6
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+DGTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLLSSSQSL+SCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSG
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDMNYVSLAPEYNGEEQVG+GNGQTRP+SHSG
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSG

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus] | 3.07e-277 | 99.74
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINA
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINA
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINA

Query:  TLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRT
        TLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRT
Subjt:  TLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRT

Query:  RSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR
        RSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSL+SCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR
Subjt:  RSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSR

Query:  RGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSG
        RGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSG
Subjt:  RGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSG

XP_016900446.1 PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo] | 1.11e-261 | 93.45
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+DGTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLLSSSQSL+SCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSGSDTFEPSS
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDMNYVSLAPEYNGEEQVG+GNGQTRP+SHSG   FE +S
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSGSDTFEPSS

TrEMBL top hits | e value | %identity | Alignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X2 | 3.55e-262 | 93.45
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+DGTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLLSSSQSL+SCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSGSDTFEPSS
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDMNYVSLAPEYNGEEQVG+GNGQTRP+SHSG   FE +S
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSGSDTFEPSS

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X3 | 3.23e-261 | 94.6
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+DGTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLLSSSQSL+SCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSG
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDMNYVSLAPEYNGEEQVG+GNGQTRP+SHSG
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSG

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X1 | 5.36e-262 | 93.45
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+DGTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLLSSSQSL+SCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSGSDTFEPSS
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDMNYVSLAPEYNGEEQVG+GNGQTRP+SHSG   FE +S
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSGSDTFEPSS

A0A5D3CLI6 T4.5 | 4.31e-259 | 94.59
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+DGTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQ--TSPSTTSTVPPQTNPLYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLLSSSQSL+SCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHS
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDMNYVSLAPEYNGEEQVG+GNGQTRP+SHS
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHS

A0A6J1D9L6 uncharacterized protein LOC111018892 | 1.28e-146 | 57.77
Query:  TLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCP----------QTSPSTTSTVPPQTNPLYEDWIAKDQALM
        T  S++ +KD  SPIFLLSNICNL+S+RLDST+F+LWKFQLTAILKAHKLFGF+DG+   P          ++ P+TT+++P   NP +EDWIAKDQALM
Subjt:  TLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCP----------QTSPSTTSTVPPQTNPLYEDWIAKDQALM

Query:  TVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFR
        T+INATLS EALAYVV S +SKQVW+VL K YSS SR+NVVNLKSDLQ+I KK +ESIDAY+KRIKEIKDK ANVS  IN+E LLIYALNGL  EYNT  
Subjt:  TVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFR

Query:  TSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVH-----D
        TSMRTR+Q V+FEELHV +++EESA+ KQ K +D   QP  L +SS    +    F+ N     G GKN G G+ +F       G  +           D
Subjt:  TSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVH-----D

Query:  NHATCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLT---DSGCNTHITSDMNYVSLAP---EYNGEEQVGVGNGQTRPIS
        N + CQIC + GHTALDC+NRMN++FQGRHPP QLAAMVA QNN++L++ NSS  T   DS CNTH+T+D++ +S+A    +YNGEE + VG+GQ+ PI+
Subjt:  NHATCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLT---DSGCNTHITSDMNYVSLAP---EYNGEEQVGVGNGQTRPIS

Query:  HSGSDTFEPSSY
        H G      S+Y
Subjt:  HSGSDTFEPSSY

SwissProt top hits | e value | %identity | Alignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.8e-32 | 28.17
Query:  RLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTST-VPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGS
        +L STN+++W  Q+ A+   ++L GF+DG+   P   P+T  T   P+ NP Y  W  +D+ + + +   +S      V  +T++ Q+W+ L K+Y++ S
Subjt:  RLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTST-VPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGS

Query:  RSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSY
          +V  L++ L+  + K  ++ID Y++ +    D+LA +   ++ ++ +   L  LP EY      +  +  P T  E+H  L   ES +   S      
Subjt:  RSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSY

Query:  NQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGK-----NYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSRRGHTALDCFNRMNY--NFQGRHPPQQ
           TV+  ++ ++     T  NN   GN + +     N  + +    + T  H  + + KP       CQIC  +GH+A  C    ++  +   + PP  
Subjt:  NQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGK-----NYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSRRGHTALDCFNRMNY--NFQGRHPPQQ

Query:  LAAMVASQNNAFLSIVNSSS-LTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSGSDTFEPSSY-FSLSNLVFVLNIISSFLFVH
                N A  S  +S++ L DSG   HITSD N +SL   Y G + V V +G T PISH+GS +    S   +L N+++V NI  + + V+
Subjt:  LAAMVASQNNAFLSIVNSSS-LTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSGSDTFEPSSY-FSLSNLVFVLNIISSFLFVH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 1.7e-25 | 26.28
Query:  RLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTST-VPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGS
        +L STN+++W  Q+ A+   ++L GF+DG+ P P   P+T  T   P+ NP Y  W  +D+ + + I   +S      V  +T++ Q+W+ L K+Y++ S
Subjt:  RLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTST-VPPQTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGS

Query:  RSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSY
          +V  L+                +I R     D+LA +   ++ ++ +   L  LP++Y      +  +  P +  E+H  L   ES L   +  +   
Subjt:  RSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKCDDSY

Query:  NQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDA-QTRGHGLSQEQKPVHDNHATCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAMVA
             ++  + ++++   T  N      G  +NY +     ++ Q    G   + +        CQICS +GH+A  C     + FQ     QQ  +   
Subjt:  NQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDA-QTRGHGLSQEQKPVHDNHATCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAMVA

Query:  ----SQNNAFLSIVNSSS-LTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSGSDTFEPSS-YFSLSNLVFVLNIISSFLFVH
              N A  S  N+++ L DSG   HITSD N +S    Y G + V + +G T PI+H+GS +   SS    L+ +++V NI  + + V+
Subjt:  ----SQNNAFLSIVNSSS-LTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSGSDTFEPSS-YFSLSNLVFVLNIISSFLFVH

Arabidopsis top hits | e value | %identity | Alignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | 1.8e-06 | 22.7
Query:  TLPSSSAEKDSLSPIFLLSNI-----CNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINA
        T+ S S   D  SP +L  +I      ++  +  D  N+V WK +  + L+  K FGF+DGT P            P   +PLY+ W   +  +M  +  
Subjt:  TLPSSSAEKDSLSPIFLLSNI-----CNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINA

Query:  TLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEI
        +++ + L  V+ + ++ ++W+ L +++       +  L+  L T+ ++  +S++ Y  ++ ++
Subjt:  TLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEI

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 9.0e-11 | 24.1
Query:  IFLLSNICNLISMRLD--STNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEAL-AYVVGSTSS
        I+ +SNI + I + LD   +N+  W+        +  + G +DGT             +P   N +  +W  +D  +   +  TL+P+      V S++S
Subjt:  IFLLSNICNLISMRLD--STNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEAL-AYVVGSTSS

Query:  KQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRA
        + +W  +   + +   +  + L S+L+T     D  +  Y +++K++ D L NV   + + +L++Y LNGL  +++     ++ R    +F++   +L+ 
Subjt:  KQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRA

Query:  EESALAKQSKCDDSYNQPTVLLSSSQSLMSC--APTFNNNFVRGNGHGKNY-GHGRFSFDAQTRGHGLSQEQKPVHDN
        EE  L +  K + ++    V  SSS ++++C  AP    NF R  G+   Y G GR +   + RG   S    P  ++
Subjt:  EESALAKQSKCDDSYNQPTVLLSSSQSLMSC--APTFNNNFVRGNGHGKNY-GHGRFSFDAQTRGHGLSQEQKPVHDN


Sequences
CDS sequence
ATGAGTTCCTCAACTACCTTGCCTTCTTCTTCAGCTGAGAAAGACTCACTTTCACCAATTTTTCTACTGTCCAACATTTGTAACCTGATTTCAATGAGGCTTGACTCTAC
AAATTTTGTCCTTTGGAAGTTCCAATTGACAGCGATTTTGAAAGCTCATAAACTTTTTGGCTTTGTTGATGGTACTAATCCATGTCCTCAGACTAGTCCGTCTACTACCT
CGACCGTTCCGCCTCAAACGAATCCTTTATATGAAGATTGGATTGCTAAGGATCAGGCTCTTATGACAGTCATAAATGCTACACTTTCACCTGAGGCTTTGGCATATGTT
GTTGGAAGCACTTCTTCCAAACAGGTTTGGGATGTTCTTGCAAAGCTTTATTCTTCTGGTTCCCGGTCTAATGTGGTGAATTTGAAGTCCGATTTGCAAACTATTTACAA
GAAGCCTGATGAATCTATTGATGCCTATATTAAACGGATTAAGGAGATCAAGGATAAACTTGCTAATGTTTCTACTTTTATCAATGAAGAGGATCTTCTTATCTATGCTT
TAAATGGCCTTCCAAATGAGTACAACACTTTCCGAACGTCAATGCGTACACGTTCTCAACCTGTTACTTTTGAAGAACTTCATGTTCTTCTAAGAGCTGAGGAATCGGCT
CTTGCAAAACAATCTAAGTGTGATGATTCGTATAATCAACCGACTGTTTTACTCTCTTCTTCTCAATCTCTCATGTCATGTGCTCCTACTTTCAATAACAACTTTGTTCG
AGGCAACGGACATGGTAAAAATTATGGACATGGACGTTTTTCTTTCGATGCTCAAACTCGTGGTCATGGTTTGTCTCAAGAACAAAAGCCCGTTCATGATAATCATGCAA
CTTGTCAGATTTGTTCACGTCGTGGCCACACTGCACTCGATTGTTTCAATCGCATGAACTATAATTTTCAAGGACGTCATCCTCCACAACAACTTGCTGCAATGGTTGCA
TCGCAAAATAATGCATTTCTATCTATTGTGAATTCGTCTTCTTTGACCGATTCGGGTTGCAACACTCATATTACTTCAGACATGAATTATGTTTCTCTTGCACCTGAATA
TAATGGTGAAGAACAAGTTGGTGTTGGTAATGGACAGACTCGGCCTATTTCTCACTCAGGTTCTGATACTTTTGAACCTTCTTCCTATTTCTCTCTATCTAATCTTGTTT
TTGTTCTTAATATCATTTCTAGTTTCCTTTTTGTTCATTAA
mRNA sequence
ATGAGTTCCTCAACTACCTTGCCTTCTTCTTCAGCTGAGAAAGACTCACTTTCACCAATTTTTCTACTGTCCAACATTTGTAACCTGATTTCAATGAGGCTTGACTCTAC
AAATTTTGTCCTTTGGAAGTTCCAATTGACAGCGATTTTGAAAGCTCATAAACTTTTTGGCTTTGTTGATGGTACTAATCCATGTCCTCAGACTAGTCCGTCTACTACCT
CGACCGTTCCGCCTCAAACGAATCCTTTATATGAAGATTGGATTGCTAAGGATCAGGCTCTTATGACAGTCATAAATGCTACACTTTCACCTGAGGCTTTGGCATATGTT
GTTGGAAGCACTTCTTCCAAACAGGTTTGGGATGTTCTTGCAAAGCTTTATTCTTCTGGTTCCCGGTCTAATGTGGTGAATTTGAAGTCCGATTTGCAAACTATTTACAA
GAAGCCTGATGAATCTATTGATGCCTATATTAAACGGATTAAGGAGATCAAGGATAAACTTGCTAATGTTTCTACTTTTATCAATGAAGAGGATCTTCTTATCTATGCTT
TAAATGGCCTTCCAAATGAGTACAACACTTTCCGAACGTCAATGCGTACACGTTCTCAACCTGTTACTTTTGAAGAACTTCATGTTCTTCTAAGAGCTGAGGAATCGGCT
CTTGCAAAACAATCTAAGTGTGATGATTCGTATAATCAACCGACTGTTTTACTCTCTTCTTCTCAATCTCTCATGTCATGTGCTCCTACTTTCAATAACAACTTTGTTCG
AGGCAACGGACATGGTAAAAATTATGGACATGGACGTTTTTCTTTCGATGCTCAAACTCGTGGTCATGGTTTGTCTCAAGAACAAAAGCCCGTTCATGATAATCATGCAA
CTTGTCAGATTTGTTCACGTCGTGGCCACACTGCACTCGATTGTTTCAATCGCATGAACTATAATTTTCAAGGACGTCATCCTCCACAACAACTTGCTGCAATGGTTGCA
TCGCAAAATAATGCATTTCTATCTATTGTGAATTCGTCTTCTTTGACCGATTCGGGTTGCAACACTCATATTACTTCAGACATGAATTATGTTTCTCTTGCACCTGAATA
TAATGGTGAAGAACAAGTTGGTGTTGGTAATGGACAGACTCGGCCTATTTCTCACTCAGGTTCTGATACTTTTGAACCTTCTTCCTATTTCTCTCTATCTAATCTTGTTT
TTGTTCTTAATATCATTTCTAGTTTCCTTTTTGTTCATTAA
Protein sequence
MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLFGFVDGTNPCPQTSPSTTSTVPPQTNPLYEDWIAKDQALMTVINATLSPEALAYV
VGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESA
LAKQSKCDDSYNQPTVLLSSSQSLMSCAPTFNNNFVRGNGHGKNYGHGRFSFDAQTRGHGLSQEQKPVHDNHATCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAMVA
SQNNAFLSIVNSSSLTDSGCNTHITSDMNYVSLAPEYNGEEQVGVGNGQTRPISHSGSDTFEPSSYFSLSNLVFVLNIISSFLFVH