; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0021838 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0021838
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr01:15542532..15544120
RNA-Seq ExpressionIVF0021838
SyntenyIVF0021838
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049700.1 T4.5 [Cucumis melo var. makuwa]7.14e-277100Show/hide
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHS
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHS
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHS

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]1.35e-278100Show/hide
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSG
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSG
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSG

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]1.25e-28699.75Show/hide
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSGQVFGKNFVPKA
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSGQVFGK FVPKA
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSGQVFGKNFVPKA

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]1.93e-26994.75Show/hide
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKL+GF+DGTNPCP    + S+TSTVPPQ+NP YEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSK DDSYNQPTVLLSSSQSLLSCAPTF+NNFVRGNGHGK+YGHGRFSFDAQTRGHG S EQK VHDNHATCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSGQVFGKNFVPKA
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNT ITSDMNYVSLAPEYNGEEQVG+GNGQTRP+SHSGQVFGK FVPKA
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSGQVFGKNFVPKA

XP_016900446.1 PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo]2.05e-278100Show/hide
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSG
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSG
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSG

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X23.1e-218100Show/hide
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSG
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSG
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSG

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X33.8e-22499.75Show/hide
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSGQVFGKNFVPKA
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSGQVFGK FVPKA
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSGQVFGKNFVPKA

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X13.1e-218100Show/hide
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSG
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSG
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSG

A0A5D3CLI6 T4.51.2e-217100Show/hide
Query:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI
        MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI
Subjt:  MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
        NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM

Query:  RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC
        RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC
Subjt:  RTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQIC

Query:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHS
        SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHS
Subjt:  SRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHS

A0A6J1D9L6 uncharacterized protein LOC1110188925.2e-12058.8Show/hide
Query:  TLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPR-------TNNSSSTSTVPPQSNPSYEDWIAKDQALMT
        T  S++ +KD  SPIFLLSNICNL+S+RLDST+F+LWKFQLTAILKAHKL+GFIDG+   P +       T +  +T+T  P  NP +EDWIAKDQALMT
Subjt:  TLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPR-------TNNSSSTSTVPPQSNPSYEDWIAKDQALMT

Query:  VINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRT
        +INATLS EALAYVV S +SKQVW+VL K YSS SR+NVVNLKSDLQ+I KK +ESIDAY+KRIKEIKDK ANVS  IN+E LLIYALNGL  EYNT  T
Subjt:  VINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRT

Query:  SMRTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRF----SFDAQTRGHGSSPEQKSVH-DN
        SMRTR+Q V+FEELHV +++EESA+ KQ K +D   QP  L +SS    +    F  N     G GK+ G G+     +F  Q RG  S     S   DN
Subjt:  SMRTRSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRF----SFDAQTRGHGSSPEQKSVH-DN

Query:  HATCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSS---LTDSGCNTRITSD---MNYVSLAPEYNGEEQVGIGNGQTRPMSH
         + CQIC + GHTALDC+NRMN++FQGRHPP QLAAMVA QNN++L++ NSS    L DS CNT +T+D   ++  S+A +YNGEE + +G+GQ+ P++H
Subjt:  HATCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAMVASQNNAFLSIVNSSS---LTDSGCNTRITSD---MNYVSLAPEYNGEEQVGIGNGQTRPMSH

Query:  --SGQVFGKNFVPKA
           GQVFG N+VP+A
Subjt:  --SGQVFGKNFVPKA

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.4e-3127.3Show/hide
Query:  RLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSG
        +L STN+++W  Q+ A+   ++L GF+DG+   PP T  + +     P+ NP Y  W  +D+ + + +   +S      V  +T++ Q+W+ L K+Y++ 
Subjt:  RLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSG

Query:  SRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKGDDS
        S  +V  L++ L+  + K  ++ID Y++ +    D+LA +   ++ ++ +   L  LP EY      +  +  P T  E+H  L   ES +   S     
Subjt:  SRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKGDDS

Query:  YNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKS---VHDNH-------ATCQICSRRGHTALDCFNRMNY--NFQG
            TV+  ++ ++     T  NN   GN + +        +D +   + S P Q+S    H N+         CQIC  +GH+A  C    ++  +   
Subjt:  YNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKS---VHDNH-------ATCQICSRRGHTALDCFNRMNY--NFQG

Query:  RHPPQQLAAMVASQNNAFLSIVNSSS-LTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSG
        + PP          N A  S  +S++ L DSG    ITSD N +SL   Y G + V + +G T P+SH+G
Subjt:  RHPPQQLAAMVASQNNAFLSIVNSSS-LTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.7e-2425.62Show/hide
Query:  RLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSG
        +L STN+++W  Q+ A+   ++L GF+DG+ P PP T  + +     P+ NP Y  W  +D+ + + I   +S      V  +T++ Q+W+ L K+Y++ 
Subjt:  RLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSG

Query:  SRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKGDDS
        S  +V  L+                +I R     D+LA +   ++ ++ +   L  LP++Y      +  +  P +  E+H  L   ES L   +  +  
Subjt:  SRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEESALAKQSKGDDS

Query:  YNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDA-QTRGHGSSPEQKSVHDNHATCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAMV
              ++  + ++++   T  N      G  ++Y +     ++ Q    GS  + +        CQICS +GH+A  C     + FQ     QQ  +  
Subjt:  YNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDA-QTRGHGSSPEQKSVHDNHATCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAMV

Query:  A----SQNNAFLSIVNSSS-LTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSG
               N A  S  N+++ L DSG    ITSD N +S    Y G + V I +G T P++H+G
Subjt:  A----SQNNAFLSIVNSSS-LTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSG

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).4.2e-0521.82Show/hide
Query:  TLPSSSAEKDSLSPIFLLSNI-----CNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI
        T+ S S   D  SP +L  +I      ++  +  D  N+V WK +  + L+  K +GFIDGT P              P   +P Y+ W   +  +M  +
Subjt:  TLPSSSAEKDSLSPIFLLSNI-----CNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVI

Query:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEI
          +++ + L  V+ + ++ ++W+ L +++       +  L+  L T+ ++  +S++ Y  ++ ++
Subjt:  NATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEI

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.5e-1025.19Show/hide
Query:  IFLLSNICNLISMRLD--STNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEAL-AYVVGST
        I+ +SNI + I + LD   +N+  W+        +  + G IDGT               +P  +N    +W  +D  +   +  TL+P+      V S+
Subjt:  IFLLSNICNLISMRLD--STNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEAL-AYVVGST

Query:  SSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLL
        +S+ +W  +   + +   +  + L S+L+T     D  +  Y +++K++ D L NV   + + +L++Y LNGL  +++     ++ R    +F++   +L
Subjt:  SSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLL

Query:  RAEESALAKQSKGDDSYNQPTVLLSSSQSLLSC--APTFDNNFVRGNGHGKHY-GHGR
        + EE  L +  K + ++    V  SSS ++L+C  AP    NF R  G+   Y G GR
Subjt:  RAEESALAKQSKGDDSYNQPTVLLSSSQSLLSC--APTFDNNFVRGNGHGKHY-GHGR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCCTCAACTACCTTGCCTTCTTCTTCGGCTGAGAAAGATTCACTTTCACCAATTTTTCTACTGTCCAACATCTGTAACCTGATTTCAATGAGGCTTGACTCTAC
AAATTTTGTCCTTTGGAAATTCCAATTGACAGCGATTTTGAAAGCTCATAAACTTTATGGCTTTATTGATGGTACTAATCCATGTCCTCCTCGGACTAATAATTCCTCTT
CTACCTCAACCGTTCCGCCTCAATCGAATCCTTCATATGAAGATTGGATTGCTAAGGATCAAGCTCTTATGACAGTCATAAATGCTACACTTTCACCTGAGGCTTTGGCA
TATGTTGTTGGAAGCACTTCTTCCAAACAGGTTTGGGATGTTCTTGCAAAGCTTTATTCTTCTGGTTCCCGGTCTAATGTGGTGAATTTAAAGTCGGATTTGCAAACTAT
TTACAAGAAGCCTGATGAATCTATTGATGCCTACATTAAACGGATAAAGGAGATCAAGGATAAACTTGCTAATGTTTCTACTTTTATTAATGAAGAGGATCTTCTTATCT
ATGCTTTAAATGGCCTTCCAAATGAGTATAACACTTTTCGAACGTCAATGCGTACACGTTCTCAACCTGTTACTTTTGAAGAACTTCATGTTCTTCTAAGAGCTGAGGAA
TCAGCTCTTGCAAAACAATCCAAGGGTGATGATTCGTATAATCAACCGACTGTTTTACTCTCTTCTTCTCAATCTCTCCTGTCTTGTGCTCCTACTTTCGATAACAACTT
TGTTCGAGGCAATGGACATGGTAAACATTATGGACATGGACGCTTTTCTTTCGATGCTCAAACTCGTGGTCATGGTTCTTCCCCAGAACAAAAGTCTGTTCATGATAATC
ATGCAACTTGTCAGATTTGTTCACGTCGTGGCCACACTGCACTCGATTGTTTCAATCGCATGAACTATAATTTTCAAGGACGTCATCCTCCACAACAACTTGCTGCAATG
GTTGCATCGCAGAATAATGCATTTCTATCTATTGTGAATTCGTCTTCTTTGACTGATTCAGGTTGCAACACTCGTATTACTTCAGATATGAATTATGTTTCTCTTGCACC
TGAATATAATGGTGAAGAACAAGTTGGCATTGGTAATGGACAGACTCGGCCTATGTCCCACTCAGGACAAGTCTTCGGGAAAAATTTTGTTCCAAAGGCCTAG
mRNA sequenceShow/hide mRNA sequence
CTTTTCTTTGTTCGAAGTGTTTGAACTTCTTTCTTTGTGAAACTGAAGTCTCAAGTCTCTCTTGATCTTTTGCCAAGTGTACCCGATGAGTTCCTCAACTACCTTGCCTT
CTTCTTCGGCTGAGAAAGATTCACTTTCACCAATTTTTCTACTGTCCAACATCTGTAACCTGATTTCAATGAGGCTTGACTCTACAAATTTTGTCCTTTGGAAATTCCAA
TTGACAGCGATTTTGAAAGCTCATAAACTTTATGGCTTTATTGATGGTACTAATCCATGTCCTCCTCGGACTAATAATTCCTCTTCTACCTCAACCGTTCCGCCTCAATC
GAATCCTTCATATGAAGATTGGATTGCTAAGGATCAAGCTCTTATGACAGTCATAAATGCTACACTTTCACCTGAGGCTTTGGCATATGTTGTTGGAAGCACTTCTTCCA
AACAGGTTTGGGATGTTCTTGCAAAGCTTTATTCTTCTGGTTCCCGGTCTAATGTGGTGAATTTAAAGTCGGATTTGCAAACTATTTACAAGAAGCCTGATGAATCTATT
GATGCCTACATTAAACGGATAAAGGAGATCAAGGATAAACTTGCTAATGTTTCTACTTTTATTAATGAAGAGGATCTTCTTATCTATGCTTTAAATGGCCTTCCAAATGA
GTATAACACTTTTCGAACGTCAATGCGTACACGTTCTCAACCTGTTACTTTTGAAGAACTTCATGTTCTTCTAAGAGCTGAGGAATCAGCTCTTGCAAAACAATCCAAGG
GTGATGATTCGTATAATCAACCGACTGTTTTACTCTCTTCTTCTCAATCTCTCCTGTCTTGTGCTCCTACTTTCGATAACAACTTTGTTCGAGGCAATGGACATGGTAAA
CATTATGGACATGGACGCTTTTCTTTCGATGCTCAAACTCGTGGTCATGGTTCTTCCCCAGAACAAAAGTCTGTTCATGATAATCATGCAACTTGTCAGATTTGTTCACG
TCGTGGCCACACTGCACTCGATTGTTTCAATCGCATGAACTATAATTTTCAAGGACGTCATCCTCCACAACAACTTGCTGCAATGGTTGCATCGCAGAATAATGCATTTC
TATCTATTGTGAATTCGTCTTCTTTGACTGATTCAGGTTGCAACACTCGTATTACTTCAGATATGAATTATGTTTCTCTTGCACCTGAATATAATGGTGAAGAACAAGTT
GGCATTGGTAATGGACAGACTCGGCCTATGTCCCACTCAGGACAAGTCTTCGGGAAAAATTTTGTTCCAAAGGCCTAGCATTGGTGATCTGATCACTTCTAAGGCTGTGG
CTGCTTCTAGTTTATCTTCCACCAGTCTGTTGTTCTACTGTTGCTTATGTTGCTGACAAGTCCTCTTTTGCTCATATTGCTGTTCTTCTTAAGTTTGTTGTTTTCTTGTT
GCATTTGCC
Protein sequenceShow/hide protein sequence
MSSSTTLPSSSAEKDSLSPIFLLSNICNLISMRLDSTNFVLWKFQLTAILKAHKLYGFIDGTNPCPPRTNNSSSTSTVPPQSNPSYEDWIAKDQALMTVINATLSPEALA
YVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRTRSQPVTFEELHVLLRAEE
SALAKQSKGDDSYNQPTVLLSSSQSLLSCAPTFDNNFVRGNGHGKHYGHGRFSFDAQTRGHGSSPEQKSVHDNHATCQICSRRGHTALDCFNRMNYNFQGRHPPQQLAAM
VASQNNAFLSIVNSSSLTDSGCNTRITSDMNYVSLAPEYNGEEQVGIGNGQTRPMSHSGQVFGKNFVPKA