CuGenDBv2

Carg05685 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg05685
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Retrotran_gag_3 domain-containing protein
Genome location: Carg_Chr16:3845122..3846240
RNA-Seq Expression: Carg05685
Synteny: Carg05685
Gene Ontology terms: NA
InterPro domains: IPR029472 - Retrotransposon Copia-like, N-terminal


Homology
GenBank top hits | e value | % identity | Alignment
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus] | e value: 9.5e-147 | % identity: 74.48
Query:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPS------------YDDWFAKDQALMTVINA
        M SS +  SSS E + LSPI LLSNICNLIS++LDSTN+VLWKFQLT +LKAHKLFGF+DGT PCP ++PS            Y+DW AKDQALMTVINA
Subjt:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPS------------YDDWFAKDQALMTVINA

Query:  TLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRT
        TLSPEALAYVVGST+SKQVW+VLAKLYSS SRSNVVNLKSDLQTI KK DESIDAYIKRIKEIKDKLANVST +N+EDLLIYALNGLP EYNTFRTSMRT
Subjt:  TLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRT

Query:  RSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQICL
        RS PVTFEELHVLL+AEESALAKQSK DD   QPT LL+SSQSL+S A TFNNNFVRG G G+ +GHG  SFD Q RG G SQ+Q+  V +NH++CQIC 
Subjt:  RSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQICL

Query:  RRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSG
        RRGH ALDCFNRMNYNF GRHPP  LAAMVASQNNAFLSIVN              SDMNY+SLA  Y+GEEQVGVG+GQ+ PISHSG
Subjt:  RRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSG

KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.5e-197 | % identity: 100
Query:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAYVVG
        MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAYVVG
Subjt:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAYVVG

Query:  STTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHV
        STTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHV
Subjt:  STTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHV

Query:  LLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIALDCFNR
        LLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIALDCFNR
Subjt:  LLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIALDCFNR

Query:  MNYNFHGRHPPHHLAAMVASQNNAFLSIVNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSG
        MNYNFHGRHPPHHLAAMVASQNNAFLSIVNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSG
Subjt:  MNYNFHGRHPPHHLAAMVASQNNAFLSIVNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSG

KAG7015254.1 hypothetical protein SDJN02_22888, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.7e-204 | % identity: 100
Query:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAYVVG
        MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAYVVG
Subjt:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAYVVG

Query:  STTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHV
        STTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHV
Subjt:  STTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHV

Query:  LLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIALDCFNR
        LLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIALDCFNR
Subjt:  LLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIALDCFNR

Query:  MNYNFHGRHPPHHLAAMVASQNNAFLSIVNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSGCGTFETPPPP
        MNYNFHGRHPPHHLAAMVASQNNAFLSIVNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSGCGTFETPPPP
Subjt:  MNYNFHGRHPPHHLAAMVASQNNAFLSIVNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSGCGTFETPPPP

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo] | e value: 1.2e-146 | % identity: 73.42
Query:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI
        M SS +  SSS E + LSPI LLSNICNLIS++LDSTN+VLWKFQLT +LKAHKL+GFIDGT PC              P SNPSY+DW AKDQALMTVI
Subjt:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI

Query:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM
        NATLSPEALAYVVGST+SKQVW+VLAKLYSS SRSNVVNLKSDLQTI KK DESIDAYIKRIKEIKDKLANVST +N+EDLLIYALNGLP EYNTFRTSM
Subjt:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM

Query:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQI
        RTRS PVTFEELHVLL+AEESALAKQSK DD   QPT LL+SSQSL+S A TF+NNFVRG G G+ +GHG  SFD Q RG GSS +Q+  V +NH++CQI
Subjt:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQI

Query:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSGCGTFE
        C RRGH ALDCFNRMNYNF GRHPP  LAAMVASQNNAFLSIVN              SDMNY+SLA  Y+GEEQVG+G+GQ+ P+SHSG   FE
Subjt:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSGCGTFE

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus] | e value: 9.5e-147 | % identity: 74.48
Query:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPS------------YDDWFAKDQALMTVINA
        M SS +  SSS E + LSPI LLSNICNLIS++LDSTN+VLWKFQLT +LKAHKLFGF+DGT PCP ++PS            Y+DW AKDQALMTVINA
Subjt:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPS------------YDDWFAKDQALMTVINA

Query:  TLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRT
        TLSPEALAYVVGST+SKQVW+VLAKLYSS SRSNVVNLKSDLQTI KK DESIDAYIKRIKEIKDKLANVST +N+EDLLIYALNGLP EYNTFRTSMRT
Subjt:  TLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRT

Query:  RSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQICL
        RS PVTFEELHVLL+AEESALAKQSK DD   QPT LL+SSQSL+S A TFNNNFVRG G G+ +GHG  SFD Q RG G SQ+Q+  V +NH++CQIC 
Subjt:  RSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQICL

Query:  RRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSG
        RRGH ALDCFNRMNYNF GRHPP  LAAMVASQNNAFLSIVN              SDMNY+SLA  Y+GEEQVGVG+GQ+ PISHSG
Subjt:  RRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSG

TrEMBL top hits | e value | % identity | Alignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X2 | e value: 6.0e-147 | % identity: 73.42
Query:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI
        M SS +  SSS E + LSPI LLSNICNLIS++LDSTN+VLWKFQLT +LKAHKL+GFIDGT PC              P SNPSY+DW AKDQALMTVI
Subjt:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI

Query:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM
        NATLSPEALAYVVGST+SKQVW+VLAKLYSS SRSNVVNLKSDLQTI KK DESIDAYIKRIKEIKDKLANVST +N+EDLLIYALNGLP EYNTFRTSM
Subjt:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM

Query:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQI
        RTRS PVTFEELHVLL+AEESALAKQSK DD   QPT LL+SSQSL+S A TF+NNFVRG G G+ +GHG  SFD Q RG GSS +Q+  V +NH++CQI
Subjt:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQI

Query:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSGCGTFE
        C RRGH ALDCFNRMNYNF GRHPP  LAAMVASQNNAFLSIVN              SDMNY+SLA  Y+GEEQVG+G+GQ+ P+SHSG   FE
Subjt:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSGCGTFE

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X3 | e value: 2.3e-146 | % identity: 73.85
Query:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI
        M SS +  SSS E + LSPI LLSNICNLIS++LDSTN+VLWKFQLT +LKAHKL+GFIDGT PC              P SNPSY+DW AKDQALMTVI
Subjt:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI

Query:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM
        NATLSPEALAYVVGST+SKQVW+VLAKLYSS SRSNVVNLKSDLQTI KK DESIDAYIKRIKEIKDKLANVST +N+EDLLIYALNGLP EYNTFRTSM
Subjt:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM

Query:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQI
        RTRS PVTFEELHVLL+AEESALAKQSK DD   QPT LL+SSQSL+S A TF+NNFVRG G G+ +GHG  SFD Q RG GSS +Q+  V +NH++CQI
Subjt:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQI

Query:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSG
        C RRGH ALDCFNRMNYNF GRHPP  LAAMVASQNNAFLSIVN              SDMNY+SLA  Y+GEEQVG+G+GQ+ P+SHSG
Subjt:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSG

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X1 | e value: 6.0e-147 | % identity: 73.42
Query:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI
        M SS +  SSS E + LSPI LLSNICNLIS++LDSTN+VLWKFQLT +LKAHKL+GFIDGT PC              P SNPSY+DW AKDQALMTVI
Subjt:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI

Query:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM
        NATLSPEALAYVVGST+SKQVW+VLAKLYSS SRSNVVNLKSDLQTI KK DESIDAYIKRIKEIKDKLANVST +N+EDLLIYALNGLP EYNTFRTSM
Subjt:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM

Query:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQI
        RTRS PVTFEELHVLL+AEESALAKQSK DD   QPT LL+SSQSL+S A TF+NNFVRG G G+ +GHG  SFD Q RG GSS +Q+  V +NH++CQI
Subjt:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQI

Query:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSGCGTFE
        C RRGH ALDCFNRMNYNF GRHPP  LAAMVASQNNAFLSIVN              SDMNY+SLA  Y+GEEQVG+G+GQ+ P+SHSG   FE
Subjt:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSGCGTFE

A0A5D3CLI6 T4.5 | e value: 8.7e-146 | % identity: 73.78
Query:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI
        M SS +  SSS E + LSPI LLSNICNLIS++LDSTN+VLWKFQLT +LKAHKL+GFIDGT PC              P SNPSY+DW AKDQALMTVI
Subjt:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI

Query:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM
        NATLSPEALAYVVGST+SKQVW+VLAKLYSS SRSNVVNLKSDLQTI KK DESIDAYIKRIKEIKDKLANVST +N+EDLLIYALNGLP EYNTFRTSM
Subjt:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM

Query:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQI
        RTRS PVTFEELHVLL+AEESALAKQSK DD   QPT LL+SSQSL+S A TF+NNFVRG G G+ +GHG  SFD Q RG GSS +Q+  V +NH++CQI
Subjt:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQI

Query:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHS
        C RRGH ALDCFNRMNYNF GRHPP  LAAMVASQNNAFLSIVN              SDMNY+SLA  Y+GEEQVG+G+GQ+ P+SHS
Subjt:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHS

A0A6J1D9L6 uncharacterized protein LOC111018892 | e value: 6.3e-112 | % identity: 57.43
Query:  SSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDG---------------------TQPCPISNPSYDDWFAKDQALMT
        +SSS++T+ +L SPI LLSNICNL+SI+LDST+++LWKFQLT +LKAHKLFGFIDG                     T   P+ NP ++DW AKDQALMT
Subjt:  SSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDG---------------------TQPCPISNPSYDDWFAKDQALMT

Query:  VINATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRT
        +INATLS EALAYVV S TSKQVW VL K YSS+SR+NVVNLKSDLQ+I KK++ESIDAY+KRIKEIKDK ANVS  +NDE LLIYALNGL TEYNT  T
Subjt:  VINATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRT

Query:  SMRTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHG----CSSFDTQGRGGGSSQQQQLVVANN
        SMRTR+  V+FEELHV +K+EESA+ KQ KR+DL  QP AL ASS    +  S F+ N    RGRG+ +G G      +F  QGRG  S        A+N
Subjt:  SMRTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHG----CSSFDTQGRGGGSSQQQQLVVANN

Query:  HSSCQICLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------------SDMNYLSLASGYHGEEQVGVGSGQSLPISH
         S CQIC + GH ALDC+NRMN++F GRHPP  LAAMVA QNN++L++ N                    S+++  S+AS Y+GEE + VGSGQS PI+H
Subjt:  HSSCQICLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------------SDMNYLSLASGYHGEEQVGVGSGQSLPISH

Query:  SGCG
         GCG
Subjt:  SGCG

SwissProt top hits | e value | % identity | Alignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 5.6e-09 | % identity: 22.15
Query:  WKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESI
        W+ ++  LL    L   +D     P      +DW   D+   + I   LS + +  ++   T++ +W  L  LY S + +N + LK  L  +      + 
Subjt:  WKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESI

Query:  DAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNN
         +++     +  +LAN+   + +ED  I  LN LP+ Y+   T++    T +  +++   L   E    K   +              Q+L++       
Subjt:  DAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNN

Query:  NFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIALDCFN--RMNYNFHGRHPPHHLAAMVASQNNAFLSI
              GRGR +    +++   G  G S  + +  V N    C  C + GH   DC N  +      G+    + AAMV + +N  L I
Subjt:  NFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIALDCFN--RMNYNFHGRHPPHHLAAMVASQNNAFLSI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 6.9e-23 | % identity: 26.93
Query:  SSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFID----------GTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAY
        ++ + E  L +  +L  N+ N+   KL STNY++W  Q+  L   ++L GF+D          GT   P  NP Y  W  +D+ + + +   +S      
Subjt:  SSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFID----------GTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAY

Query:  VVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEE
        V  +TT+ Q+W  L K+Y++ S  +V  L++ L+  + K  ++ID Y++ +    D+LA +   ++ ++ +   L  LP EY      +  + TP T  E
Subjt:  VVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEE

Query:  LHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVV---ANNHSS------CQICL
        +H  L   ES +   S    +      + A++ S  +  +T NNN          +G+  + +D +     S   QQ       NN+ S      CQIC 
Subjt:  LHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVV---ANNHSS------CQICL

Query:  RRGHIALDCFNRMNY--NFHGRHPPHHLA-----AMVA-----SQNNAFLSI-----VNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSGCGTFETPPP
         +GH A  C    ++  + + + PP         A +A     S NN  L       + SD N LSL   Y G + V V  G ++PISH+G  +  T   
Subjt:  RRGHIALDCFNRMNY--NFHGRHPPHHLA-----AMVA-----SQNNAFLSI-----VNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSGCGTFETPPP

Query:  P
        P
Subjt:  P

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 2.5e-17 | % identity: 24.04
Query:  SSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPIS----------NPSYDDWFAKDQALMTVINATLSPEALAY
        ++ + E  L++  +L  N+ N+   KL STNY++W  Q+  L   ++L GF+DG+ P P +          NP Y  W  +D+ + + I   +S      
Subjt:  SSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPIS----------NPSYDDWFAKDQALMTVINATLSPEALAY

Query:  VVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEE
        V  +TT+ Q+W  L K+Y++ S  +V  L+                +I R     D+LA +   ++ ++ +   L  LP +Y      +  + TP +  E
Subjt:  VVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEE

Query:  LHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHG---CSSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIA
        +H  L   ES L        L      ++  + +++++ +T  N     RG  R + +     +S+     G  S  +Q          CQIC  +GH A
Subjt:  LHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHG---CSSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIA

Query:  LDCFNRMNYN-----------FHGRHPPHHLAAMVASQNNAFL------SIVNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSGCGTFET
          C     +            F    P  +LA       N +L        + SD N LS    Y G + V +  G ++PI+H+G  +  T
Subjt:  LDCFNRMNYN-----------FHGRHPPHHLAAMVASQNNAFL------SIVNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSGCGTFET

Arabidopsis top hits | e value | % identity | Alignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | e value: 1.0e-05 | % identity: 23.27
Query:  SNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPIS-NPSYDDWFAKDQALMTVINATLSPEALAYVVGST
        S S +S       L P +   +  ++  +  D  NYV WK +  + L+  K FGFIDGT P P   +P Y  W   +  +M  +  +++ + L  V+ + 
Subjt:  SNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPIS-NPSYDDWFAKDQALMTVINATLSPEALAYVVGST

Query:  TSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTV
        T+ ++W  L +++       +  L+  L T+ ++  +S++ Y  ++ ++  +L+  + +
Subjt:  TSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTV

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | e value: 3.3e-12 | % identity: 26.59
Query:  LSNICNLISIKLD--STNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEAL-AYVVGSTTSKQVWNVLAKLYSSSS
        +SNI + I + LD   +NY  W+    T   +  + G IDGT     +N +  +W  +D  +   +  TL+P+      V S+TS+ +W  +   + ++ 
Subjt:  LSNICNLISIKLD--STNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEAL-AYVVGSTTSKQVWNVLAKLYSSSS

Query:  RSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHVLLKAEESALAKQSKRD---
         +  + L S+L+T     D  +  Y +++K++ D L NV   V D +L++Y LNGL  +++     ++ R    +F++   +L+ EE  L +  K +   
Subjt:  RSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHVLLKAEESALAKQSKRD---

Query:  -DLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGG
         D     T L  S    ++       N +  RGRGRG+         +GRGG
Subjt:  -DLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGG


Sequences
CDS sequence
ATGGATTCCTCTAATAGTTCATCCTCTTCTTCGACCGAGACAAACTTGCTCTCACCGATTGTTCTGTTGTCGAACATCTGCAACCTGATATCGATAAAGCTCGACTCGAC
GAATTATGTCCTTTGGAAGTTTCAGTTGACAACACTTTTGAAAGCTCATAAACTTTTTGGCTTTATTGATGGTACTCAACCATGTCCGATATCGAATCCTTCGTACGACG
ATTGGTTTGCTAAGGATCAAGCTCTTATGACAGTCATAAATGCTACACTTTCACCTGAGGCTTTGGCATATGTGGTTGGTAGCACTACTTCTAAACAGGTTTGGAATGTT
CTTGCAAAGCTTTATTCTTCTAGTTCAAGGTCTAATGTAGTGAATTTGAAGTCTGATCTACAAACTATTTCCAAGAAGTCTGATGAATCTATTGATGCCTATATTAAACG
GATTAAGGAGATCAAGGACAAGCTTGCTAATGTTTCTACTGTTGTCAATGATGAGGATCTTCTTATCTATGCTCTAAATGGCCTACCAACCGAGTATAATACTTTTCGAA
CGTCGATGCGTACGCGTTCTACGCCTGTTACTTTTGAAGAACTTCATGTTCTTCTAAAAGCTGAGGAATCAGCTCTTGCAAAACAGTCTAAGCGTGATGATTTGTGTAAA
CAACCAACTGCTTTACTTGCTTCTTCTCAATCTCTTATGTCTTATGCTTCTACTTTTAATAATAACTTTGTTCGAGGTCGTGGACGTGGACGTGGACATGGACATGGATG
TTCTTCTTTTGATACTCAAGGTCGCGGGGGTGGTTCTTCCCAACAGCAGCAATTGGTTGTTGCTAATAATCATTCATCTTGTCAGATTTGTTTACGTCGTGGCCATATTG
CGCTCGACTGTTTCAATCGTATGAACTATAATTTTCATGGACGCCACCCTCCACATCACCTTGCTGCAATGGTTGCATCCCAGAATAATGCTTTTCTGTCTATTGTTAAT
TCTGATATGAATTATCTTTCTCTTGCATCTGGATATCATGGTGAAGAACAAGTTGGCGTTGGTAGTGGACAGTCTCTGCCTATTTCTCATTCAGGTTGTGGTACTTTTGA
AACTCCTCCTCCTCCTTGA
mRNA sequence
ATGGATTCCTCTAATAGTTCATCCTCTTCTTCGACCGAGACAAACTTGCTCTCACCGATTGTTCTGTTGTCGAACATCTGCAACCTGATATCGATAAAGCTCGACTCGAC
GAATTATGTCCTTTGGAAGTTTCAGTTGACAACACTTTTGAAAGCTCATAAACTTTTTGGCTTTATTGATGGTACTCAACCATGTCCGATATCGAATCCTTCGTACGACG
ATTGGTTTGCTAAGGATCAAGCTCTTATGACAGTCATAAATGCTACACTTTCACCTGAGGCTTTGGCATATGTGGTTGGTAGCACTACTTCTAAACAGGTTTGGAATGTT
CTTGCAAAGCTTTATTCTTCTAGTTCAAGGTCTAATGTAGTGAATTTGAAGTCTGATCTACAAACTATTTCCAAGAAGTCTGATGAATCTATTGATGCCTATATTAAACG
GATTAAGGAGATCAAGGACAAGCTTGCTAATGTTTCTACTGTTGTCAATGATGAGGATCTTCTTATCTATGCTCTAAATGGCCTACCAACCGAGTATAATACTTTTCGAA
CGTCGATGCGTACGCGTTCTACGCCTGTTACTTTTGAAGAACTTCATGTTCTTCTAAAAGCTGAGGAATCAGCTCTTGCAAAACAGTCTAAGCGTGATGATTTGTGTAAA
CAACCAACTGCTTTACTTGCTTCTTCTCAATCTCTTATGTCTTATGCTTCTACTTTTAATAATAACTTTGTTCGAGGTCGTGGACGTGGACGTGGACATGGACATGGATG
TTCTTCTTTTGATACTCAAGGTCGCGGGGGTGGTTCTTCCCAACAGCAGCAATTGGTTGTTGCTAATAATCATTCATCTTGTCAGATTTGTTTACGTCGTGGCCATATTG
CGCTCGACTGTTTCAATCGTATGAACTATAATTTTCATGGACGCCACCCTCCACATCACCTTGCTGCAATGGTTGCATCCCAGAATAATGCTTTTCTGTCTATTGTTAAT
TCTGATATGAATTATCTTTCTCTTGCATCTGGATATCATGGTGAAGAACAAGTTGGCGTTGGTAGTGGACAGTCTCTGCCTATTTCTCATTCAGGTTGTGGTACTTTTGA
AACTCCTCCTCCTCCTTGA
Protein sequence
MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAYVVGSTTSKQVWNV
LAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHVLLKAEESALAKQSKRDDLCK
QPTALLASSQSLMSYASTFNNNFVRGRGRGRGHGHGCSSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN
SDMNYLSLASGYHGEEQVGVGSGQSLPISHSGCGTFETPPPP