; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh11G017150 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh11G017150
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationCmo_Chr11:12157833..12160153
RNA-Seq ExpressionCmoCh11G017150
SyntenyCmoCh11G017150
Gene Ontology termsNA
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]8.0e-14674.74Show/hide
Query:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPS------------YDDWFAKDQALMTVINA
        M SS +  SSS E + LSPI LLSNICNLIS++LDSTN+VLWKFQLT +LKAHKLFGF+DGT PCP ++PS            Y+DW AKDQALMTVINA
Subjt:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPS------------YDDWFAKDQALMTVINA

Query:  TLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRT
        TLSPEALAYVVGST+SKQVW+VLAKLYSS SRSNVVNLKSDLQTI KK DESIDAYIKRIKEIKDKLANVST +N+EDLLIYALNGLP EYNTFRTSMRT
Subjt:  TLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRT

Query:  RSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRG--HGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQICL
        RS PVTFEELHVLL+AEESALAKQSK DD   QPT LL+SSQSL+S A TFNNNFVRG G G  +GHGR SFD Q RG G SQ+Q+  V +NH++CQIC 
Subjt:  RSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRG--HGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQICL

Query:  RRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSG
        RRGH ALDCFNRMNYNF GRHPP  LAAMVASQNNAFLSIVN              SDMNY+SLA  Y+GEEQVGVG+GQ+ PISHSG
Subjt:  RRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSG

KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia]2.8e-19998.93Show/hide
Query:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAYVVG
        MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAYVVG
Subjt:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAYVVG

Query:  STTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHV
        STTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHV
Subjt:  STTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHV

Query:  LLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFV--RGRGRGHGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIALDCFNR
        LLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFV  RGRGRGHGHG SSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIALDCFNR
Subjt:  LLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFV--RGRGRGHGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIALDCFNR

Query:  MNYNFHGRHPPHHLAAMVASQNNAFLSIVNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSGQVMEQNFVPKN
        MNYNFHGRHPPHHLAAMVASQNNAFLSIVNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSGQ MEQNFVPKN
Subjt:  MNYNFHGRHPPHHLAAMVASQNNAFLSIVNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSGQVMEQNFVPKN

KAG7015254.1 hypothetical protein SDJN02_22888, partial [Cucurbita argyrosperma subsp. argyrosperma]3.0e-19399.17Show/hide
Query:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAYVVG
        MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAYVVG
Subjt:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAYVVG

Query:  STTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHV
        STTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHV
Subjt:  STTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHV

Query:  LLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFV--RGRGRGHGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIALDCFNR
        LLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFV  RGRGRGHGHG SSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIALDCFNR
Subjt:  LLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFV--RGRGRGHGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIALDCFNR

Query:  MNYNFHGRHPPHHLAAMVASQNNAFLSIVNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSG
        MNYNFHGRHPPHHLAAMVASQNNAFLSIVNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSG
Subjt:  MNYNFHGRHPPHHLAAMVASQNNAFLSIVNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSG

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]1.3e-14873.75Show/hide
Query:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI
        M SS +  SSS E + LSPI LLSNICNLIS++LDSTN+VLWKFQLT +LKAHKL+GFIDGT PC              P SNPSY+DW AKDQALMTVI
Subjt:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI

Query:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM
        NATLSPEALAYVVGST+SKQVW+VLAKLYSS SRSNVVNLKSDLQTI KK DESIDAYIKRIKEIKDKLANVST +N+EDLLIYALNGLP EYNTFRTSM
Subjt:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM

Query:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRG--HGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQI
        RTRS PVTFEELHVLL+AEESALAKQSK DD   QPT LL+SSQSL+S A TF+NNFVRG G G  +GHGR SFD Q RG GSS +Q+  V +NH++CQI
Subjt:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRG--HGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQI

Query:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSGQVMEQNFVPK
        C RRGH ALDCFNRMNYNF GRHPP  LAAMVASQNNAFLSIVN              SDMNY+SLA  Y+GEEQVG+G+GQ+ P+SHSGQV  + FVPK
Subjt:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSGQVMEQNFVPK

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]4.5e-14974.37Show/hide
Query:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPS------------YDDWFAKDQALMTVINA
        M SS +  SSS E + LSPI LLSNICNLIS++LDSTN+VLWKFQLT +LKAHKLFGF+DGT PCP ++PS            Y+DW AKDQALMTVINA
Subjt:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPS------------YDDWFAKDQALMTVINA

Query:  TLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRT
        TLSPEALAYVVGST+SKQVW+VLAKLYSS SRSNVVNLKSDLQTI KK DESIDAYIKRIKEIKDKLANVST +N+EDLLIYALNGLP EYNTFRTSMRT
Subjt:  TLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRT

Query:  RSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRG--HGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQICL
        RS PVTFEELHVLL+AEESALAKQSK DD   QPT LL+SSQSL+S A TFNNNFVRG G G  +GHGR SFD Q RG G SQ+Q+  V +NH++CQIC 
Subjt:  RSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRG--HGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQICL

Query:  RRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSGQVMEQNFVPK
        RRGH ALDCFNRMNYNF GRHPP  LAAMVASQNNAFLSIVN              SDMNY+SLA  Y+GEEQVGVG+GQ+ PISHSGQV  + FVPK
Subjt:  RRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSGQVMEQNFVPK

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X21.1e-14574.1Show/hide
Query:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI
        M SS +  SSS E + LSPI LLSNICNLIS++LDSTN+VLWKFQLT +LKAHKL+GFIDGT PC              P SNPSY+DW AKDQALMTVI
Subjt:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI

Query:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM
        NATLSPEALAYVVGST+SKQVW+VLAKLYSS SRSNVVNLKSDLQTI KK DESIDAYIKRIKEIKDKLANVST +N+EDLLIYALNGLP EYNTFRTSM
Subjt:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM

Query:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRG--HGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQI
        RTRS PVTFEELHVLL+AEESALAKQSK DD   QPT LL+SSQSL+S A TF+NNFVRG G G  +GHGR SFD Q RG GSS +Q+  V +NH++CQI
Subjt:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRG--HGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQI

Query:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSG
        C RRGH ALDCFNRMNYNF GRHPP  LAAMVASQNNAFLSIVN              SDMNY+SLA  Y+GEEQVG+G+GQ+ P+SHSG
Subjt:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSG

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X36.4e-14973.75Show/hide
Query:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI
        M SS +  SSS E + LSPI LLSNICNLIS++LDSTN+VLWKFQLT +LKAHKL+GFIDGT PC              P SNPSY+DW AKDQALMTVI
Subjt:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI

Query:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM
        NATLSPEALAYVVGST+SKQVW+VLAKLYSS SRSNVVNLKSDLQTI KK DESIDAYIKRIKEIKDKLANVST +N+EDLLIYALNGLP EYNTFRTSM
Subjt:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM

Query:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRG--HGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQI
        RTRS PVTFEELHVLL+AEESALAKQSK DD   QPT LL+SSQSL+S A TF+NNFVRG G G  +GHGR SFD Q RG GSS +Q+  V +NH++CQI
Subjt:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRG--HGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQI

Query:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSGQVMEQNFVPK
        C RRGH ALDCFNRMNYNF GRHPP  LAAMVASQNNAFLSIVN              SDMNY+SLA  Y+GEEQVG+G+GQ+ P+SHSGQV  + FVPK
Subjt:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSGQVMEQNFVPK

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X11.1e-14574.1Show/hide
Query:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI
        M SS +  SSS E + LSPI LLSNICNLIS++LDSTN+VLWKFQLT +LKAHKL+GFIDGT PC              P SNPSY+DW AKDQALMTVI
Subjt:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI

Query:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM
        NATLSPEALAYVVGST+SKQVW+VLAKLYSS SRSNVVNLKSDLQTI KK DESIDAYIKRIKEIKDKLANVST +N+EDLLIYALNGLP EYNTFRTSM
Subjt:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM

Query:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRG--HGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQI
        RTRS PVTFEELHVLL+AEESALAKQSK DD   QPT LL+SSQSL+S A TF+NNFVRG G G  +GHGR SFD Q RG GSS +Q+  V +NH++CQI
Subjt:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRG--HGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQI

Query:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSG
        C RRGH ALDCFNRMNYNF GRHPP  LAAMVASQNNAFLSIVN              SDMNY+SLA  Y+GEEQVG+G+GQ+ P+SHSG
Subjt:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHSG

A0A5D3CLI6 T4.54.3e-14574.04Show/hide
Query:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI
        M SS +  SSS E + LSPI LLSNICNLIS++LDSTN+VLWKFQLT +LKAHKL+GFIDGT PC              P SNPSY+DW AKDQALMTVI
Subjt:  MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPC--------------PISNPSYDDWFAKDQALMTVI

Query:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM
        NATLSPEALAYVVGST+SKQVW+VLAKLYSS SRSNVVNLKSDLQTI KK DESIDAYIKRIKEIKDKLANVST +N+EDLLIYALNGLP EYNTFRTSM
Subjt:  NATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSM

Query:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRG--HGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQI
        RTRS PVTFEELHVLL+AEESALAKQSK DD   QPT LL+SSQSL+S A TF+NNFVRG G G  +GHGR SFD Q RG GSS +Q+  V +NH++CQI
Subjt:  RTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRG--HGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQI

Query:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHS
        C RRGH ALDCFNRMNYNF GRHPP  LAAMVASQNNAFLSIVN              SDMNY+SLA  Y+GEEQVG+G+GQ+ P+SHS
Subjt:  CLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------SDMNYLSLASGYHGEEQVGVGSGQSLPISHS

A0A6J1D9L6 uncharacterized protein LOC1110188921.8e-11156.76Show/hide
Query:  SSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDG---------------------TQPCPISNPSYDDWFAKDQALMT
        +SSS++T+ +L SPI LLSNICNL+SI+LDST+++LWKFQLT +LKAHKLFGFIDG                     T   P+ NP ++DW AKDQALMT
Subjt:  SSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDG---------------------TQPCPISNPSYDDWFAKDQALMT

Query:  VINATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRT
        +INATLS EALAYVV S TSKQVW VL K YSS+SR+NVVNLKSDLQ+I KK++ESIDAY+KRIKEIKDK ANVS  +NDE LLIYALNGL TEYNT  T
Subjt:  VINATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRT

Query:  SMRTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTF--NNNFVRGRGRGHGHGRS----SFDTQGRGGGSSQQQQLVVANN
        SMRTR+  V+FEELHV +K+EESA+ KQ KR+DL  QP AL ASS    +  S F  N +  RGRG+ +G G++    +F  QGRG  S        A+N
Subjt:  SMRTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTF--NNNFVRGRGRGHGHGRS----SFDTQGRGGGSSQQQQLVVANN

Query:  HSSCQICLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------------SDMNYLSLASGYHGEEQVGVGSGQSLPISH
         S CQIC + GH ALDC+NRMN++F GRHPP  LAAMVA QNN++L++ N                    S+++  S+AS Y+GEE + VGSGQS PI+H
Subjt:  HSSCQICLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVN--------------------SDMNYLSLASGYHGEEQVGVGSGQSLPISH

Query:  --SGQVMEQNFVPK
           GQV   N+VP+
Subjt:  --SGQVMEQNFVPK

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.3e-0923Show/hide
Query:  WKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESI
        W+ ++  LL    L   +D     P      +DW   D+   + I   LS + +  ++   T++ +W  L  LY S + +N + LK  L  +      + 
Subjt:  WKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESI

Query:  DAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNN
         +++     +  +LAN+   + +ED  I  LN LP+ Y+   T++    T +  +++   L   E    K  K+ +   Q  AL+               
Subjt:  DAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNN

Query:  NFVRGRGRGHGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIALDCFN--RMNYNFHGRHPPHHLAAMVASQNNAFLSI
            GRGR +    +++   G  G S  + +  V N    C  C + GH   DC N  +      G+    + AAMV + +N  L I
Subjt:  NFVRGRGRGHGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIALDCFN--RMNYNFHGRHPPHHLAAMVASQNNAFLSI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.8e-2327.25Show/hide
Query:  SSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFID----------GTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAY
        ++ + E  L +  +L  N+ N+   KL STNY++W  Q+  L   ++L GF+D          GT   P  NP Y  W  +D+ + + +   +S      
Subjt:  SSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFID----------GTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAY

Query:  VVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEE
        V  +TT+ Q+W  L K+Y++ S  +V  L++ L+  + K  ++ID Y++ +    D+LA +   ++ ++ +   L  LP EY      +  + TP T  E
Subjt:  VVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEE

Query:  LHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGHGHGRSSFDTQGRGGGSSQQQQLVV---ANNHSS------CQICLRR
        +H  L   ES +   S    +      + A++ S  +  +T NNN        +G+  + +D +     S   QQ       NN+ S      CQIC  +
Subjt:  LHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGHGHGRSSFDTQGRGGGSSQQQQLVV---ANNHSS------CQICLRR

Query:  GHIALDCFNRMNY--NFHGRHPPHHLA-----AMVA-----SQNNAFLSI-----VNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSG
        GH A  C    ++  + + + PP         A +A     S NN  L       + SD N LSL   Y G + V V  G ++PISH+G
Subjt:  GHIALDCFNRMNY--NFHGRHPPHHLA-----AMVA-----SQNNAFLSI-----VNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE28.1e-1623.9Show/hide
Query:  SSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPIS----------NPSYDDWFAKDQALMTVINATLSPEALAY
        ++ + E  L++  +L  N+ N+   KL STNY++W  Q+  L   ++L GF+DG+ P P +          NP Y  W  +D+ + + I   +S      
Subjt:  SSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPIS----------NPSYDDWFAKDQALMTVINATLSPEALAY

Query:  VVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEE
        V  +TT+ Q+W  L K+Y++ S  +V  L+                +I R     D+LA +   ++ ++ +   L  LP +Y      +  + TP +  E
Subjt:  VVGSTTSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEE

Query:  LHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGHGHGR-----SSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIA
        +H  L   ES L        L      ++  + +++++ +T  N     RG    +       +S+     G  S  +Q          CQIC  +GH A
Subjt:  LHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRGRGHGHGR-----SSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIA

Query:  LDCFNRMNYN-----------FHGRHPPHHLAAMVASQNNAFL------SIVNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSG
          C     +            F    P  +LA       N +L        + SD N LS    Y G + V +  G ++PI+H+G
Subjt:  LDCFNRMNYN-----------FHGRHPPHHLAAMVASQNNAFL------SIVNSDMNYLSLASGYHGEEQVGVGSGQSLPISHSG

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.0e-0523.27Show/hide
Query:  SNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPIS-NPSYDDWFAKDQALMTVINATLSPEALAYVVGST
        S S +S       L P +   +  ++  +  D  NYV WK +  + L+  K FGFIDGT P P   +P Y  W   +  +M  +  +++ + L  V+ + 
Subjt:  SNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPIS-NPSYDDWFAKDQALMTVINATLSPEALAYVVGST

Query:  TSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTV
        T+ ++W  L +++       +  L+  L T+ ++  +S++ Y  ++ ++  +L+  + +
Subjt:  TSKQVWNVLAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTV

AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.5e-1228.06Show/hide
Query:  LSNICNLISIKLD--STNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEAL-AYVVGSTTSKQVWNVLAKLYSSSS
        +SNI + I + LD   +NY  W+    T   +  + G IDGT     +N +  +W  +D  +   +  TL+P+      V S+TS+ +W  +   + ++ 
Subjt:  LSNICNLISIKLD--STNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEAL-AYVVGSTTSKQVWNVLAKLYSSSS

Query:  RSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHVLLKAEESALAKQSKRDDLC
         +  + L S+L+T     D  +  Y +++K++ D L NV   V D +L++Y LNGL  +++     ++ R    +F++   +L+ EE  L +  K     
Subjt:  RSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHVLLKAEESALAKQSKRDDLC

Query:  KQPTALLASSQSLMSYASTFN--NNFVRGRG-----RGHGHGRSSFDTQGRGG
          PT +  SS S +   S      NF R  G     RG G G + F  +GRGG
Subjt:  KQPTALLASSQSLMSYASTFN--NNFVRGRG-----RGHGHGRSSFDTQGRGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCTCTAATAGTTCATCCTCTTCTTCGACCGAGACGAACTTGCTCTCACCGATTGTTCTGTTGTCGAACATCTGCAACCTGATATCGATTAAGCTCGACTCGAC
GAATTATGTCCTTTGGAAGTTTCAGTTAACAACGCTTTTGAAAGCTCATAAACTTTTTGGCTTTATTGATGGTACTCAACCATGTCCGATATCGAATCCTTCGTACGACG
ATTGGTTTGCTAAGGATCAAGCTCTTATGACAGTCATAAATGCTACACTTTCACCTGAGGCTTTGGCATATGTGGTTGGTAGCACTACTTCTAAACAGGTTTGGAATGTT
CTTGCAAAGCTTTATTCTTCTAGTTCAAGGTCTAATGTAGTGAATTTGAAGTCTGATCTACAAACTATTTCCAAGAAGTCTGATGAATCTATTGATGCCTATATTAAACG
GATTAAGGAGATCAAGGACAAGCTTGCTAATGTTTCTACTGTTGTCAATGATGAGGATCTTCTTATCTATGCTCTAAATGGCCTACCAACCGAGTATAATACTTTTCGAA
CGTCGATGCGTACGCGTTCTACGCCTGTTACTTTTGAAGAACTTCATGTTCTTCTAAAAGCTGAGGAATCAGCTCTTGCAAAACAGTCTAAGCGTGATGATTTGTGTAAA
CAACCAACTGCTTTACTTGCTTCTTCTCAATCTCTCATGTCTTATGCTTCTACTTTTAATAATAACTTTGTTCGAGGTCGTGGACGTGGACATGGACATGGACGTTCTTC
TTTTGATACTCAAGGTCGCGGTGGTGGTTCTTCCCAACAGCAGCAGTTGGTTGTTGCTAATAATCATTCATCTTGTCAGATTTGTTTACGTCGTGGCCATATTGCGCTCG
ATTGTTTCAATCGTATGAACTATAATTTTCATGGACGCCACCCTCCACATCACCTTGCTGCAATGGTTGCATCCCAGAATAATGCTTTTCTGTCTATTGTTAATTCTGAT
ATGAATTATCTTTCTCTTGCATCTGGATATCATGGTGAAGAACAAGTTGGCGTTGGTAGTGGACAGTCTCTGCCTATTTCTCATTCAGGACAAGTAATGGAGCAAAATTT
TGTTCCAAAGAACTAG
mRNA sequenceShow/hide mRNA sequence
TTTCTTCTTATGTCATATGGATTCCTCTAATAGTTCATCCTCTTCTTCGACCGAGACGAACTTGCTCTCACCGATTGTTCTGTTGTCGAACATCTGCAACCTGATATCGA
TTAAGCTCGACTCGACGAATTATGTCCTTTGGAAGTTTCAGTTAACAACGCTTTTGAAAGCTCATAAACTTTTTGGCTTTATTGATGGTACTCAACCATGTCCGATATCG
AATCCTTCGTACGACGATTGGTTTGCTAAGGATCAAGCTCTTATGACAGTCATAAATGCTACACTTTCACCTGAGGCTTTGGCATATGTGGTTGGTAGCACTACTTCTAA
ACAGGTTTGGAATGTTCTTGCAAAGCTTTATTCTTCTAGTTCAAGGTCTAATGTAGTGAATTTGAAGTCTGATCTACAAACTATTTCCAAGAAGTCTGATGAATCTATTG
ATGCCTATATTAAACGGATTAAGGAGATCAAGGACAAGCTTGCTAATGTTTCTACTGTTGTCAATGATGAGGATCTTCTTATCTATGCTCTAAATGGCCTACCAACCGAG
TATAATACTTTTCGAACGTCGATGCGTACGCGTTCTACGCCTGTTACTTTTGAAGAACTTCATGTTCTTCTAAAAGCTGAGGAATCAGCTCTTGCAAAACAGTCTAAGCG
TGATGATTTGTGTAAACAACCAACTGCTTTACTTGCTTCTTCTCAATCTCTCATGTCTTATGCTTCTACTTTTAATAATAACTTTGTTCGAGGTCGTGGACGTGGACATG
GACATGGACGTTCTTCTTTTGATACTCAAGGTCGCGGTGGTGGTTCTTCCCAACAGCAGCAGTTGGTTGTTGCTAATAATCATTCATCTTGTCAGATTTGTTTACGTCGT
GGCCATATTGCGCTCGATTGTTTCAATCGTATGAACTATAATTTTCATGGACGCCACCCTCCACATCACCTTGCTGCAATGGTTGCATCCCAGAATAATGCTTTTCTGTC
TATTGTTAATTCTGATATGAATTATCTTTCTCTTGCATCTGGATATCATGGTGAAGAACAAGTTGGCGTTGGTAGTGGACAGTCTCTGCCTATTTCTCATTCAGGACAAG
TAATGGAGCAAAATTTTGTTCCAAAGAACTAGCACTAATGGTCTATACCTGACTTCAAAAGCTATCCTACTGCTACTAATGTTGATACCAAGTGTTATTTCATTCATGTT
GTTGTTCATTAAGTTTGTTCTTTTCCTGTTGAATCTTCCATGTTCTTCAAATAATTGAGTATGAGAATATTGTCTACGATCAAGGACGGTTTCTTTGCACTTTTTAGAAC
ATCGAACAAAAGAACTAAGGTGGGTGGAAATGTAACAGCCCAAGCTCACCGCTAGCAAATATTGTCTTCTTTAGGCTTTTCCTTCTGGGTTTCCCCTCAATGTTCCAAAA
CGCGTTTATTAGAAAGAGGTTTCC
Protein sequenceShow/hide protein sequence
MDSSNSSSSSSTETNLLSPIVLLSNICNLISIKLDSTNYVLWKFQLTTLLKAHKLFGFIDGTQPCPISNPSYDDWFAKDQALMTVINATLSPEALAYVVGSTTSKQVWNV
LAKLYSSSSRSNVVNLKSDLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRTRSTPVTFEELHVLLKAEESALAKQSKRDDLCK
QPTALLASSQSLMSYASTFNNNFVRGRGRGHGHGRSSFDTQGRGGGSSQQQQLVVANNHSSCQICLRRGHIALDCFNRMNYNFHGRHPPHHLAAMVASQNNAFLSIVNSD
MNYLSLASGYHGEEQVGVGSGQSLPISHSGQVMEQNFVPKN