; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10017675 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10017675
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr03:18377132..18385347
RNA-Seq ExpressionHG10017675
SyntenyHG10017675
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8676568.1 L-ascorbate oxidase [Hibiscus syriacus]1.3e-2131.12Show/hide
Query:  GLVANDGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSYVGRLQLIHSVLQSFSGF--------
        GLV N  K  +F  G+  E+   +    D  +  L V YLG+PLI+ +L   DC P+++K+    ++W AR LSY GR+QLI +++ +   +        
Subjt:  GLVANDGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSYVGRLQLIHSVLQSFSGF--------

Query:  -----------------------PTASNPVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVETKDHLFFDCPFSQ
                                 A    + W  +  FG +I RH+ I+W+AI NRL T+DR+  +   +  V V+C    ET++H+F DCP S+
Subjt:  -----------------------PTASNPVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVETKDHLFFDCPFSQ

XP_018498238.1 PREDICTED: uncharacterized protein LOC108865598 [Pyrus x bretschneideri]2.6e-2235.16Show/hide
Query:  LGQFQGLSGLVANDGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSY----------VGRLQLI
        L  F  LSGL AN  KSS+F  G+  ++   +    +F +  L + YLG+PLIS+ L   DC  ++E+V + +R+W  + LS+          + +  L 
Subjt:  LGQFQGLSGLVANDGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSY----------VGRLQLI

Query:  HSVLQSFSGFPTASNPVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVETKDHLFFDCPFS
           L  ++     S P V WS L  +  NI + +FI WLAI+ +L   DRI  + P +    +LCSI  E+  HLFF CPF+
Subjt:  HSVLQSFSGFPTASNPVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVETKDHLFFDCPFS

XP_022157473.1 uncharacterized protein LOC111024165 [Momordica charantia]3.1e-2335.81Show/hide
Query:  DGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSYVGRLQLIHSVLQSF----------------
        D +  +F +GL   + D L+AF+ F + SL V YLG+ L+S R+ + DC P+LE++   VR+WSAR LS+   L LI  V QSF                
Subjt:  DGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSYVGRLQLIHSVLQSF----------------

Query:  -------------------------------SGFPTASN---------PVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSI
                                       SG  + SN         P V W +L  FGGNI +H+FIAWLA+R+RL T+DR+ RW   +    V C+ 
Subjt:  -------------------------------SGFPTASN---------PVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSI

Query:  KVETKDHLFFDCPFS
        + E+ D+LFF+CPFS
Subjt:  KVETKDHLFFDCPFS

XP_038888122.1 uncharacterized protein LOC120078021 [Benincasa hispida]8.6e-2638.25Show/hide
Query:  VLGQFQGLSGLVANDGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSYVGRLQLIHSVLQSF--
        VL  F GLSGLVAN G SS+F VG+     ++L+AFM F L SL V YLGLPL++ RL   DC P+++++T+ +RSW+AR+LS+ GRLQLI SVLQSF  
Subjt:  VLGQFQGLSGLVANDGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSYVGRLQLIHSVLQSF--

Query:  ------------------------------------------------SG-FPTAS--------NPVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDR
                                                        SG F  AS         P V+W  +   GG I RH F AWLA+R+R+     
Subjt:  ------------------------------------------------SG-FPTAS--------NPVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDR

Query:  IGRWCPDIHRVYVLCSI
        +   C +I+ V +LC I
Subjt:  IGRWCPDIHRVYVLCSI

XP_039031002.1 uncharacterized protein LOC120165586 [Hibiscus syriacus]4.0e-2331.71Show/hide
Query:  VLGQFQGLSGLVANDGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSYVGRLQLIHSVLQSFSG
        VL +F   SGLV N  K  +F  G+  E+   +    D  +  L V YLG+PLI+ +L   DC P+++K+    ++W AR LSY GR+QLI +++ +   
Subjt:  VLGQFQGLSGLVANDGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSYVGRLQLIHSVLQSFSG

Query:  F-------------------------------PTASNPVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVETKDHLFFD
        +                                 A    + W  +  FG +I RH+ I+W+AI NRL T+DR+  +   +  V V+C    ET++H+F D
Subjt:  F-------------------------------PTASNPVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVETKDHLFFD

Query:  CPFSQ
        CP S+
Subjt:  CPFSQ

TrEMBL top hitse value%identityAlignment
A0A5A7TWG5 Reverse transcriptase domain-containing protein1.9e-1842.67Show/hide
Query:  EEVKGVNLLLADGLNGGASIF-PVSELSASY-KVVLGQFQGLSGLVANDGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPI
        E+VK  +L  AD L     IF    ELS  + +  L +F  LSGL AN  KSS+F  G+  E+   L+A M F   +LSV YLGLPL++ RL   DC P+
Subjt:  EEVKGVNLLLADGLNGGASIF-PVSELSASY-KVVLGQFQGLSGLVANDGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPI

Query:  LEKVTSCVRSWSARALSYVGRLQLIHSVLQSFSGFPTASNPVVAWSNLFL
        ++++TS +RSW+AR LS+ GR+QL+ SVL+S           V W+++F+
Subjt:  LEKVTSCVRSWSARALSYVGRLQLIHSVLQSFSGFPTASNPVVAWSNLFL

A0A6A2XLW8 L-ascorbate oxidase6.2e-2231.12Show/hide
Query:  GLVANDGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSYVGRLQLIHSVLQSFSGF--------
        GLV N  K  +F  G+  E+   +    D  +  L V YLG+PLI+ +L   DC P+++K+    ++W AR LSY GR+QLI +++ +   +        
Subjt:  GLVANDGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSYVGRLQLIHSVLQSFSGF--------

Query:  -----------------------PTASNPVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVETKDHLFFDCPFSQ
                                 A    + W  +  FG +I RH+ I+W+AI NRL T+DR+  +   +  V V+C    ET++H+F DCP S+
Subjt:  -----------------------PTASNPVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVETKDHLFFDCPFSQ

A0A6J1D875 uncharacterized protein LOC1110178115.3e-2131.42Show/hide
Query:  VLGQFQGLSGLVANDGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSYVGRLQLIHSVLQSFS-
        VL  F  L+GLVAN  KSS F  G+   + + L+AF DF + S  V YLG+PL S RL + DC P+LE++ S V SWSAR  S+  RLQLI S+LQSF  
Subjt:  VLGQFQGLSGLVANDGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSYVGRLQLIHSVLQSFS-

Query:  ------------------------------------------------------------------------GF-----------------PTAS-----
                                                                                GF                 P +S     
Subjt:  ------------------------------------------------------------------------GF-----------------PTAS-----

Query:  ----------NPVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLC
                   P+V W +   F GNIS+H+FIAWLA+R+RL T DR+ RW   I    V C
Subjt:  ----------NPVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLC

A0A6J1DTG0 uncharacterized protein LOC1110241651.5e-2335.81Show/hide
Query:  DGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSYVGRLQLIHSVLQSF----------------
        D +  +F +GL   + D L+AF+ F + SL V YLG+ L+S R+ + DC P+LE++   VR+WSAR LS+   L LI  V QSF                
Subjt:  DGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSYVGRLQLIHSVLQSF----------------

Query:  -------------------------------SGFPTASN---------PVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSI
                                       SG  + SN         P V W +L  FGGNI +H+FIAWLA+R+RL T+DR+ RW   +    V C+ 
Subjt:  -------------------------------SGFPTASN---------PVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSI

Query:  KVETKDHLFFDCPFS
        + E+ D+LFF+CPFS
Subjt:  KVETKDHLFFDCPFS

O65244 F21E10.5 protein1.7e-1931.13Show/hide
Query:  VLGQFQGLSGLVANDGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSYVGRLQLIHSVLQS---
        VL +F   SGL  +  KS+++  G+      ++     F +  L V YLGLPL+S RL  +DC P++E++   + +W++R LS+ GRL LI S L S   
Subjt:  VLGQFQGLSGLVANDGKSSLFCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSYVGRLQLIHSVLQS---

Query:  ------------------------FSGFPTASNPV------------VAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVE
                                +SG   +SN               AW     F     +H+F  WLAI N+L T  R+  W        VLC+  +E
Subjt:  ------------------------FSGFPTASNPV------------VAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVE

Query:  TKDHLFFDCPFS
        T+DHLFF C ++
Subjt:  TKDHLFFDCPFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.9e-1140Show/hide
Query:  NPVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVETKDHLFFDCPFSQA
        N +V W     F  ++ +H FI W+   NRL T+DR+  W   I  V +LC+   E++ HLFF+CPF  A
Subjt:  NPVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVETKDHLFFDCPFSQA

AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.9e-1040.28Show/hide
Query:  NPV---VAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVETKDHLFFDCPFSQ
        NP+   V W     F G I +H FIAW+ +R+RL T+DR+  W      + + C+   ET+ HLFFDC F++
Subjt:  NPV---VAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVETKDHLFFDCPFSQ

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.4e-1043.42Show/hide
Query:  EDTDDLSAFMDFPLAS--LSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSYVGRLQLIHSVLQSFSGF
        +D D       FP AS  L V YLGLPL++ ++  +D  P++EK+   +  W+AR LS+ GRLQLI SV+ S + F
Subjt:  EDTDDLSAFMDFPLAS--LSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSYVGRLQLIHSVLQSFSGF

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.6e-0943.08Show/hide
Query:  VAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVETKDHLFFDCPFS
        V W     F     +++ +AW+AI+NRL T DR+  W        VLC   VET+DHLFF CP+S
Subjt:  VAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVETKDHLFFDCPFS

AT5G16486.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.9e-1140.24Show/hide
Query:  GFPTAS-----NPV---VAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVETKDHLFFDCPFS
        GF +A+     NPV   V W     F G I +H FI+W+ IR+RL T+D++  W   +  + +LC+   ET+ HLFFDC F+
Subjt:  GFPTAS-----NPV---VAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVETKDHLFFDCPFS

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.9e-1039.77Show/hide
Query:  SVLQSFSGFPT-----ASNPVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVETKDHLFFDCPFSQA
        S L SFS   T       +P V W+ +  F   I R + I W++   RL T+DR+  W  +I   +VLCS   ET  HLFF+C FS A
Subjt:  SVLQSFSGFPT-----ASNPVVAWSNLFLFGGNISRHTFIAWLAIRNRLQTQDRIGRWCPDIHRVYVLCSIKVETKDHLFFDCPFSQA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTCGCGCTCCCCATCCTCGTTCCCCATTTCGTTCCCTGTCCCCGCCAAATTCCCTATGGGGATCAGGGCGGGGATTCCTTGCGAGGAATTCGGGGGATCGGGGA
GAAGTCGTTGGTAACTGAGTTTGGCTCGTGGGGTGATAAGGTTATCGATAATGTACTTATTAGTGAGGAGGTGAAAGGGGTTAATCTGTTGTTGGCTGATGGATTGAATG
GTGGTGCATCTATTTTTCCAGTTTCAGAATTATCAGCTAGTTATAAGGTTGTTTTGGGGCAGTTTCAAGGGCTGTCTGGTCTAGTTGCTAATGATGGGAAAAGTTCGTTA
TTTTGTGTTGGCCTAACTCCTGAAGACACTGATGATCTTTCTGCGTTCATGGATTTTCCTTTGGCTTCCCTCTCGGTCTGTTACCTTGGCTTGCCTCTTATCTCGAGCAG
GCTTTTTTATACAGATTGTTGGCCCATTTTGGAAAAGGTTACCTCCTGTGTTAGGAGTTGGTCGGCCAGAGCTTTATCTTACGTTGGTCGTCTTCAGCTTATTCACTCTG
TGCTTCAGAGTTTTTCAGGATTCCCTACGGCCTCGAATCCTGTGGTGGCTTGGTCTAACCTCTTTTTGTTTGGAGGTAATATTTCTAGACATACTTTTATTGCATGGCTT
GCTATTCGTAATCGGTTACAGACGCAGGATCGTATCGGTAGGTGGTGTCCTGATATTCATAGGGTTTATGTTTTGTGCTCTATCAAAGTTGAAACAAAGGACCATCTATT
CTTTGATTGCCCTTTCAGTCAGGCCCATGGGGGTACATGCTTGTCTGGGGTTCATGCTCCCATTGGGATACTGAGCTTCTTTGGATTTGCCACTTTAGTGCCCGTAGTTG
GTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTCGCGCTCCCCATCCTCGTTCCCCATTTCGTTCCCTGTCCCCGCCAAATTCCCTATGGGGATCAGGGCGGGGATTCCTTGCGAGGAATTCGGGGGATCGGGGA
GAAGTCGTTGGTAACTGAGTTTGGCTCGTGGGGTGATAAGGTTATCGATAATGTACTTATTAGTGAGGAGGTGAAAGGGGTTAATCTGTTGTTGGCTGATGGATTGAATG
GTGGTGCATCTATTTTTCCAGTTTCAGAATTATCAGCTAGTTATAAGGTTGTTTTGGGGCAGTTTCAAGGGCTGTCTGGTCTAGTTGCTAATGATGGGAAAAGTTCGTTA
TTTTGTGTTGGCCTAACTCCTGAAGACACTGATGATCTTTCTGCGTTCATGGATTTTCCTTTGGCTTCCCTCTCGGTCTGTTACCTTGGCTTGCCTCTTATCTCGAGCAG
GCTTTTTTATACAGATTGTTGGCCCATTTTGGAAAAGGTTACCTCCTGTGTTAGGAGTTGGTCGGCCAGAGCTTTATCTTACGTTGGTCGTCTTCAGCTTATTCACTCTG
TGCTTCAGAGTTTTTCAGGATTCCCTACGGCCTCGAATCCTGTGGTGGCTTGGTCTAACCTCTTTTTGTTTGGAGGTAATATTTCTAGACATACTTTTATTGCATGGCTT
GCTATTCGTAATCGGTTACAGACGCAGGATCGTATCGGTAGGTGGTGTCCTGATATTCATAGGGTTTATGTTTTGTGCTCTATCAAAGTTGAAACAAAGGACCATCTATT
CTTTGATTGCCCTTTCAGTCAGGCCCATGGGGGTACATGCTTGTCTGGGGTTCATGCTCCCATTGGGATACTGAGCTTCTTTGGATTTGCCACTTTAGTGCCCGTAGTTG
GTTGA
Protein sequenceShow/hide protein sequence
MAFALPILVPHFVPCPRQIPYGDQGGDSLRGIRGIGEKSLVTEFGSWGDKVIDNVLISEEVKGVNLLLADGLNGGASIFPVSELSASYKVVLGQFQGLSGLVANDGKSSL
FCVGLTPEDTDDLSAFMDFPLASLSVCYLGLPLISSRLFYTDCWPILEKVTSCVRSWSARALSYVGRLQLIHSVLQSFSGFPTASNPVVAWSNLFLFGGNISRHTFIAWL
AIRNRLQTQDRIGRWCPDIHRVYVLCSIKVETKDHLFFDCPFSQAHGGTCLSGVHAPIGILSFFGFATLVPVVG