; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007764 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007764
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionReverse transcriptase
Genome locationChr10:12509964..12511953
RNA-Seq ExpressionHG10007764
SyntenyHG10007764
Gene Ontology termsNA
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5443558.1 hypothetical protein F2P56_036105, partial [Juglans regia]7.0e-3451.95Show/hide
Query:  GARGGLCLFWRGSVSLDIKSFSSHHIDAQV-CWKGFLWRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVRSESCLESFR
        G  GGL L W+G + + ++SFS HHIDA +    G  WRFTGVYG PE   + LTWNLLRRL+ G   PWLVGGD NE L  +EK+GG  RSE+ +E+FR
Subjt:  GARGGLCLFWRGSVSLDIKSFSSHHIDAQV-CWKGFLWRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVRSESCLESFR

Query:  DCLNDCRLVDLGFSGSLFTW-HGKRNGVDIWERWDRFVGNDMFWNLFNIKRVNH
        + + DC L DLGF G  +TW +G+     I ER DRF+GN+ F  LF    V H
Subjt:  DCLNDCRLVDLGFSGSLFTW-HGKRNGVDIWERWDRFVGNDMFWNLFNIKRVNH

KAF5449841.1 hypothetical protein F2P56_030246 [Juglans regia]3.5e-3349.35Show/hide
Query:  GARGGLCLFWRGSVSLDIKSFSSHHIDAQVCWKGFL-WRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVRSESCLESFR
        G  GGL L W+  +S+ I SFS +HIDA +       WRFTGVYG+P++  + LTWNL+R L    S PWLVGGD NE L   EK+GG     + LE+FR
Subjt:  GARGGLCLFWRGSVSLDIKSFSSHHIDAQVCWKGFL-WRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVRSESCLESFR

Query:  DCLNDCRLVDLGFSGSLFTWHGKRNGVD-IWERWDRFVGNDMFWNLFNIKRVNH
          + DC L DLGF G  FTW   R G+  I ER DRF GN   W +F  +RV H
Subjt:  DCLNDCRLVDLGFSGSLFTWHGKRNGVD-IWERWDRFVGNDMFWNLFNIKRVNH

KAG6636592.1 hypothetical protein CIPAW_11G121200 [Carya illinoinensis]1.4e-3445.73Show/hide
Query:  NLLECSSSGARGGLCLFWRGSVSLDIKSFSSHHIDAQVCWKGFL-WRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVRS
        N     S G  GGL L W   + ++++SFS +HID  +     + WRFTG+YG+P++ R+  TWNL+R L     LPWLVGGDLNE L   EK+GG  R 
Subjt:  NLLECSSSGARGGLCLFWRGSVSLDIKSFSSHHIDAQVCWKGFL-WRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVRS

Query:  ESCLESFRDCLNDCRLVDLGFSGSLFT-WHGKRNGVDIWERWDRFVGNDMFWNLFNIKRVNHLN
         S +E+FR+ L +C L DLG+ G  FT W+G+     I+E  DRFVGND+  +LF    V H N
Subjt:  ESCLESFRDCLNDCRLVDLGFSGSLFT-WHGKRNGVDIWERWDRFVGNDMFWNLFNIKRVNHLN

XP_042950313.1 uncharacterized protein LOC122282426 [Carya illinoinensis]1.4e-3445.73Show/hide
Query:  NLLECSSSGARGGLCLFWRGSVSLDIKSFSSHHIDAQVCWKGFL-WRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVRS
        N     S G  GGL L W   + ++++SFS +HID  +     + WRFTG+YG+P++ R+  TWNL+R L     LPWLVGGDLNE L   EK+GG  R 
Subjt:  NLLECSSSGARGGLCLFWRGSVSLDIKSFSSHHIDAQVCWKGFL-WRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVRS

Query:  ESCLESFRDCLNDCRLVDLGFSGSLFT-WHGKRNGVDIWERWDRFVGNDMFWNLFNIKRVNHLN
         S +E+FR+ L +C L DLG+ G  FT W+G+     I+E  DRFVGND+  +LF    V H N
Subjt:  ESCLESFRDCLNDCRLVDLGFSGSLFT-WHGKRNGVDIWERWDRFVGNDMFWNLFNIKRVNHLN

XP_042980077.1 uncharacterized protein LOC122310261 [Carya illinoinensis]2.4e-3445.73Show/hide
Query:  NLLECSSSGARGGLCLFWRGSVSLDIKSFSSHHIDAQVCWKGFL-WRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVRS
        N     S G  GGL L W   + ++++SFS +HID  +     + WRFTG+YG+P++ R+  TWNL+R L     +PWLVGGDLNE L   EK+GG  R 
Subjt:  NLLECSSSGARGGLCLFWRGSVSLDIKSFSSHHIDAQVCWKGFL-WRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVRS

Query:  ESCLESFRDCLNDCRLVDLGFSGSLFT-WHGKRNGVDIWERWDRFVGNDMFWNLFNIKRVNHLN
         S +E+FR+ L +C L DLG+ G  FT W+G+     I+ER DRFVGND+  +LF    V H N
Subjt:  ESCLESFRDCLNDCRLVDLGFSGSLFT-WHGKRNGVDIWERWDRFVGNDMFWNLFNIKRVNHLN

TrEMBL top hitse value%identityAlignment
A0A2N9FVV5 Reverse transcriptase domain-containing protein1.9e-3248.68Show/hide
Query:  GGLCLFWRGSVSLDIKSFSSHHIDAQVC-WKGFLWRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVRSESCLESFRDCL
        GGL LFW+  ++L IKSFS  HID  +      +WRF G YG PE+  ++ +WN+LR LH  +SLPW   GD NE +S DEK+GG  R+ES +++FRD L
Subjt:  GGLCLFWRGSVSLDIKSFSSHHIDAQVC-WKGFLWRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVRSESCLESFRDCL

Query:  NDCRLVDLGFSGSLFTWHGKR-NGVDIWERWDRFVGNDMFWNLFNIKRVNHL
        +DC   DLGF G  FTW   R NGV +WE+ DR V N  +   F   RV+H+
Subjt:  NDCRLVDLGFSGSLFTWHGKR-NGVDIWERWDRFVGNDMFWNLFNIKRVNHL

A0A2N9I236 Peptidylprolyl isomerase1.4e-3250Show/hide
Query:  SGARGGLCLFWRGSVSLDIKSFSSHHIDAQVCWK-GFLWRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVRSESCLESF
        +G  GGL + W  SVSL I+SFS HHIDA V  + G  WR TG YGYPE   +  +W+LLRRLH G+S PWLV GD NE ++ DEK G   RS + + +F
Subjt:  SGARGGLCLFWRGSVSLDIKSFSSHHIDAQVCWK-GFLWRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVRSESCLESF

Query:  RDCLNDCRLVDLGFSGSLFTWHGKR-NGVDIWERWDRFVGNDMFWNLFNIKRVNHL
        R+ L+DC L DLGF G  FTW  +R N   +  R DR V N  + +LF   RV H+
Subjt:  RDCLNDCRLVDLGFSGSLFTWHGKR-NGVDIWERWDRFVGNDMFWNLFNIKRVNHL

A0A2N9IXK4 RNase H domain-containing protein3.2e-3247.06Show/hide
Query:  GGLCLFWRGSVSLDIKSFSSHHIDAQVC-WKGFLWRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVRSESCLESFRDCL
        GGL +FW+    + IKSFS HHIDA +   +   WRFTG YG PE+ R+  +W+LLR LH  +SLPW   GD NE LS +EK+GGP+RS   ++ FRD +
Subjt:  GGLCLFWRGSVSLDIKSFSSHHIDAQVC-WKGFLWRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVRSESCLESFRDCL

Query:  NDCRLVDLGFSGSLFTWHGKRNGV-DIWERWDRFVGNDMFWNLFNIKRVNHLN
        + C   DLGF+G  FTW   R G   +WER DR +    + +LF + +V HL+
Subjt:  NDCRLVDLGFSGSLFTWHGKRNGV-DIWERWDRFVGNDMFWNLFNIKRVNHLN

A0A5B6UZQ1 Reverse transcriptase4.1e-3244.51Show/hide
Query:  NLLECSSSGARGGLCLFWRGSVSLDIKSFSSHHIDAQVCWKGF--LWRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVR
        N ++  + G RGGLCL W+G + + +KSFS  HI+A +  +G    W+FTG Y  P    K L W+LLRRL + ++ PWLV GD NE +   EKKGG  R
Subjt:  NLLECSSSGARGGLCLFWRGSVSLDIKSFSSHHIDAQVCWKGF--LWRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVR

Query:  SESCLESFRDCLNDCRLVDLGFSGSLFTW-HGKRNGVDIWERWDRFVGNDMFWNLFNIKRVNHL
            +E+FR+ L DC+L D+G+SG+ FTW  G     +I ER DR V N+ + NLF + ++ HL
Subjt:  SESCLESFRDCLNDCRLVDLGFSGSLFTW-HGKRNGVDIWERWDRFVGNDMFWNLFNIKRVNHL

A0A5B6VBB6 Reverse transcriptase1.4e-3244.51Show/hide
Query:  NLLECSSSGARGGLCLFWRGSVSLDIKSFSSHHIDAQVCWKG--FLWRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVR
        N ++ ++ G+RGGLCL W+G +++ +KS+S  HID+ +        WRFTG YG P    K   W+LL+ L  GN  PWLV GD NE L  +EKKGG VR
Subjt:  NLLECSSSGARGGLCLFWRGSVSLDIKSFSSHHIDAQVCWKG--FLWRFTGVYGYPESGRKILTWNLLRRLHEGNSLPWLVGGDLNEGLSDDEKKGGPVR

Query:  SESCLESFRDCLNDCRLVDLGFSGSLFTW-HGKRNGVDIWERWDRFVGNDMFWNLFNIKRVNHL
         +  ++ FRD L +C+L+D+G+SG+ FTW  G     +I ER DR V N+ + +LF +  V HL
Subjt:  SESCLESFRDCLNDCRLVDLGFSGSLFTW-HGKRNGVDIWERWDRFVGNDMFWNLFNIKRVNHL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCTAAAGGTCTTGTTGAAGCCTTGAAGAATTTTGAGTTGACTTCAGAAGAAGATTCATGTCCAGTGGGAATTGTGTCTGGTGGTCTTGACCAATCGTTACCTTT
GAAGCAAATTGATGTGGAGAATGAGGGTTTTGTTTCTATTACTCCTCAAAGCTCTGGGATTCTTAAAGGCGATGAGATCTTGGGTAGAAGTCCTTCGGTTGAGAGTGTTG
AGGGTGACACCTCAGCCAGAAAGTTAAAAGTCTGGAAAAGAAAACTGAAGTTGGAAAAAGCTGAAAACTCTAAGGCTGATCTGTTGGTTATCTTGAAGAGAAAAGGGGAT
GGTGGTGATTCAGGTCAATCTTTGAAAAGAAGTCGTGCAATGCCAGATATTAATTTGAATGTGTGGGAGGCTGATGAGTTTTGGCAGAGGCTGCTAGACAGCCCTGCCTT
GACCAATGAATACAATTTGTTGGAATGCTCGAGCTCGGGTGCGAGAGGTGGTTTATGTCTGTTTTGGAGAGGCTCGGTATCCTTGGATATTAAGTCTTTTTCTTCTCATC
ATATTGATGCACAAGTTTGCTGGAAGGGTTTTTTATGGAGATTCACAGGGGTGTATGGTTATCCGGAAAGTGGTAGGAAGATTCTTACTTGGAATCTCTTACGACGCCTC
CATGAAGGGAATTCTCTTCCTTGGCTGGTGGGGGGCGATTTAAACGAGGGCCTTAGCGATGATGAAAAGAAGGGTGGTCCTGTGAGAAGCGAGAGTTGTCTGGAGAGCTT
TCGTGATTGTTTGAATGATTGTAGGTTGGTGGATCTAGGATTCTCTGGTTCCCTCTTTACATGGCATGGTAAAAGGAACGGTGTTGATATTTGGGAGAGGTGGGATCGTT
TTGTTGGAAACGATATGTTTTGGAATTTATTCAACATAAAAAGAGTTAATCACTTAAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCTAAAGGTCTTGTTGAAGCCTTGAAGAATTTTGAGTTGACTTCAGAAGAAGATTCATGTCCAGTGGGAATTGTGTCTGGTGGTCTTGACCAATCGTTACCTTT
GAAGCAAATTGATGTGGAGAATGAGGGTTTTGTTTCTATTACTCCTCAAAGCTCTGGGATTCTTAAAGGCGATGAGATCTTGGGTAGAAGTCCTTCGGTTGAGAGTGTTG
AGGGTGACACCTCAGCCAGAAAGTTAAAAGTCTGGAAAAGAAAACTGAAGTTGGAAAAAGCTGAAAACTCTAAGGCTGATCTGTTGGTTATCTTGAAGAGAAAAGGGGAT
GGTGGTGATTCAGGTCAATCTTTGAAAAGAAGTCGTGCAATGCCAGATATTAATTTGAATGTGTGGGAGGCTGATGAGTTTTGGCAGAGGCTGCTAGACAGCCCTGCCTT
GACCAATGAATACAATTTGTTGGAATGCTCGAGCTCGGGTGCGAGAGGTGGTTTATGTCTGTTTTGGAGAGGCTCGGTATCCTTGGATATTAAGTCTTTTTCTTCTCATC
ATATTGATGCACAAGTTTGCTGGAAGGGTTTTTTATGGAGATTCACAGGGGTGTATGGTTATCCGGAAAGTGGTAGGAAGATTCTTACTTGGAATCTCTTACGACGCCTC
CATGAAGGGAATTCTCTTCCTTGGCTGGTGGGGGGCGATTTAAACGAGGGCCTTAGCGATGATGAAAAGAAGGGTGGTCCTGTGAGAAGCGAGAGTTGTCTGGAGAGCTT
TCGTGATTGTTTGAATGATTGTAGGTTGGTGGATCTAGGATTCTCTGGTTCCCTCTTTACATGGCATGGTAAAAGGAACGGTGTTGATATTTGGGAGAGGTGGGATCGTT
TTGTTGGAAACGATATGTTTTGGAATTTATTCAACATAAAAAGAGTTAATCACTTAAATTGA
Protein sequenceShow/hide protein sequence
MESKGLVEALKNFELTSEEDSCPVGIVSGGLDQSLPLKQIDVENEGFVSITPQSSGILKGDEILGRSPSVESVEGDTSARKLKVWKRKLKLEKAENSKADLLVILKRKGD
GGDSGQSLKRSRAMPDINLNVWEADEFWQRLLDSPALTNEYNLLECSSSGARGGLCLFWRGSVSLDIKSFSSHHIDAQVCWKGFLWRFTGVYGYPESGRKILTWNLLRRL
HEGNSLPWLVGGDLNEGLSDDEKKGGPVRSESCLESFRDCLNDCRLVDLGFSGSLFTWHGKRNGVDIWERWDRFVGNDMFWNLFNIKRVNHLN