; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10009421 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10009421
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr06:5712485..5714043
RNA-Seq ExpressionHG10009421
SyntenyHG10009421
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA3468559.1 reverse transcriptase [Gossypium australe]4.2e-3140.12Show/hide
Query:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIK--WKDKSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIF
        G++GGL L WKED+DVT+++F   H+D  IK     + WRFTG+YG P    + + W LL+RLS++ + PWL+ GD NE+++  EK GG P+++K ++IF
Subjt:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIK--WKDKSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIF

Query:  KEALDECNLVDMG----WFY---SDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEIISNAGLWTSES
        ++ L++CNL+D+G    W Y   SDH P+L+  +   ++  R + F FE +W  E   +E +    +W S S
Subjt:  KEALDECNLVDMG----WFY---SDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEIISNAGLWTSES

KAG6636592.1 hypothetical protein CIPAW_11G121200 [Carya illinoinensis]7.9e-3037.37Show/hide
Query:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIKWKD-KSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIFK
        G  GGL L W  D+ V +++F  +HID  IK  D   WRFTG+YG+P+++R+T TW L+R LS    LPWL+GGDLNEVL  HEK GG  +    I+ F+
Subjt:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIKWKD-KSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIFK

Query:  EALDECNLVDMG-------WF-------------------------------------YSDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEII
        E L EC L D+G       W+                                     +SDH PIL E ++ Q + +R + F  E  W+ E++C+ II
Subjt:  EALDECNLVDMG-------WF-------------------------------------YSDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEII

MBA0879484.1 hypothetical protein [Gossypium schwendimanii]3.9e-2940.76Show/hide
Query:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIKWKD--KSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIF
        GSKGGL L WK D+ +++++F  +HID  +K +   + WRFTG YG P    K  +W+LL  L  D   PWL+ GD NE+L+ +EK GG P++   +Q F
Subjt:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIKWKD--KSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIF

Query:  KEALDECNLVDMGW--FYSDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEII
        +EAL+ C L D+G+    SDH P+L+       ++   + FKFE +W+ E+ C++ I
Subjt:  KEALDECNLVDMGW--FYSDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEII

XP_042950313.1 uncharacterized protein LOC122282426 [Carya illinoinensis]7.9e-3037.37Show/hide
Query:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIKWKD-KSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIFK
        G  GGL L W  D+ V +++F  +HID  IK  D   WRFTG+YG+P+++R+T TW L+R LS    LPWL+GGDLNEVL  HEK GG  +    I+ F+
Subjt:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIKWKD-KSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIFK

Query:  EALDECNLVDMG-------WF-------------------------------------YSDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEII
        E L EC L D+G       W+                                     +SDH PIL E ++ Q + +R + F  E  W+ E++C+ II
Subjt:  EALDECNLVDMG-------WF-------------------------------------YSDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEII

XP_042980077.1 uncharacterized protein LOC122310261 [Carya illinoinensis]2.7e-3037.37Show/hide
Query:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIKWKD-KSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIFK
        G  GGL L W  D+ V +++F  +HID  IK  D   WRFTG+YG+P++ R+T TW L+R LS    +PWL+GGDLNEVL  HEK GG  +    I+ F+
Subjt:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIKWKD-KSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIFK

Query:  EALDECNLVDMG-------WF-------------------------------------YSDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEII
        E L EC L D+G       W+                                     +SDH PIL E ++ Q + +R + F+FE  W+ E++C+ II
Subjt:  EALDECNLVDMG-------WF-------------------------------------YSDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEII

TrEMBL top hitse value%identityAlignment
A0A5B6VH75 Reverse transcriptase2.0e-3140.12Show/hide
Query:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIK--WKDKSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIF
        G++GGL L WKED+DVT+++F   H+D  IK     + WRFTG+YG P    + + W LL+RLS++ + PWL+ GD NE+++  EK GG P+++K ++IF
Subjt:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIK--WKDKSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIF

Query:  KEALDECNLVDMG----WFY---SDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEIISNAGLWTSES
        ++ L++CNL+D+G    W Y   SDH P+L+  +   ++  R + F FE +W  E   +E +    +W S S
Subjt:  KEALDECNLVDMG----WFY---SDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEIISNAGLWTSES

A0A5B6VIP7 Reverse transcriptase2.5e-2940Show/hide
Query:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIKWKD--KSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIF
        GS+GGL L WKED+ VT+++F   HID  IK  D    WRFTG+YG P    K + W LL+RL+++   PWL+ GD NE+L+  EK GG  ++ K ++ F
Subjt:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIKWKD--KSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIF

Query:  KEALDECNLVDMG----WF-------------YSDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEIISNAGLWTS
        +E LD+C L+D+G    WF             YSDH P+LL           +R F FE +W  E   +E++  +  W S
Subjt:  KEALDECNLVDMG----WF-------------YSDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEIISNAGLWTS

A0A5B6W0J0 Endonuclease/exonuclease/phosphatase4.2e-2941.61Show/hide
Query:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIKWKD--KSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIF
        GS+GGL L WK+++ V +++F   HID+ IK  +  + WR+TG+YG P    K   W LLRRL ++ + PWL+ GD NE+LF  EK GG  +++K ++ F
Subjt:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIKWKD--KSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIF

Query:  KEALDECNLVDMG------WFYSDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEII
        +E L++C LVD+G      +  SDH PILL+ N         R F FE  W  E+  +E+I
Subjt:  KEALDECNLVDMG------WFYSDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEII

A0A5B6WIA2 Reverse transcriptase8.0e-2836.68Show/hide
Query:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIKWK--DKSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIF
        GSKGGL L WK+D+DV +K+F   HID  IK +  ++ WRFTG YG P S  K   W+LL+RL+++ D PWL+ GD NE+L+  EK GG P++SK +++F
Subjt:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIKWK--DKSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIF

Query:  KEALDECNLVDMG--------------------------------------------WFYSDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEII
        +E L +C L D+G                                            +  SDH P+LLE N  +      R F FE +W  EK  + +I
Subjt:  KEALDECNLVDMG--------------------------------------------WFYSDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEII

A0A7J9N8H0 CCHC-type domain-containing protein1.9e-2940.76Show/hide
Query:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIKWKD--KSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIF
        GSKGGL L WK D+ +++++F  +HID  +K +   + WRFTG YG P    K  +W+LL  L  D   PWL+ GD NE+L+ +EK GG P++   +Q F
Subjt:  GSKGGLYLFWKEDVDVTIKTFYSHHIDSFIKWKD--KSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIF

Query:  KEALDECNLVDMGW--FYSDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEII
        +EAL+ C L D+G+    SDH P+L+       ++   + FKFE +W+ E+ C++ I
Subjt:  KEALDECNLVDMGW--FYSDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEII

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTCTAAAGGGGGTCTTTATCTGTTTTGGAAGGAGGATGTTGATGTCACTATCAAGACTTTCTATTCTCACCATATAGACTCATTTATTAAGTGGAAGGATAAAAG
TTGGCGCTTCACTGGAATGTATGGCTACCCTGAATCAAATAGGAAGACGCTAACTTGGGAGTTACTGAGGAGACTTTCAAAGGATCACGATTTGCCATGGCTGATTGGAG
GAGATCTCAACGAAGTTCTTTTTGATCATGAGAAGTTAGGCGGCCCCCCGAAGAATTCCAAATCTATACAAATCTTTAAAGAAGCTCTTGATGAGTGCAACTTAGTTGAC
ATGGGTTGGTTCTACTCGGATCACCGTCCAATATTACTGGAGCTGAATCATCTACAAAAAGTAAGGAAGAGAGAGCGTCATTTCAAGTTTGAAGAGTTCTGGATCCAAGA
GAAAGAATGTAAGGAGATTATCTCTAATGCCGGCCTTTGGACCTCAGAGTCGGAAATCAGATGCTGGATTGAAGACATTTGCTTCCTTTCAAAGAACTTTGAGTCTTGTG
ATTTCTTTTACATTCCGCGTTCTTGTAATATGCTTGCTGATTATGTAGCAAGGTTGGCTAAATCTAATTCTAATTGTTACACTTGGGTGGAGGAAATCCCCAGTGTTTTT
CAGCGTTTAGCATTTCGCGATTGTTTTTTGTTTCTGCCCGTTGAGGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTCTAAAGGGGGTCTTTATCTGTTTTGGAAGGAGGATGTTGATGTCACTATCAAGACTTTCTATTCTCACCATATAGACTCATTTATTAAGTGGAAGGATAAAAG
TTGGCGCTTCACTGGAATGTATGGCTACCCTGAATCAAATAGGAAGACGCTAACTTGGGAGTTACTGAGGAGACTTTCAAAGGATCACGATTTGCCATGGCTGATTGGAG
GAGATCTCAACGAAGTTCTTTTTGATCATGAGAAGTTAGGCGGCCCCCCGAAGAATTCCAAATCTATACAAATCTTTAAAGAAGCTCTTGATGAGTGCAACTTAGTTGAC
ATGGGTTGGTTCTACTCGGATCACCGTCCAATATTACTGGAGCTGAATCATCTACAAAAAGTAAGGAAGAGAGAGCGTCATTTCAAGTTTGAAGAGTTCTGGATCCAAGA
GAAAGAATGTAAGGAGATTATCTCTAATGCCGGCCTTTGGACCTCAGAGTCGGAAATCAGATGCTGGATTGAAGACATTTGCTTCCTTTCAAAGAACTTTGAGTCTTGTG
ATTTCTTTTACATTCCGCGTTCTTGTAATATGCTTGCTGATTATGTAGCAAGGTTGGCTAAATCTAATTCTAATTGTTACACTTGGGTGGAGGAAATCCCCAGTGTTTTT
CAGCGTTTAGCATTTCGCGATTGTTTTTTGTTTCTGCCCGTTGAGGCTTAA
Protein sequenceShow/hide protein sequence
MGSKGGLYLFWKEDVDVTIKTFYSHHIDSFIKWKDKSWRFTGMYGYPESNRKTLTWELLRRLSKDHDLPWLIGGDLNEVLFDHEKLGGPPKNSKSIQIFKEALDECNLVD
MGWFYSDHRPILLELNHLQKVRKRERHFKFEEFWIQEKECKEIISNAGLWTSESEIRCWIEDICFLSKNFESCDFFYIPRSCNMLADYVARLAKSNSNCYTWVEEIPSVF
QRLAFRDCFLFLPVEA