CuGenDBv2

HG10014501 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10014501
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: Chr02:12853109..12853666
RNA-Seq Expression: HG10014501
Synteny: HG10014501
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits (columns: e value, % identity, alignment)
ERM93404.1 hypothetical protein AMTR_s04947p00003620 [Amborella trichopoda]; e value: 4.0e-37; % identity: 50
Query:  NSIFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAE
        N I +ADDR RAIR+Y AP F  L+ GI+   I A  FE+KP+MFQML + GQF  +P EDPH HL  F+ +    K+ GVSEE LR KLFP+SL+  A 
Subjt:  NSIFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAE

Query:  AWLDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA
        +WL++L P+S+T W++L +KF+ KYF   +N K+R +I++F+Q   ES   A
Subjt:  AWLDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA

XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]; e value: 6.8e-37; % identity: 52.74
Query:  FIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAEAWL
        FI DD+DRAIR Y AP F+ L++GI+   I A  FE+KP+MFQML + GQF  +P EDPH HL LFM +    K  GV E+ALR KLFPYS++  A  WL
Subjt:  FIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAEAWL

Query:  DSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESV
        +SL   S+TTW++L +KF+ KYF  N N K R +I +F+Q   ES+
Subjt:  DSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESV

XP_024027611.1 uncharacterized protein LOC112093437 [Morus notabilis]; e value: 1.1e-37; % identity: 52.38
Query:  IFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAEAW
        + IA+DRDRAIRDY  P    L  GI+   I A  FE+KP+MFQML + GQF ++  +DPH HL LF+ +C   K  GV+EEALR KLFPYSL+  A AW
Subjt:  IFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAEAW

Query:  LDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESV
        L+SL P+S+  W++L +KF+ KYF  NKN K R DI +F+Q   E++
Subjt:  LDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESV

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]; e value: 8.9e-37; % identity: 49.04
Query:  AIDQRNSIFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSL
        A ++ N I +ADDR RAIR+Y AP F  L+ GI+   I A  FE+KP+MFQML + GQF   P EDPH H+  F+ +    K+ GVSEEALR KLFP+SL
Subjt:  AIDQRNSIFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSL

Query:  QGDAEAWLDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA
        +  A AWL++L P+S+T W++L +KF+ KYF   +N K+R +I++F+Q+  E+   A
Subjt:  QGDAEAWLDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA

XP_030508936.1 uncharacterized protein LOC115723589 [Cannabis sativa]; e value: 3.4e-36; % identity: 49.03
Query:  DQRNSIFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQG
        ++ N I +ADDR RAIR+Y AP F  L+ GI+   I A  FE+KP+MFQML + GQF   P EDPH H+  F+ +    K+ GVSEEALR KLFP+SL+ 
Subjt:  DQRNSIFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQG

Query:  DAEAWLDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA
         A AWL++L P+S+T W++L +KF+ KYF   +N K+R +I++F+Q   E+   A
Subjt:  DAEAWLDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA

TrEMBL top hits (columns: e value, % identity, alignment)
A0A6J1DSZ5 uncharacterized protein LOC111024107; e value: 1.6e-31; % identity: 45.75
Query:  NSIFIADDRDRAIRDYVAPAFQTLDTGILDQ-PIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDA
        N I +AD +DRA+RDY A   + L++ +++  P DA  FE KP+M QMLN   QF  L +EDP  HL  F+++   C++ G+S++ALR  LFP+SL G A
Subjt:  NSIFIADDRDRAIRDYVAPAFQTLDTGILDQ-PIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDA

Query:  EAWLDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA
         AWL++    +ITTW ++VDKF+ KYF   +N   R +II+FRQ  +E+V+ A
Subjt:  EAWLDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA

A0A6J1DY39 uncharacterized protein LOC111025653; e value: 7.1e-32; % identity: 45.75
Query:  NSIFIADDRDRAIRDYVAPAFQTLDTGILDQ-PIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDA
        N I +AD RDRA+RDY A   + L++ +++  P DA  FE KP+M QMLN+ GQF  L +EDP  HL  F+++    ++ G+S++ALR  LFP+S+ G A
Subjt:  NSIFIADDRDRAIRDYVAPAFQTLDTGILDQ-PIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDA

Query:  EAWLDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA
         AWL++   ++ITTW ++VDKF+ KYF   +N   R +II+FRQ  +E+V+ A
Subjt:  EAWLDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA

A0A6J1E251 uncharacterized protein LOC111025302; e value: 1.7e-33; % identity: 43.85
Query:  MDPLGDDPLVPPRNNVQQNGDQQPQQ--QPAIDQRNSIFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKH
        M+    DP  PP  N   NGD   ++      +  N I +AD+RD A+R+YV  AF  L++GI +    A  FE+KP+MFQ+L + GQF  L NEDP+ H
Subjt:  MDPLGDDPLVPPRNNVQQNGDQQPQQ--QPAIDQRNSIFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKH

Query:  LNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAEAWLDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA
        L  F+ +    ++ G SE+ALR K+FP+SL+  A  W+++L+PNSI TW  L DKF+ KY +L KN   R DI++FRQ  +E+V  A
Subjt:  LNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAEAWLDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA

A0A6J1H7E4 uncharacterized protein LOC111461168; e value: 1.9e-32; % identity: 44.74
Query:  NSIFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAE
        N+I +ADDR+RAIR Y  PA   L+  I+   + A TFE+KP+MFQML + GQF  LP+EDPH HL  F+ +    +  GV ++ +R  LFPYSL+  A+
Subjt:  NSIFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAE

Query:  AWLDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA
        +WL++L P +I +W++L +KF+ KYF   +N ++R +I+AF+Q   E++  A
Subjt:  AWLDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA

U5CUI2 Retrotrans_gag domain-containing protein; e value: 1.9e-37; % identity: 50
Query:  NSIFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAE
        N I +ADDR RAIR+Y AP F  L+ GI+   I A  FE+KP+MFQML + GQF  +P EDPH HL  F+ +    K+ GVSEE LR KLFP+SL+  A 
Subjt:  NSIFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCKVNGVSEEALRFKLFPYSLQGDAE

Query:  AWLDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA
        +WL++L P+S+T W++L +KF+ KYF   +N K+R +I++F+Q   ES   A
Subjt:  AWLDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGATCCGCTTGGAGACGATCCTCTAGTACCTCCTCGGAACAACGTTCAGCAAAATGGAGATCAGCAACCACAGCAACAACCTGCTATAGACCAACGAAACTCCATCTT
CATAGCAGATGATAGAGATAGAGCAATCAGAGACTATGTTGCGCCTGCATTTCAAACTTTAGACACTGGCATCCTCGACCAACCAATAGATGCGCTCACATTTGAAATGA
AACCGTTGATGTTTCAAATGTTGAACTCATTTGGTCAATTTCCCATACTACCCAATGAAGACCCCCACAAACATCTCAATCTTTTTATGAGAATGTGTCGTTATTGTAAA
GTTAATGGTGTTTCTGAAGAGGCATTAAGATTTAAGTTGTTTCCTTACTCTTTGCAGGGAGACGCAGAAGCATGGCTTGATTCATTGCAGCCTAATTCCATCACCACATG
GGATAACCTCGTGGACAAATTTATAGAAAAATACTTCTCACTAAACAAGAATACTAAGTACAGAGGTGATATCATCGCTTTCAGGCAAGCACCATCAGAATCTGTAGATG
GAGCTTAG
mRNA sequence
ATGGATCCGCTTGGAGACGATCCTCTAGTACCTCCTCGGAACAACGTTCAGCAAAATGGAGATCAGCAACCACAGCAACAACCTGCTATAGACCAACGAAACTCCATCTT
CATAGCAGATGATAGAGATAGAGCAATCAGAGACTATGTTGCGCCTGCATTTCAAACTTTAGACACTGGCATCCTCGACCAACCAATAGATGCGCTCACATTTGAAATGA
AACCGTTGATGTTTCAAATGTTGAACTCATTTGGTCAATTTCCCATACTACCCAATGAAGACCCCCACAAACATCTCAATCTTTTTATGAGAATGTGTCGTTATTGTAAA
GTTAATGGTGTTTCTGAAGAGGCATTAAGATTTAAGTTGTTTCCTTACTCTTTGCAGGGAGACGCAGAAGCATGGCTTGATTCATTGCAGCCTAATTCCATCACCACATG
GGATAACCTCGTGGACAAATTTATAGAAAAATACTTCTCACTAAACAAGAATACTAAGTACAGAGGTGATATCATCGCTTTCAGGCAAGCACCATCAGAATCTGTAGATG
GAGCTTAG
Protein sequence
MDPLGDDPLVPPRNNVQQNGDQQPQQQPAIDQRNSIFIADDRDRAIRDYVAPAFQTLDTGILDQPIDALTFEMKPLMFQMLNSFGQFPILPNEDPHKHLNLFMRMCRYCK
VNGVSEEALRFKLFPYSLQGDAEAWLDSLQPNSITTWDNLVDKFIEKYFSLNKNTKYRGDIIAFRQAPSESVDGA
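
As a rough consistency check on this record, the genome location listed above can be compared with the length of the protein sequence. The Python sketch below rests on two assumptions that the record does not state: that the coordinates are 1-based and inclusive, and that the gene is a single uninterrupted coding exon (the identical CDS and mRNA sequences suggest this, but do not prove it). The helper names `parse_location` and `span_length` are hypothetical and are not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a string like 'Chr02:12853109..12853666' into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr02:12853109..12853666")
length = span_length(start, end)

# Assumption: the whole span is coding sequence, so one codon is the stop codon.
protein_length = length // 3 - 1

print(chrom, length, protein_length)
```

Under those assumptions the span is 558 nucleotides, which corresponds to 185 residues plus a stop codon, in agreement with the 185-residue protein sequence shown in this record.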