; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10009891 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10009891
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationChr06:13067274..13071833
RNA-Seq ExpressionHG10009891
SyntenyHG10009891
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OIW13724.1 hypothetical protein TanjilG_08066, partial [Lupinus angustifolius]3.3e-3037.36Show/hide
Query:  SPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSR---------------------GLVPGPP--PT--------------TL----FQTTIRTSRT
        SPFHIR  HIAGPH LPS+QFQALFDSLF V+FIFPSR                     G +P  P  PT              TL    FQ T   S  
Subjt:  SPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSR---------------------GLVPGPP--PT--------------TL----FQTTIRTSRT

Query:  PDSKAGFFSRCVTPRQTCPL--PEGSGRNCVQRLGGSRDPAIYTKYRISLRSSSMREPRYPLLRVVVSNTTRMFCPPHAEAGAGGRRIHFKFLGATCAGV
         D+   + S     R +        SG  CVQRL GSRD AI+TKYRISLRSSSM+EPRYPL RV   + ++     H     GG    F FLGA  AGV
Subjt:  PDSKAGFFSRCVTPRQTCPL--PEGSGRNCVQRLGGSRDPAIYTKYRISLRSSSMREPRYPLLRVVVSNTTRMFCPPHAEAGAGGRRIHFKFLGATCAGV

Query:  -----------------------W-FRRVEGE------------GSKQRACFPP------PPARTIESRKSSQSVNPYYVWTCPHAVTSWRLLGQQRAPD
                               W    V G             G   R   PP      PP   IESRKSSQSVNPYYVWTC   V          A  
Subjt:  -----------------------W-FRRVEGE------------GSKQRACFPP------PPARTIESRKSSQSVNPYYVWTCPHAVTSWRLLGQQRAPD

Query:  QREERASFVKTICKGNTFERLLK--------HIVPPNAPHPRER----APTTRPATSLGKAEAE
          E R  FV    K  T +  ++        H   P   H R R      TTRP  +   + AE
Subjt:  QREERASFVKTICKGNTFERLLK--------HIVPPNAPHPRER----APTTRPATSLGKAEAE

XP_022933354.1 uncharacterized protein LOC111440692 [Cucurbita moschata]3.3e-3076.34Show/hide
Query:  MHGARRCRSPPEGAFCQPRLGRRCLHRRNKGLALARRLNLHQSMPQVNRRTDSSPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSRGL
        M GARRCRS PEGA CQPR GRR LH+RNKGL   RRLN H+SMPQV+ RT   PFHIR GHIAGPH LPS+QFQALFDSLF V+FIFPSR L
Subjt:  MHGARRCRSPPEGAFCQPRLGRRCLHRRNKGLALARRLNLHQSMPQVNRRTDSSPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSRGL

XP_022975713.1 uncharacterized protein LOC111475769 [Cucurbita maxima]9.6e-3075.27Show/hide
Query:  MHGARRCRSPPEGAFCQPRLGRRCLHRRNKGLALARRLNLHQSMPQVNRRTDSSPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSRGL
        M GARRCRS PEGA CQPR GRR LH+RNKGL   RRLN H+SMPQV+ RT   PFHIR GHI GPH LPS+QFQALFDSLF V+FIFPSR L
Subjt:  MHGARRCRSPPEGAFCQPRLGRRCLHRRNKGLALARRLNLHQSMPQVNRRTDSSPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSRGL

XP_023520622.1 uncharacterized protein LOC111784033 [Cucurbita pepo subsp. pepo]2.1e-2975.27Show/hide
Query:  MHGARRCRSPPEGAFCQPRLGRRCLHRRNKGLALARRLNLHQSMPQVNRRTDSSPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSRGL
        M GARRCRS P+GA CQPR GRR LH+RNKGL   RRLN H+SMPQV+ RT  SPFHIR  HIAGPH LPS+QFQALFDSLF V+FIFPSR L
Subjt:  MHGARRCRSPPEGAFCQPRLGRRCLHRRNKGLALARRLNLHQSMPQVNRRTDSSPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSRGL

XP_023552350.1 uncharacterized protein LOC111810043 [Cucurbita pepo subsp. pepo]2.1e-2975.27Show/hide
Query:  MHGARRCRSPPEGAFCQPRLGRRCLHRRNKGLALARRLNLHQSMPQVNRRTDSSPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSRGL
        M GARRCRS P+GA CQPR GRR LH+RNKGL   RRLN H+SMPQV+ RT  SPFHIR  HIAGPH LPS+QFQALFDSLF V+FIFPSR L
Subjt:  MHGARRCRSPPEGAFCQPRLGRRCLHRRNKGLALARRLNLHQSMPQVNRRTDSSPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSRGL

TrEMBL top hitse value%identityAlignment
A0A4P1RM86 Uncharacterized protein (Fragment)1.6e-3037.36Show/hide
Query:  SPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSR---------------------GLVPGPP--PT--------------TL----FQTTIRTSRT
        SPFHIR  HIAGPH LPS+QFQALFDSLF V+FIFPSR                     G +P  P  PT              TL    FQ T   S  
Subjt:  SPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSR---------------------GLVPGPP--PT--------------TL----FQTTIRTSRT

Query:  PDSKAGFFSRCVTPRQTCPL--PEGSGRNCVQRLGGSRDPAIYTKYRISLRSSSMREPRYPLLRVVVSNTTRMFCPPHAEAGAGGRRIHFKFLGATCAGV
         D+   + S     R +        SG  CVQRL GSRD AI+TKYRISLRSSSM+EPRYPL RV   + ++     H     GG    F FLGA  AGV
Subjt:  PDSKAGFFSRCVTPRQTCPL--PEGSGRNCVQRLGGSRDPAIYTKYRISLRSSSMREPRYPLLRVVVSNTTRMFCPPHAEAGAGGRRIHFKFLGATCAGV

Query:  -----------------------W-FRRVEGE------------GSKQRACFPP------PPARTIESRKSSQSVNPYYVWTCPHAVTSWRLLGQQRAPD
                               W    V G             G   R   PP      PP   IESRKSSQSVNPYYVWTC   V          A  
Subjt:  -----------------------W-FRRVEGE------------GSKQRACFPP------PPARTIESRKSSQSVNPYYVWTCPHAVTSWRLLGQQRAPD

Query:  QREERASFVKTICKGNTFERLLK--------HIVPPNAPHPRER----APTTRPATSLGKAEAE
          E R  FV    K  T +  ++        H   P   H R R      TTRP  +   + AE
Subjt:  QREERASFVKTICKGNTFERLLK--------HIVPPNAPHPRER----APTTRPATSLGKAEAE

A0A6A5L505 Uncharacterized protein1.8e-2937.25Show/hide
Query:  SPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSR------------------------------------------------------GLVPGPPP
        SPFHIR  HIAGPH LPS+QFQALFDSLF V+FIFPSR                                                      GL PGPP 
Subjt:  SPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSR------------------------------------------------------GLVPGPPP

Query:  TTLFQTTIRTSRTPDSKAGFF----------------------SRCVTPRQTCPLPE---------------GSGRNCVQRLGGSRDPAIYTKYRISLR-
         TL QTTIRT R  DS  G F                      SR +T   +   P                 SG  CVQRL GSRD AI+TKYRISLR 
Subjt:  TTLFQTTIRTSRTPDSKAGFF----------------------SRCVTPRQTCPLPE---------------GSGRNCVQRLGGSRDPAIYTKYRISLR-

Query:  ----SSSMREPRYPLLRVVVSNTTRMFCPPHAEAGA----GGRRIHFKFLGATCAGVWFRRVEGEGSKQRACFPPPPARTIESRKSSQSVNPYYVWTC
            S  +R P   LLR+++    ++    H  AG+      +  HF        G + R++        A   PPP   IESRKSSQSVNPYYVWTC
Subjt:  ----SSSMREPRYPLLRVVVSNTTRMFCPPHAEAGA----GGRRIHFKFLGATCAGVWFRRVEGEGSKQRACFPPPPARTIESRKSSQSVNPYYVWTC

A0A6A5L5R0 Uncharacterized protein1.4e-2936.04Show/hide
Query:  GARRCRSPPEGAFCQPRLGRRCLHR--RNKGLALARRLNLHQSMPQVNRRTDS-SPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSR--------
        GARRCRS P        +      R  +  GL    +     S+   +R  D  SPFHIR  HIAGPH LPS+QFQALFDSLF V+FIFPSR        
Subjt:  GARRCRSPPEGAFCQPRLGRRCLHR--RNKGLALARRLNLHQSMPQVNRRTDS-SPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSR--------

Query:  -------------GLVPGPP--PT--------------TL----FQTTIRTSRTPDS--------------------------KAG-----FFSRCVTPR
                     G +P  P  PT              TL    FQ T   S T D+                          KAG       SR +T  
Subjt:  -------------GLVPGPP--PT--------------TL----FQTTIRTSRTPDS--------------------------KAG-----FFSRCVTPR

Query:  QTCPLPE---------------GSGRNCVQRLGGSRDPAIYTKYRISLRSSSMREPRYPLLRVVVSNTTRMFCPPHAEAGAGGRRIHFKFLGATCAGVWF
         +   P                 SG  CVQRL GSRD AI+TKYRISLRSSSM+EPRYPL RV   + ++     H     GG    F FLGA  AGV  
Subjt:  QTCPLPE---------------GSGRNCVQRLGGSRDPAIYTKYRISLRSSSMREPRYPLLRVVVSNTTRMFCPPHAEAGAGGRRIHFKFLGATCAGVWF

Query:  RRVEGEGSKQRACFPP------------------------PPART------IESRKSSQSVNPYYVWTC
          + G+    +   PP                        PP +       IESRKSSQSVNPYYVWTC
Subjt:  RRVEGEGSKQRACFPP------------------------PPART------IESRKSSQSVNPYYVWTC

A0A6J1EZI8 uncharacterized protein LOC1114406921.6e-3076.34Show/hide
Query:  MHGARRCRSPPEGAFCQPRLGRRCLHRRNKGLALARRLNLHQSMPQVNRRTDSSPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSRGL
        M GARRCRS PEGA CQPR GRR LH+RNKGL   RRLN H+SMPQV+ RT   PFHIR GHIAGPH LPS+QFQALFDSLF V+FIFPSR L
Subjt:  MHGARRCRSPPEGAFCQPRLGRRCLHRRNKGLALARRLNLHQSMPQVNRRTDSSPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSRGL

A0A6J1IK28 uncharacterized protein LOC1114757694.7e-3075.27Show/hide
Query:  MHGARRCRSPPEGAFCQPRLGRRCLHRRNKGLALARRLNLHQSMPQVNRRTDSSPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSRGL
        M GARRCRS PEGA CQPR GRR LH+RNKGL   RRLN H+SMPQV+ RT   PFHIR GHI GPH LPS+QFQALFDSLF V+FIFPSR L
Subjt:  MHGARRCRSPPEGAFCQPRLGRRCLHRRNKGLALARRLNLHQSMPQVNRRTDSSPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSRGL

SwissProt top hitse value%identityAlignment
Q8TGM5 Putative uncharacterized protein ART35.9e-0669.23Show/hide
Query:  GRNCVQRLGGSRDPAIYTKYRISLRSSSMREPRYPLLRV
        G  CVQR   SR+ AI+  YRISLRSSSMREPR PLL+V
Subjt:  GRNCVQRLGGSRDPAIYTKYRISLRSSSMREPRYPLLRV

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATGGAGCACGCAGATGTCGAAGCCCGCCCGAAGGCGCATTCTGCCAGCCACGATTGGGACGACGATGTCTCCACAGGCGTAACAAAGGCCTTGCCTTAGCCCGCCG
CCTCAATCTGCATCAGTCCATGCCCCAAGTCAATCGGCGGACCGACTCATCACCGTTCCACATCCGACAGGGGCACATCGCCGGCCCTCATCTGCTTCCCTCCCAACAAT
TTCAAGCACTATTTGACTCTCTTTTCAACGTCATTTTCATCTTTCCCTCGCGGGGACTTGTGCCCGGTCCACCGCCGACGACGCTTTTCCAGACTACAATTCGGACGTCT
AGGACGCCTGATTCCAAAGCTGGGTTCTTCTCGCGATGCGTGACGCCCAGGCAGACGTGCCCTCTGCCAGAAGGTTCCGGGCGCAACTGCGTTCAAAGACTCGGTGGTTC
GCGGGATCCTGCAATTTACACCAAGTATCGCATTTCGCTACGTTCTTCATCGATGCGAGAGCCGAGATATCCGTTGTTGAGAGTCGTTGTGAGTAATACGACAAGAATGT
TCTGCCCCCCGCACGCCGAGGCCGGGGCAGGGGGCAGGCGAATTCATTTCAAGTTCCTTGGCGCGACCTGCGCCGGGGTTTGGTTTAGGCGCGTCGAGGGGGAGGGGAGC
AAGCAAAGAGCATGCTTCCCCCCGCCCCCGGCGCGAACAATTGAATCAAGAAAGAGCTCTCAGTCTGTCAATCCTTACTATGTCTGGACCTGCCCGCATGCCGTCACCTC
CTGGCGTCTGTTAGGACAACAAAGGGCGCCGGATCAACGCGAGGAGCGAGCGTCATTCGTCAAAACAATCTGTAAAGGCAACACGTTCGAGAGACTTCTCAAGCATATCG
TGCCGCCGAATGCACCGCATCCGAGAGAGCGAGCACCGACGACGCGGCCTGCAACGAGCCTTGGAAAGGCCGAAGCGGAACGCGATGCAGGCTTTGGGGTTCGTCGTGCA
CCCAACGAGGGGTGTCGACAGAAAAGGACTAACGAGACACGACACTTCCATCACATGAGGTACCGATGCAAGAACCGATCGATCGCGGCCGCTAGAAATACTTTAGGGCA
CGTCAATGAGGAGGAGCTAACGCCGACAGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCATGGAGCACGCAGATGTCGAAGCCCGCCCGAAGGCGCATTCTGCCAGCCACGATTGGGACGACGATGTCTCCACAGGCGTAACAAAGGCCTTGCCTTAGCCCGCCG
CCTCAATCTGCATCAGTCCATGCCCCAAGTCAATCGGCGGACCGACTCATCACCGTTCCACATCCGACAGGGGCACATCGCCGGCCCTCATCTGCTTCCCTCCCAACAAT
TTCAAGCACTATTTGACTCTCTTTTCAACGTCATTTTCATCTTTCCCTCGCGGGGACTTGTGCCCGGTCCACCGCCGACGACGCTTTTCCAGACTACAATTCGGACGTCT
AGGACGCCTGATTCCAAAGCTGGGTTCTTCTCGCGATGCGTGACGCCCAGGCAGACGTGCCCTCTGCCAGAAGGTTCCGGGCGCAACTGCGTTCAAAGACTCGGTGGTTC
GCGGGATCCTGCAATTTACACCAAGTATCGCATTTCGCTACGTTCTTCATCGATGCGAGAGCCGAGATATCCGTTGTTGAGAGTCGTTGTGAGTAATACGACAAGAATGT
TCTGCCCCCCGCACGCCGAGGCCGGGGCAGGGGGCAGGCGAATTCATTTCAAGTTCCTTGGCGCGACCTGCGCCGGGGTTTGGTTTAGGCGCGTCGAGGGGGAGGGGAGC
AAGCAAAGAGCATGCTTCCCCCCGCCCCCGGCGCGAACAATTGAATCAAGAAAGAGCTCTCAGTCTGTCAATCCTTACTATGTCTGGACCTGCCCGCATGCCGTCACCTC
CTGGCGTCTGTTAGGACAACAAAGGGCGCCGGATCAACGCGAGGAGCGAGCGTCATTCGTCAAAACAATCTGTAAAGGCAACACGTTCGAGAGACTTCTCAAGCATATCG
TGCCGCCGAATGCACCGCATCCGAGAGAGCGAGCACCGACGACGCGGCCTGCAACGAGCCTTGGAAAGGCCGAAGCGGAACGCGATGCAGGCTTTGGGGTTCGTCGTGCA
CCCAACGAGGGGTGTCGACAGAAAAGGACTAACGAGACACGACACTTCCATCACATGAGGTACCGATGCAAGAACCGATCGATCGCGGCCGCTAGAAATACTTTAGGGCA
CGTCAATGAGGAGGAGCTAACGCCGACAGTTTGA
Protein sequenceShow/hide protein sequence
MHGARRCRSPPEGAFCQPRLGRRCLHRRNKGLALARRLNLHQSMPQVNRRTDSSPFHIRQGHIAGPHLLPSQQFQALFDSLFNVIFIFPSRGLVPGPPPTTLFQTTIRTS
RTPDSKAGFFSRCVTPRQTCPLPEGSGRNCVQRLGGSRDPAIYTKYRISLRSSSMREPRYPLLRVVVSNTTRMFCPPHAEAGAGGRRIHFKFLGATCAGVWFRRVEGEGS
KQRACFPPPPARTIESRKSSQSVNPYYVWTCPHAVTSWRLLGQQRAPDQREERASFVKTICKGNTFERLLKHIVPPNAPHPRERAPTTRPATSLGKAEAERDAGFGVRRA
PNEGCRQKRTNETRHFHHMRYRCKNRSIAAARNTLGHVNEEELTPTV