; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10019010 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10019010
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionGag-proteinase polyprotein
Genome locationChr04:13969897..13970806
RNA-Seq ExpressionHG10019010
SyntenyHG10019010
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035157.1 gag-pol polyprotein [Cucumis melo var. makuwa]7.0e-2032.43Show/hide
Query:  MTFEDKNDKKLKGIALQSSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASPHVKDLSNDISNQNSKLPKQAGKKPDNDKNNCGGQ
        MT  D+  KK KGIA +S+   + V    D   N+ ESI+LL +QF  AL          RN  +P+V  ++   SNQ  +       + +N+ +N    
Subjt:  MTFEDKNDKKLKGIALQSSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASPHVKDLSNDISNQNSKLPKQAGKKPDNDKNNCGGQ

Query:  AIRERS------FRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYE
           ++       FR  EC   G+YQ ECP F+++++K F   LLD+ES  S + D  + A    ++      D   C+   V++K  E + E   TL  E
Subjt:  AIRERS------FRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYE

Query:  EISL----KERTEKLIEDNHRLMSTISELKKELREAKADHD------RLSNSGTDDLEKILSSGKKVSDKRGIRFENSRKLRHISESSLSNRFVPA
        +       KER + L+E+N RLMS IS LK +LRE + ++D      ++ NSGT++L+ IL +G   S + G+ F  S      S+++   +FVPA
Subjt:  EISL----KERTEKLIEDNHRLMSTISELKKELREAKADHD------RLSNSGTDDLEKILSSGKKVSDKRGIRFENSRKLRHISESSLSNRFVPA

KAA0036592.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]3.5e-1929.79Show/hide
Query:  MTFEDKNDKKLKGIALQSSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASPHVKDLSNDISNQNSKLPKQAGKKPDNDKNNCGGQ
        M    + +KK KG+  +S   ++++    D   N+ +SI LL KQF   +K++K  + +  N A   +     D  N   +  + + ++  +      G+
Subjt:  MTFEDKNDKKLKGIALQSSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASPHVKDLSNDISNQNSKLPKQAGKKPDNDKNNCGGQ

Query:  AIRERSFRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYEEISL--
           ER FR  ECE  G+YQAECP FL+R++K F  TL D+++   +E D+ + A +            V+  D   +++  E+N ++   L++EE+ +  
Subjt:  AIRERSFRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYEEISL--

Query:  ----------KERTEKLIEDNHRLMSTISELKKELREAKADHDR------LSNSGTDDLEKILSSGKKVSDKRGIRFENSRK
                  KER ++ +E+N RLMS IS LK +L+E + D+D+      + NSG ++L+  L+SGK  S K G+ F+ S++
Subjt:  ----------KERTEKLIEDNHRLMSTISELKKELREAKADHDR------LSNSGTDDLEKILSSGKKVSDKRGIRFENSRK

KAA0045252.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]1.8e-2032.24Show/hide
Query:  MTFE----DKNDKKLKGIALQ-SSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASPHVKDLSNDISNQNSKLPKQAGKKPDNDKN
        +TFE    D+  KK KGIA + + +SE++V    D+  N+ ESI LL KQF   +K++K  + +  N A   +     D  N   ++ + + ++  +   
Subjt:  MTFE----DKNDKKLKGIALQ-SSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASPHVKDLSNDISNQNSKLPKQAGKKPDNDKN

Query:  NCGGQAIRERSFRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYEE
           G+    R FR  ECE  G+YQAECP FL+R++K F ATL D ++    E D+ + A    +S E    D+  C+      ++C++N       ++EE
Subjt:  NCGGQAIRERSFRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYEE

Query:  ISL------------KERTEKLIEDNHRLMSTISELKKELREAKADHD------RLSNSGTDDLEKILSSGKKVSDKRGIRFENS-RKLRHISESSLSNR
        + +            KER + L+E+N RLM  IS LK +L+E + ++D      ++ NSGT++L+ IL+SG+   ++ G+ F+ S RK+   +E +    
Subjt:  ISL------------KERTEKLIEDNHRLMSTISELKKELREAKADHD------RLSNSGTDDLEKILSSGKKVSDKRGIRFENS-RKLRHISESSLSNR

Query:  FVPA
        FVPA
Subjt:  FVPA

KAA0050329.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]7.7e-1930Show/hide
Query:  MTFEDKNDKKLKGIALQSSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASP-HVKDLSNDISNQNSKLPKQAGKKPDNDKNNCGG
        M   D+ +KK KG+A +S   E++     D+  N+ ESI+LL +QF K ++++K  + +  N  +P H +    D  N   +  + + K+ ++ +    G
Subjt:  MTFEDKNDKKLKGIALQSSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASP-HVKDLSNDISNQNSKLPKQAGKKPDNDKNNCGG

Query:  QAIRERSFRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYEEISL-
        +    +SFR  EC   G+YQ EC  FL+R++K F        + +S+E  D+ K   G     ++   +V+  D   +++  E+N ++   L++EE+ + 
Subjt:  QAIRERSFRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYEEISL-

Query:  -----------KERTEKLIEDNHRLMSTISELKKELREAKADHD------RLSNSGTDDLEKILSSGKKVSDKRGIRFENS-RKLRHISESSLSNRFVPA
                   KER + L+E+N RLMS IS LK +L+E + +HD      ++ NSG ++L+ IL+ G+  S K G+ F+ S R ++  +E+    +FVPA
Subjt:  -----------KERTEKLIEDNHRLMSTISELKKELREAKADHD------RLSNSGTDDLEKILSSGKKVSDKRGIRFENS-RKLRHISESSLSNRFVPA

TYK14780.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]5.9e-1932.56Show/hide
Query:  MTFEDKNDKKLKGIALQSSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASPHV---KDLSNDISNQNSKLPKQAGKKPDNDKNNC
        MT  ++ +KK K I  +S   E+++    D+  N+ ESI+LL KQF K ++++K  +  + N  + +    KD  N I   N         + DND    
Subjt:  MTFEDKNDKKLKGIALQSSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASPHV---KDLSNDISNQNSKLPKQAGKKPDNDKNNC

Query:  GGQAIRERSFRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYEEIS
          +    R FR  EC   G YQAECP FL+R++K F ATL D++ + +SE D+ +KA    + +E    D+  C          E+N +    L++EE+ 
Subjt:  GGQAIRERSFRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYEEIS

Query:  L------------KERTEKLIEDNHRLMSTISELKKELREAKADHD------RLSNSGTDDLEKILSSGKKVSDKRGIRFENSRKLRHISESSLSNRFVP
        +            KER + L+E+N +LMS IS LK + +E + D+D      ++ NSGT+ L+ IL+SG+  S K G+ F+ S K     + +   +FVP
Subjt:  L------------KERTEKLIEDNHRLMSTISELKKELREAKADHD------RLSNSGTDDLEKILSSGKKVSDKRGIRFENSRKLRHISESSLSNRFVP

Query:  A
        A
Subjt:  A

TrEMBL top hitse value%identityAlignment
A0A5A7T0Q0 Gag-pol polyprotein3.4e-2032.43Show/hide
Query:  MTFEDKNDKKLKGIALQSSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASPHVKDLSNDISNQNSKLPKQAGKKPDNDKNNCGGQ
        MT  D+  KK KGIA +S+   + V    D   N+ ESI+LL +QF  AL          RN  +P+V  ++   SNQ  +       + +N+ +N    
Subjt:  MTFEDKNDKKLKGIALQSSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASPHVKDLSNDISNQNSKLPKQAGKKPDNDKNNCGGQ

Query:  AIRERS------FRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYE
           ++       FR  EC   G+YQ ECP F+++++K F   LLD+ES  S + D  + A    ++      D   C+   V++K  E + E   TL  E
Subjt:  AIRERS------FRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYE

Query:  EISL----KERTEKLIEDNHRLMSTISELKKELREAKADHD------RLSNSGTDDLEKILSSGKKVSDKRGIRFENSRKLRHISESSLSNRFVPA
        +       KER + L+E+N RLMS IS LK +LRE + ++D      ++ NSGT++L+ IL +G   S + G+ F  S      S+++   +FVPA
Subjt:  EISL----KERTEKLIEDNHRLMSTISELKKELREAKADHD------RLSNSGTDDLEKILSSGKKVSDKRGIRFENSRKLRHISESSLSNRFVPA

A0A5A7TPF7 Gag-proteinase polyprotein8.9e-2132.24Show/hide
Query:  MTFE----DKNDKKLKGIALQ-SSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASPHVKDLSNDISNQNSKLPKQAGKKPDNDKN
        +TFE    D+  KK KGIA + + +SE++V    D+  N+ ESI LL KQF   +K++K  + +  N A   +     D  N   ++ + + ++  +   
Subjt:  MTFE----DKNDKKLKGIALQ-SSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASPHVKDLSNDISNQNSKLPKQAGKKPDNDKN

Query:  NCGGQAIRERSFRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYEE
           G+    R FR  ECE  G+YQAECP FL+R++K F ATL D ++    E D+ + A    +S E    D+  C+      ++C++N       ++EE
Subjt:  NCGGQAIRERSFRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYEE

Query:  ISL------------KERTEKLIEDNHRLMSTISELKKELREAKADHD------RLSNSGTDDLEKILSSGKKVSDKRGIRFENS-RKLRHISESSLSNR
        + +            KER + L+E+N RLM  IS LK +L+E + ++D      ++ NSGT++L+ IL+SG+   ++ G+ F+ S RK+   +E +    
Subjt:  ISL------------KERTEKLIEDNHRLMSTISELKKELREAKADHD------RLSNSGTDDLEKILSSGKKVSDKRGIRFENS-RKLRHISESSLSNR

Query:  FVPA
        FVPA
Subjt:  FVPA

A0A5A7UA03 Gag-proteinase polyprotein3.8e-1930Show/hide
Query:  MTFEDKNDKKLKGIALQSSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASP-HVKDLSNDISNQNSKLPKQAGKKPDNDKNNCGG
        M   D+ +KK KG+A +S   E++     D+  N+ ESI+LL +QF K ++++K  + +  N  +P H +    D  N   +  + + K+ ++ +    G
Subjt:  MTFEDKNDKKLKGIALQSSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASP-HVKDLSNDISNQNSKLPKQAGKKPDNDKNNCGG

Query:  QAIRERSFRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYEEISL-
        +    +SFR  EC   G+YQ EC  FL+R++K F        + +S+E  D+ K   G     ++   +V+  D   +++  E+N ++   L++EE+ + 
Subjt:  QAIRERSFRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYEEISL-

Query:  -----------KERTEKLIEDNHRLMSTISELKKELREAKADHD------RLSNSGTDDLEKILSSGKKVSDKRGIRFENS-RKLRHISESSLSNRFVPA
                   KER + L+E+N RLMS IS LK +L+E + +HD      ++ NSG ++L+ IL+ G+  S K G+ F+ S R ++  +E+    +FVPA
Subjt:  -----------KERTEKLIEDNHRLMSTISELKKELREAKADHD------RLSNSGTDDLEKILSSGKKVSDKRGIRFENS-RKLRHISESSLSNRFVPA

A0A5D3CTL4 Gag-proteinase polyprotein2.9e-1932.56Show/hide
Query:  MTFEDKNDKKLKGIALQSSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASPHV---KDLSNDISNQNSKLPKQAGKKPDNDKNNC
        MT  ++ +KK K I  +S   E+++    D+  N+ ESI+LL KQF K ++++K  +  + N  + +    KD  N I   N         + DND    
Subjt:  MTFEDKNDKKLKGIALQSSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASPHV---KDLSNDISNQNSKLPKQAGKKPDNDKNNC

Query:  GGQAIRERSFRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYEEIS
          +    R FR  EC   G YQAECP FL+R++K F ATL D++ + +SE D+ +KA    + +E    D+  C          E+N +    L++EE+ 
Subjt:  GGQAIRERSFRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYEEIS

Query:  L------------KERTEKLIEDNHRLMSTISELKKELREAKADHD------RLSNSGTDDLEKILSSGKKVSDKRGIRFENSRKLRHISESSLSNRFVP
        +            KER + L+E+N +LMS IS LK + +E + D+D      ++ NSGT+ L+ IL+SG+  S K G+ F+ S K     + +   +FVP
Subjt:  L------------KERTEKLIEDNHRLMSTISELKKELREAKADHD------RLSNSGTDDLEKILSSGKKVSDKRGIRFENSRKLRHISESSLSNRFVP

Query:  A
        A
Subjt:  A

A0A5D3DH72 Gag-proteinase polyprotein1.7e-1929.79Show/hide
Query:  MTFEDKNDKKLKGIALQSSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASPHVKDLSNDISNQNSKLPKQAGKKPDNDKNNCGGQ
        M    + +KK KG+  +S   ++++    D   N+ +SI LL KQF   +K++K  + +  N A   +     D  N   +  + + ++  +      G+
Subjt:  MTFEDKNDKKLKGIALQSSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASPHVKDLSNDISNQNSKLPKQAGKKPDNDKNNCGGQ

Query:  AIRERSFRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYEEISL--
           ER FR  ECE  G+YQAECP FL+R++K F  TL D+++   +E D+ + A +            V+  D   +++  E+N ++   L++EE+ +  
Subjt:  AIRERSFRFHECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYEEISL--

Query:  ----------KERTEKLIEDNHRLMSTISELKKELREAKADHDR------LSNSGTDDLEKILSSGKKVSDKRGIRFENSRK
                  KER ++ +E+N RLMS IS LK +L+E + D+D+      + NSG ++L+  L+SGK  S K G+ F+ S++
Subjt:  ----------KERTEKLIEDNHRLMSTISELKKELREAKADHDR------LSNSGTDDLEKILSSGKKVSDKRGIRFENSRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCTTTGAGGACAAAAATGATAAGAAGCTAAAAGGCATCGCCTTGCAATCCTCTATTAGTGAGGACTCAGTTTACAAGATTAGAGACTCAAATGAAAATCTTGCTGA
ATCTATATCTCTTCTGGCCAAACAGTTTGGAAAAGCTCTCAAACGATGGAAAAAACGTTCCGGCTCCCAAAGGAATTATGCTTCACCACATGTTAAAGACTTAAGCAACG
ACATATCTAACCAAAATTCAAAACTTCCAAAACAGGCAGGCAAGAAACCCGACAATGATAAAAACAACTGTGGTGGTCAAGCAATTAGAGAAAGATCTTTTAGATTTCAT
GAGTGTGAGAGTTTTGGATACTATCAGGCTGAATGTCCTAACTTCCTAAAAAGAAAGGAGAAAGGATTTGCTGCTACCCTCTTAGATGATGAATCAGAAGTGAGCAGTGA
ATTAGATGATGAAGTGAAAGCCCTGATGGGGAGTATGTCCCTTGAGCATTCTGTTCAAGATAAAGTGTCCTGTAATGATGGTGTCGTCAAGACGAAAATGTGTGAGCAAA
ACTCAGAACATCATGGAACTTTGTCTTATGAGGAAATTAGCCTGAAAGAGAGAACTGAGAAACTTATTGAAGACAACCATCGGCTAATGAGTACCATATCAGAGTTGAAG
AAAGAATTGCGAGAGGCTAAAGCTGATCATGACAGACTATCTAACTCTGGAACAGATGATCTTGAAAAAATCTTGTCATCAGGAAAGAAAGTGTCTGACAAGAGAGGCAT
CAGATTTGAGAATTCGAGAAAACTAAGACACATTAGCGAATCTTCCTTGAGCAATAGATTTGTTCCAGCTCAAAATTCTCCTGTGATCTATTCAAAGTTAAAAACATGA
mRNA sequenceShow/hide mRNA sequence
ATGACCTTTGAGGACAAAAATGATAAGAAGCTAAAAGGCATCGCCTTGCAATCCTCTATTAGTGAGGACTCAGTTTACAAGATTAGAGACTCAAATGAAAATCTTGCTGA
ATCTATATCTCTTCTGGCCAAACAGTTTGGAAAAGCTCTCAAACGATGGAAAAAACGTTCCGGCTCCCAAAGGAATTATGCTTCACCACATGTTAAAGACTTAAGCAACG
ACATATCTAACCAAAATTCAAAACTTCCAAAACAGGCAGGCAAGAAACCCGACAATGATAAAAACAACTGTGGTGGTCAAGCAATTAGAGAAAGATCTTTTAGATTTCAT
GAGTGTGAGAGTTTTGGATACTATCAGGCTGAATGTCCTAACTTCCTAAAAAGAAAGGAGAAAGGATTTGCTGCTACCCTCTTAGATGATGAATCAGAAGTGAGCAGTGA
ATTAGATGATGAAGTGAAAGCCCTGATGGGGAGTATGTCCCTTGAGCATTCTGTTCAAGATAAAGTGTCCTGTAATGATGGTGTCGTCAAGACGAAAATGTGTGAGCAAA
ACTCAGAACATCATGGAACTTTGTCTTATGAGGAAATTAGCCTGAAAGAGAGAACTGAGAAACTTATTGAAGACAACCATCGGCTAATGAGTACCATATCAGAGTTGAAG
AAAGAATTGCGAGAGGCTAAAGCTGATCATGACAGACTATCTAACTCTGGAACAGATGATCTTGAAAAAATCTTGTCATCAGGAAAGAAAGTGTCTGACAAGAGAGGCAT
CAGATTTGAGAATTCGAGAAAACTAAGACACATTAGCGAATCTTCCTTGAGCAATAGATTTGTTCCAGCTCAAAATTCTCCTGTGATCTATTCAAAGTTAAAAACATGA
Protein sequenceShow/hide protein sequence
MTFEDKNDKKLKGIALQSSISEDSVYKIRDSNENLAESISLLAKQFGKALKRWKKRSGSQRNYASPHVKDLSNDISNQNSKLPKQAGKKPDNDKNNCGGQAIRERSFRFH
ECESFGYYQAECPNFLKRKEKGFAATLLDDESEVSSELDDEVKALMGSMSLEHSVQDKVSCNDGVVKTKMCEQNSEHHGTLSYEEISLKERTEKLIEDNHRLMSTISELK
KELREAKADHDRLSNSGTDDLEKILSSGKKVSDKRGIRFENSRKLRHISESSLSNRFVPAQNSPVIYSKLKT