; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G09160 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G09160
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionGag-pol polyprotein
Genome locationClcChr11:11472008..11478328
RNA-Seq ExpressionClc11G09160
SyntenyClc11G09160
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU42103.1 hypothetical protein TSUD_134870 [Trifolium subterraneum]3.2e-1041.03Show/hide
Query:  RSLAKGDWVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSNPQMPCCSASSTLWNFLYSGLTANLISVNQLCDQGHVSMK
        R+ +K DW FDSG SR+M G+  +L D+KS S       T   GA      V  L I KS P++            L  GLTANLIS++QLCDQGH++++
Subjt:  RSLAKGDWVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSNPQMPCCSASSTLWNFLYSGLTANLISVNQLCDQGHVSMK

Query:  TIQKTVAKDGISGLPLL
        +++K ++++ I GLP L
Subjt:  TIQKTVAKDGISGLPLL

KAA0035514.1 gag-pol polyprotein [Cucumis melo var. makuwa]8.0e-0632.17Show/hide
Query:  WVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSN---PQMPCCSASSTLWNFLY-SGLTANLISVNQLCDQ---------
        W FDSG SR+M G +S+ ++L+        +CTS  G VT  D      I K N     +PC      L+   Y  GL ANLIS++QLCDQ         
Subjt:  WVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSN---PQMPCCSASSTLWNFLY-SGLTANLISVNQLCDQ---------

Query:  ----------------GHVSMKTIQKTVAKDGISGLPLLPAKG
                        GH+S++++ K +    I G+P L   G
Subjt:  ----------------GHVSMKTIQKTVAKDGISGLPLLPAKG

KAA0056418.1 gag-pol polyprotein [Cucumis melo var. makuwa]8.0e-0625.64Show/hide
Query:  WVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSN---PQMPCCSASSTLWNFLYSGLTANLISVNQLCDQ----------
        W FDSGSSR+M G +S+ ++L+        +C S  G VT  D      I K N     +PC +    +      GL ANLISV+QLCDQ          
Subjt:  WVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSN---PQMPCCSASSTLWNFLYSGLTANLISVNQLCDQ----------

Query:  -----------------------------------------------GHVSMKTIQKTVAKDGISGLPLLPAKGRIVCSDCQSCSATRLLQHELR
                                                       GH+S++++ K +  + + G+P L   G+  C DCQ    T+     L+
Subjt:  -----------------------------------------------GHVSMKTIQKTVAKDGISGLPLLPAKGRIVCSDCQSCSATRLLQHELR

MCH89489.1 gag-pol polyprotein [Trifolium medium]7.3e-0727.33Show/hide
Query:  RSLAKGDWVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSNPQMPCCSASSTLWNFLYSGLTANLISVNQLCDQ------
        R+ ++ DW FDSG SR+M G + +L D+KS S             VT  D       IK   ++            L  GLTANLIS++QLCDQ      
Subjt:  RSLAKGDWVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSNPQMPCCSASSTLWNFLYSGLTANLISVNQLCDQ------

Query:  ----------------------GHVSMKTIQKTVAKDGISGLPLLPAKGRIVCSDCQSCSATRLLQHELRNM
                              GH++++ +++ + ++   GLP L  +   +C +CQ    TR+   +L+++
Subjt:  ----------------------GHVSMKTIQKTVAKDGISGLPLLPAKGRIVCSDCQSCSATRLLQHELRNM

PNX99239.1 gag-pol polyprotein, partial [Trifolium pratense]4.3e-0746.32Show/hide
Query:  RSLAKGDWVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSNPQMPCCSASSTLWNFLYSGLTANLISVNQLCDQG
        R  +K DW FDSG S++MIGEK+YL ++KS S   N   T   GA      +  L    S P +P  S  + L   L  GLTANLIS++QLCDQG
Subjt:  RSLAKGDWVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSNPQMPCCSASSTLWNFLYSGLTANLISVNQLCDQG

TrEMBL top hitse value%identityAlignment
A0A2K3N8B2 Gag-pol polyprotein (Fragment)2.1e-0746.32Show/hide
Query:  RSLAKGDWVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSNPQMPCCSASSTLWNFLYSGLTANLISVNQLCDQG
        R  +K DW FDSG S++MIGEK+YL ++KS S   N   T   GA      +  L    S P +P  S  + L   L  GLTANLIS++QLCDQG
Subjt:  RSLAKGDWVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSNPQMPCCSASSTLWNFLYSGLTANLISVNQLCDQG

A0A2Z6NJJ4 Reverse transcriptase Ty1/copia-type domain-containing protein1.5e-1041.03Show/hide
Query:  RSLAKGDWVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSNPQMPCCSASSTLWNFLYSGLTANLISVNQLCDQGHVSMK
        R+ +K DW FDSG SR+M G+  +L D+KS S       T   GA      V  L I KS P++            L  GLTANLIS++QLCDQGH++++
Subjt:  RSLAKGDWVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSNPQMPCCSASSTLWNFLYSGLTANLISVNQLCDQGHVSMK

Query:  TIQKTVAKDGISGLPLL
        +++K ++++ I GLP L
Subjt:  TIQKTVAKDGISGLPLL

A0A5A7SMR2 Gag-pol polyprotein1.9e-0524.1Show/hide
Query:  WVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSN---PQMPCCSASSTLWNFLYSGLTANLISVNQLCDQ----------
        W FDSG SR+M G +S+ ++L+        +CTS    VT +D      I K N     +PC +    +      GL ANLIS++Q+CDQ          
Subjt:  WVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSN---PQMPCCSASSTLWNFLYSGLTANLISVNQLCDQ----------

Query:  -----------------------------------------------GHVSMKTIQKTVAKDGISGLPLLPAKGRIVCSDCQSCSATRLLQHELR
                                                       GH+SM+++ K +  + +  +P L   G+  C DCQ    T++    L+
Subjt:  -----------------------------------------------GHVSMKTIQKTVAKDGISGLPLLPAKGRIVCSDCQSCSATRLLQHELR

A0A5D3DZQ7 Gag-pol polyprotein3.9e-0625.64Show/hide
Query:  WVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSN---PQMPCCSASSTLWNFLYSGLTANLISVNQLCDQ----------
        W FDSGSSR+M G +S+ ++L+        +C S  G VT  D      I K N     +PC +    +      GL ANLISV+QLCDQ          
Subjt:  WVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSN---PQMPCCSASSTLWNFLYSGLTANLISVNQLCDQ----------

Query:  -----------------------------------------------GHVSMKTIQKTVAKDGISGLPLLPAKGRIVCSDCQSCSATRLLQHELR
                                                       GH+S++++ K +  + + G+P L   G+  C DCQ    T+     L+
Subjt:  -----------------------------------------------GHVSMKTIQKTVAKDGISGLPLLPAKGRIVCSDCQSCSATRLLQHELR

A0A5D3E4L4 Gag-pol polyprotein3.9e-0632.17Show/hide
Query:  WVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSN---PQMPCCSASSTLWNFLY-SGLTANLISVNQLCDQ---------
        W FDSG SR+M G +S+ ++L+        +CTS  G VT  D      I K N     +PC      L+   Y  GL ANLIS++QLCDQ         
Subjt:  WVFDSGSSRYMIGEKSYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSN---PQMPCCSASSTLWNFLY-SGLTANLISVNQLCDQ---------

Query:  ----------------GHVSMKTIQKTVAKDGISGLPLLPAKG
                        GH+S++++ K +    I G+P L   G
Subjt:  ----------------GHVSMKTIQKTVAKDGISGLPLLPAKG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACACCAGAGTTTGGGTACTCCAGTCTTCTCATATTCATATGTTTCTACCATGTCCAATACCGGGGAGAACACTAGATTTACCTTTGGAACCTGGAAAGAAGGTTGG
CGTGTACATAGAGATCAAAGTTCCATTAGCTTCCAGAGGTCGACCTTGCCCCGTGCTACGATATTCAACTCGAAAAGTGTTACTGCCGCCGATCACCAATTTTCTTCATC
TCCTTCTCGATAAGTACTTTCTCCTGTGTCCAGATATTCCTACTAGATCCTTAGCCAAAGGAGATTGGGTCTTTGATAGTGGAAGCTCAAGGTATATGATAGGAGAAAAG
AGTTACTTGTCAGATTTGAAATCTATTAGTGATAGAGACAACCGCAAGTGCACGAGTCAAGTTGGTGCCGTGACACTTCAGGACAACGTTGCGATGCTGACCATCATAAA
AAGTAACCCCCAAATGCCGTGTTGTAGCGCCTCCTCTACGCTGTGGAACTTCCTATATAGCGGTCTCACTGCTAACCTTATTAGCGTCAATCAGTTGTGCGATCAAGGAC
ATGTGAGCATGAAAACTATTCAAAAAACAGTGGCCAAGGATGGCATATCAGGGCTCCCTCTGTTGCCTGCTAAAGGTAGAATTGTGTGTAGTGATTGTCAGAGTTGTTCT
GCGACTAGGCTCCTCCAGCACGAATTACGAAATATGGAGGCGGAGGAAACCAAATGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCACACCAGAGTTTGGGTACTCCAGTCTTCTCATATTCATATGTTTCTACCATGTCCAATACCGGGGAGAACACTAGATTTACCTTTGGAACCTGGAAAGAAGGTTGG
CGTGTACATAGAGATCAAAGTTCCATTAGCTTCCAGAGGTCGACCTTGCCCCGTGCTACGATATTCAACTCGAAAAGTGTTACTGCCGCCGATCACCAATTTTCTTCATC
TCCTTCTCGATAAGTACTTTCTCCTGTGTCCAGATATTCCTACTAGATCCTTAGCCAAAGGAGATTGGGTCTTTGATAGTGGAAGCTCAAGGTATATGATAGGAGAAAAG
AGTTACTTGTCAGATTTGAAATCTATTAGTGATAGAGACAACCGCAAGTGCACGAGTCAAGTTGGTGCCGTGACACTTCAGGACAACGTTGCGATGCTGACCATCATAAA
AAGTAACCCCCAAATGCCGTGTTGTAGCGCCTCCTCTACGCTGTGGAACTTCCTATATAGCGGTCTCACTGCTAACCTTATTAGCGTCAATCAGTTGTGCGATCAAGGAC
ATGTGAGCATGAAAACTATTCAAAAAACAGTGGCCAAGGATGGCATATCAGGGCTCCCTCTGTTGCCTGCTAAAGGTAGAATTGTGTGTAGTGATTGTCAGAGTTGTTCT
GCGACTAGGCTCCTCCAGCACGAATTACGAAATATGGAGGCGGAGGAAACCAAATGTTAA
Protein sequenceShow/hide protein sequence
MHTRVWVLQSSHIHMFLPCPIPGRTLDLPLEPGKKVGVYIEIKVPLASRGRPCPVLRYSTRKVLLPPITNFLHLLLDKYFLLCPDIPTRSLAKGDWVFDSGSSRYMIGEK
SYLSDLKSISDRDNRKCTSQVGAVTLQDNVAMLTIIKSNPQMPCCSASSTLWNFLYSGLTANLISVNQLCDQGHVSMKTIQKTVAKDGISGLPLLPAKGRIVCSDCQSCS
ATRLLQHELRNMEAEETKC