CuGenDBv2

Clc05G14220 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc05G14220
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: ClcChr05:13991206..13994743
RNA-Seq Expression: Clc05G14220
Synteny: Clc05G14220
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (description, e-value, % identity; alignment follows each hit)
WP_217833168.1 hypothetical protein, partial [Synechococcus sp. PCC 7002], e-value 3.8e-74, identity 93.67%
Query:  RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMK
        RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDA MLE TFSTFHASNML+QQQY+EKGFKQYSELISCLLVAEQNNELLMK
Subjt:  RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMK

Query:  NHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGRNSYYFRGGHSNHPNSK
        NHESRPTGTTPFPEVNAVNFNNRGR GRGRGRG  RD GRGRNSYYFRG HSNHPN K
Subjt:  NHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGRNSYYFRGGHSNHPNSK

XP_022144017.1 uncharacterized protein LOC111013806 [Momordica charantia], e-value 5.3e-68, identity 86.71%
Query:  RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMK
        RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKL LCGEKITD+DMLEKT+STFH SN+LLQQQY+EKGFK+YSELISCLLVA+QNNELLMK
Subjt:  RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMK

Query:  NHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGRNSYYFRGGHSNHPNSK
        NHESRPTG TPFPE NAVNFNNRGR GRGRGRG  RD GRGRN++ FRGG  N PN K
Subjt:  NHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGRNSYYFRGGHSNHPNSK

XP_028767565.1 uncharacterized protein LOC114725248 [Prosopis alba], e-value 5.9e-59, identity 73.65%
Query:  RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMK
        RYDHQKTVILPKARYEWMHLRLQDFKSV++YNSA+FKISS+L LCGEKITD D+LEKTFSTFHASN+LLQQQY+EKGFK+YSELISCLLVAEQNNELLMK
Subjt:  RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMK

Query:  NHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGR---NSYYFRGGHSNH-PNSKEPPEM
        NHE RPTG+ PFPE N    NN    GRGRGRG  RD GRGR   N  ++ GG +N+ PN+ +  ++
Subjt:  NHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGR---NSYYFRGGHSNH-PNSKEPPEM

XP_028788524.1 uncharacterized protein LOC114744522 [Prosopis alba], e-value 2.2e-58, identity 73.05%
Query:  RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMK
        RYDHQKTVILPKA YEWMHLRLQDFKSV++YNSA+FKISS+L LCGEKITD D+LEKTFSTFHASN+LLQQQY+EKGFK+YSELISCLLVAEQNNELLMK
Subjt:  RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMK

Query:  NHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGR---NSYYFRGGHSNH-PNSKEPPEM
        NHE RPTG+ PFPE N    NN    GRGRGRG  RD GRGR   N  ++ GG +N+ PN+ +  ++
Subjt:  NHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGR---NSYYFRGGHSNH-PNSKEPPEM

XP_028805625.1 uncharacterized protein LOC114760535 [Prosopis alba], e-value 1.6e-59, identity 73.05%
Query:  RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMK
        RYDHQKTVILPKARYEWMHLRLQDFKSV++YNSA+FKISS+L LCGEKITD D+LEKTFSTFHASN+LLQQQY+EKGFK+YSELISCLLVAEQNNELLMK
Subjt:  RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMK

Query:  NHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGRN----SYYFRGGHSNHPNSKEPPEM
        NHE RPTG+ PFPE N    NN    GRGRGRG  RD GRGR      +Y RG ++ +PN+ +  ++
Subjt:  NHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGRN----SYYFRGGHSNHPNSKEPPEM

TrEMBL top hits (description, e-value, % identity; alignment follows each hit)
A0A2G9HA29 Uncharacterized protein, e-value 5.0e-56, identity 74.51%
Query:  RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMK
        RYDHQKTVILP ARY+WMHLRLQDFKSVS YNSA+FKISS+L LCGE ITD D+LEK FSTFH SN+LLQQQY+EKGFK++SELISCLLVAEQNNELLM+
Subjt:  RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMK

Query:  NHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGRNSYYFRGGHSN
        NHE RPTG  PFP+VN + +NN  R G   GR H R  GRGRN+Y F+GGH++
Subjt:  NHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGRNSYYFRGGHSN

A0A438ID00 Retrovirus-related Pol polyprotein from transposon TNT 1-94, e-value 8.6e-56, identity 76.62%
Query:  RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMK
        RYDHQKTVILPKARY+WMHLRLQDFK+VS+YNSALFKISS+L LCGEKIT+ DMLEKTF+TFHASN+LLQQQY+E+ F +YSELISCLLVAEQNNELLM+
Subjt:  RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMK

Query:  NHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGRNSYYFRGGHSNH
        NH+SRPTG+ PFPEVNA++   RGR GRGRGRG     GRGRN  Y  G +SN+
Subjt:  NHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGRNSYYFRGGHSNH

A0A6J1CSH6 uncharacterized protein LOC111013806, e-value 2.6e-68, identity 86.71%
Query:  RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMK
        RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKL LCGEKITD+DMLEKT+STFH SN+LLQQQY+EKGFK+YSELISCLLVA+QNNELLMK
Subjt:  RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMK

Query:  NHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGRNSYYFRGGHSNHPNSK
        NHESRPTG TPFPE NAVNFNNRGR GRGRGRG  RD GRGRN++ FRGG  N PN K
Subjt:  NHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGRNSYYFRGGHSNHPNSK

A5AJ43 Uncharacterized protein, e-value 8.6e-56, identity 76.62%
Query:  RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMK
        RYDHQKTVILPKARY+WMHLRLQDFK+VS+YNSALFKISS+L LCGEKIT+ DMLEKTF+TFHASN+LLQQQY+E+ F +YSELISCLLVAEQNNELLM+
Subjt:  RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMK

Query:  NHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGRNSYYFRGGHSNH
        NH+SRPTG+ PFPEVNA++   RGR GRGRGRG     GRGRN  Y  G +SN+
Subjt:  NHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGRNSYYFRGGHSNH

A5AXH9 Uncharacterized protein, e-value 1.1e-55, identity 64.36%
Query:  RRRSTTTIDTPRRRMAICDGGGTKHEEWLAVEKQSGKKRKNEGSGGVHRYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDA
        R  S+      R  +  C G G K+ E+L V     K      S    RYDHQKTVILPKARY+WMHLRLQDFK+VS+YNSALFKISS+L LCGEKIT+ 
Subjt:  RRRSTTTIDTPRRRMAICDGGGTKHEEWLAVEKQSGKKRKNEGSGGVHRYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDA

Query:  DMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGRNSYYFRGGHS
        DMLEKTF+TFHASN+LLQQQY+E+ F +YSELISCLLVAEQNNELLM+NH+SRPT   PFPEVNA++   RGR GRGRGRG  R  GRGRN  Y  G +S
Subjt:  DMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFPEVNAVNFNNRGRAGRGRGRGHCRDHGRGRNSYYFRGGHS

Query:  NH
        N+
Subjt:  NH

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
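
The % identity reported for each hit can be recovered from the alignment midline (the unlabelled line between Query and Subjt). The sketch below assumes the usual BLAST-style convention, in which a letter marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap column. The function name percent_identity is illustrative and not part of CuGenDB; the midline strings are those of the first GenBank hit (WP_217833168.1) as shown on this page.

```python
def percent_identity(midline_blocks):
    """Percent identity from BLAST-style midline blocks.

    Letters count as identical positions; '+' and spaces do not.
    Each block must keep its full width, including internal spaces.
    """
    aligned = sum(len(block) for block in midline_blocks)
    identical = sum(ch.isalpha() for block in midline_blocks for ch in block)
    return 100.0 * identical / aligned


# Midline of the first GenBank hit (WP_217833168.1), one string per block.
midline = [
    "RYDHQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDA MLE TFSTFHASNML+QQQY+EKGFKQYSELISCLLVAEQNNELLMK",
    "NHESRPTGTTPFPEVNAVNFNNRGR GRGRGRG  RD GRGRNSYYFRG HSNHPN K",
]

print(f"{percent_identity(midline):.2f}")
```

For this hit the midline has 148 letters over 158 aligned columns, which agrees with the listed 93.67.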

Sequences
CDS sequence
ATGCGGCTGATGGAGCGGCAATGCTTGGACGTGCGGTTGGCGTCGATTGATCTCAGCCCACGAATGAAGCTCCGAATGGCTACCGACGCGATGAATGGCGAGGCGGACTG
CTCACGGACGCCCACGGTCGAACACGCTTCGCGTCCTCGAATCGACAACGGCGCTCGATGCCTGGCGAGGCGGCGATCGACGACGACGATAGACACACCTCGACGGCGAA
TGGCAATATGCGATGGCGGGGGCACTAAGCATGAAGAATGGTTGGCGGTAGAGAAACAATCGGGAAAGAAGAGGAAAAACGAGGGCAGTGGCGGCGTTCACAGATATGAT
CATCAGAAAACAGTCATTCTTCCTAAAGCTCGTTATGAATGGATGCATTTGAGGCTCCAAGATTTCAAATCAGTCAGTGATTACAACTCCGCGTTATTTAAAATCAGTTC
AAAATTATTATTGTGCGGAGAGAAAATTACTGATGCGGATATGTTGGAGAAGACATTTTCTACATTCCATGCCTCGAATATGCTCCTGCAGCAGCAATATCAAGAAAAAG
GTTTTAAACAATATTCTGAATTAATTTCATGTCTTCTCGTGGCTGAACAAAATAATGAGCTATTAATGAAGAACCATGAATCTCGACCAACTGGAACAACACCATTCCCT
GAAGTGAATGCTGTAAATTTTAATAATCGTGGTCGAGCTGGTCGTGGTCGTGGTCGTGGTCATTGTCGTGACCATGGCAGAGGAAGAAATAGTTATTATTTTCGTGGTGG
TCATTCTAATCATCCAAATTCAAAAGAACCACCCGAAATGATGATCATAAAGGAAAAGCTTCACAAGATAAAAATTCAAAAGATGATGAGCATAAATGCTTCCGATGCGG
GATGA
mRNA sequence
ATGCGGCTGATGGAGCGGCAATGCTTGGACGTGCGGTTGGCGTCGATTGATCTCAGCCCACGAATGAAGCTCCGAATGGCTACCGACGCGATGAATGGCGAGGCGGACTG
CTCACGGACGCCCACGGTCGAACACGCTTCGCGTCCTCGAATCGACAACGGCGCTCGATGCCTGGCGAGGCGGCGATCGACGACGACGATAGACACACCTCGACGGCGAA
TGGCAATATGCGATGGCGGGGGCACTAAGCATGAAGAATGGTTGGCGGTAGAGAAACAATCGGGAAAGAAGAGGAAAAACGAGGGCAGTGGCGGCGTTCACAGATATGAT
CATCAGAAAACAGTCATTCTTCCTAAAGCTCGTTATGAATGGATGCATTTGAGGCTCCAAGATTTCAAATCAGTCAGTGATTACAACTCCGCGTTATTTAAAATCAGTTC
AAAATTATTATTGTGCGGAGAGAAAATTACTGATGCGGATATGTTGGAGAAGACATTTTCTACATTCCATGCCTCGAATATGCTCCTGCAGCAGCAATATCAAGAAAAAG
GTTTTAAACAATATTCTGAATTAATTTCATGTCTTCTCGTGGCTGAACAAAATAATGAGCTATTAATGAAGAACCATGAATCTCGACCAACTGGAACAACACCATTCCCT
GAAGTGAATGCTGTAAATTTTAATAATCGTGGTCGAGCTGGTCGTGGTCGTGGTCGTGGTCATTGTCGTGACCATGGCAGAGGAAGAAATAGTTATTATTTTCGTGGTGG
TCATTCTAATCATCCAAATTCAAAAGAACCACCCGAAATGATGATCATAAAGGAAAAGCTTCACAAGATAAAAATTCAAAAGATGATGAGCATAAATGCTTCCGATGCGG
GATGA
Protein sequence
MRLMERQCLDVRLASIDLSPRMKLRMATDAMNGEADCSRTPTVEHASRPRIDNGARCLARRRSTTTIDTPRRRMAICDGGGTKHEEWLAVEKQSGKKRKNEGSGGVHRYD
HQKTVILPKARYEWMHLRLQDFKSVSDYNSALFKISSKLLLCGEKITDADMLEKTFSTFHASNMLLQQQYQEKGFKQYSELISCLLVAEQNNELLMKNHESRPTGTTPFP
EVNAVNFNNRGRAGRGRGRGHCRDHGRGRNSYYFRGGHSNHPNSKEPPEMMIIKEKLHKIKIQKMMSINASDAG
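
The CDS and protein sequences above can be cross-checked by translating the CDS in frame. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1). The helper translate is illustrative, and only the first 30 and last 18 nucleotides of the CDS are used here rather than the full sequence.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGCGGCTGATGGAGCGGCAATGCTTGGAC"  # first 30 nt of the CDS
cds_end = "GCTTCCGATGCGGGATGA"                # last 18 nt, including the stop

print(translate(cds_start))  # MRLMERQCLD
print(translate(cds_end))    # ASDAG*
```

Both fragments agree with the start (MRLMERQCLD) and end (ASDAG) of the listed protein sequence. Pasting the full CDS into translate gives the same check over the whole record.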