CuGenDBv2

Clc01G14030 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc01G14030
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: ClcChr01:26889782..26890803
RNA-Seq Expression: Clc01G14030
Synteny: Clc01G14030
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0016740 - transferase activity (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
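
The genome location field above follows a `seqid:start..end` pattern. Below is a minimal Python sketch for splitting such a string into fields. It assumes (the page does not state this) that the coordinates are 1-based and inclusive; `GenomeLocation` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


# One sequence identifier, a colon, then two integers separated by '..'.
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'ClcChr01:26889782..26890803'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return GenomeLocation(match["seqid"], start, end)


if __name__ == "__main__":
    loc = parse_location("ClcChr01:26889782..26890803")
    print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span of this gene on ClcChr01 works out to 1022 bp.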


Homology
GenBank top hits (e value, % identity, alignment)
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum] (e value: 2.9e-44; % identity: 54.04)
Query:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG
        GEY  V  +    GIQ R+SCPYTS QNG  ERKHRH+ E G T+LAQA MPL +WW AF T V+LIN L  PS V Q++SP  ++L  + D   LK FG
Subjt:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG

Query:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG
          CYP L+PY  HK   H+ RCV+LG S  HKG++CL + G++ ISRHV FNE+ +PF  G
Subjt:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] (e value: 1.3e-44; % identity: 52.8)
Query:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG
        GE+ +++ V    GIQ+R SCPYTSAQNG  ERKHRH++E+G T+LAQA MPL +WW AF T VFLIN L  P+ V+++KSP + L     D +++K FG
Subjt:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG

Query:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG
          CYP L+PY  HK   H+ +CV+LG S  HKG++CL + G++ ISRHV FNE+ +PF  G
Subjt:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG

KYP75364.1 Copia protein [Cajanus cajan] (e value: 2.9e-44; % identity: 51.55)
Query:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG
        GE+  +  + +  G Q+R+SCPYTS QNG  ERKHRH++E G T+LAQA MPL F W AF T VFLIN L  P+P++++KSP  VLL  + D ++LK FG
Subjt:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG

Query:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG
          CYP ++PY +HK   H+ +CV+LG S  HKG +C+ ++G++ ISRHV FNE+++PF  G
Subjt:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG

MCH94186.1 retrovirus-related pol polyprotein from transposon tnt 1-94 [Trifolium medium] (e value: 5.0e-44; % identity: 54.04)
Query:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG
        GEY  V  +    GIQ R+SCPYTS QNG  ERKHRH+ E G T+LAQA MPL +WW AF T V+LIN L  PS V Q++SP  ++L  + D   LK FG
Subjt:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG

Query:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG
          CYP L+PY  HK   H+ RCV+LG S  HKG++CL + G++ ISRHV FNE+ +PF  G
Subjt:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG

XP_030492909.1 uncharacterized protein LOC115709020 isoform X1 [Cannabis sativa] (e value: 7.7e-45; % identity: 48)
Query:  DVCGEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLK
        D  GEY     + +  GI  + SCP+TSAQNG  ERKHRH++E G T+LAQASMPL +W  AFQT V+LIN L  P+P+L DKSP EV+   K +   LK
Subjt:  DVCGEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLK

Query:  IFGSVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTGKYGSLEEREDWIVERGKGRDL-NLRPKARSRSQPST
         FG+ C+P LRPYQ+HKF  HS++CV LG S   KG++CL++ G++ ISRHV FNE ++PFK G   +       I++      L N+   A +  Q  T
Subjt:  IFGSVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTGKYGSLEEREDWIVERGKGRDL-NLRPKARSRSQPST
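
The % identity column can be related to the alignments shown above. In BLAST-style pairwise output, the row between `Query:` and `Subjt:` conventionally shows a letter where the two residues are identical, `+` for a conservative substitution, and a space otherwise. The Python sketch below counts letters in those middle rows and converts the count to a percentage. It assumes that convention holds for this page and that the whitespace in the middle rows has been preserved; `count_identities` and `percent_identity` are hypothetical helper names.

```python
from typing import Iterable


def count_identities(midline_rows: Iterable[str]) -> int:
    """Count identical positions: letters in the alignment middle rows.

    '+' (conservative substitution) and spaces are not counted.
    """
    return sum(ch.isalpha() for row in midline_rows for ch in row)


def percent_identity(identities: int, aligned_length: int) -> float:
    """Return identities as a percentage of aligned positions, 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * identities / aligned_length, 2)


if __name__ == "__main__":
    # Toy middle rows, with the 'Query:' prefix already stripped off.
    rows = ["GEY  V  +", " CYP L+PY"]
    n = count_identities(rows)
    print(n, percent_identity(n, sum(len(r) for r in rows)))
```

By a manual count, the two middle rows of the first GenBank hit (GAU19483.1) contain 87 letters over 161 aligned positions, which is consistent with the listed 54.04.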

TrEMBL top hits (e value, % identity, alignment)
A0A151S6M8 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 6.4e-45; % identity: 52.8)
Query:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG
        GE+ +++ V    GIQ+R SCPYTSAQNG  ERKHRH++E+G T+LAQA MPL +WW AF T VFLIN L  P+ V+++KSP + L     D +++K FG
Subjt:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG

Query:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG
          CYP L+PY  HK   H+ +CV+LG S  HKG++CL + G++ ISRHV FNE+ +PF  G
Subjt:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG

A0A151U7U2 Copia protein (e value: 1.4e-44; % identity: 51.55)
Query:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG
        GE+  +  + +  G Q+R+SCPYTS QNG  ERKHRH++E G T+LAQA MPL F W AF T VFLIN L  P+P++++KSP  VLL  + D ++LK FG
Subjt:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG

Query:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG
          CYP ++PY +HK   H+ +CV+LG S  HKG +C+ ++G++ ISRHV FNE+++PF  G
Subjt:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG

A0A2Z6MBG6 Integrase catalytic domain-containing protein (e value: 1.4e-44; % identity: 54.04)
Query:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG
        GEY  V  +    GIQ R+SCPYTS QNG  ERKHRH+ E G T+LAQA MPL +WW AF T V+LIN L  PS V Q++SP  ++L  + D   LK FG
Subjt:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG

Query:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG
          CYP L+PY  HK   H+ RCV+LG S  HKG++CL + G++ ISRHV FNE+ +PF  G
Subjt:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG

A0A803Q615 Uncharacterized protein (e value: 3.4e-46; % identity: 57.14)
Query:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG
        GEY    +     GI  + SCP+TSAQNG  ERKHRH++E G T+LAQA +P  +WW AFQT V+LIN L  P+PVL+ KSP+EVL   K D   LK FG
Subjt:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG

Query:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG
          CYP LRPYQSHKF  HS +CV LG S  HKG++CL++ G+L ISR+V FNE+++PF TG
Subjt:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG

A0A803QD60 Uncharacterized protein (e value: 9.8e-46; % identity: 45.09)
Query:  DVCGEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLK
        D+ GEY  + ++    GI    SCP+TSAQNG  +RKHRH++E G T+LAQA MPL +WW  FQT V+LIN L  P+P+L++KSP E L   + D + LK
Subjt:  DVCGEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLK

Query:  IFGSVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTGKYGSLEEREDWIVERGKGRDLNLRPKARSRSQPSTK
        +FG  C+P +RPYQ+HKF  HS++ V LG S  HKG+RCLT  GK+ ISR+V FNE ++PFK G   + ++ +  I+        ++ P     +  ST 
Subjt:  IFGSVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTGKYGSLEEREDWIVERGKGRDLNLRPKARSRSQPSTK

Query:  AVLRPMQGIRPMHDLLRPTHGLLR
        A L P  G  P      PTH  +R
Subjt:  AVLRPMQGIRPMHDLLRPTHGLLR

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 5.8e-11; % identity: 32.5)
Query:  CKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFGSVCYPNLRP
        C   GI   ++ P+T   NG  ER  R + E   TM++ A +  SFW  A  T  +LIN +P  + V   K+P E+    K  +  L++FG+  Y +++ 
Subjt:  CKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFGSVCYPNLRP

Query:  YQSHKFDVHSVRCVYLGPSP
         Q  KFD  S + +++G  P
Subjt:  YQSHKFDVHSVRCVYLGPSP

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 3.6e-13; % identity: 32.28)
Query:  GEYPT--VAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKI
        GEY +      C   GI+   + P T   NG  ER +R ++E   +ML  A +P SFW  A QT  +LIN    PS  L  + P  V    ++  S LK+
Subjt:  GEYPT--VAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKI

Query:  FGSVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCL-TTDGKLLISRHVRFNEND
        FG   + ++   Q  K D  S+ C+++G      G+R       K++ SR V F E++
Subjt:  FGSVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCL-TTDGKLLISRHVRFNEND

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 3.2e-33; % identity: 45.28)
Query:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG
        GE+  +       GI    S P+T   NG  ERKHRH++ETG T+L+ AS+P ++W YAF   V+LIN L  P+P+LQ +SP + L  +  +   L++FG
Subjt:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG

Query:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCL-TTDGKLLISRHVRFNENDYPF
          CYP LRPY  HK D  S +CV+LG S     + CL     +L ISRHVRF+EN +PF
Subjt:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCL-TTDGKLLISRHVRFNENDYPF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 1.2e-32; % identity: 41.71)
Query:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG
        GE+  +       GI    S P+T   NG  ERKHRH++E G T+L+ AS+P ++W YAF   V+LIN L  P+P+LQ +SP + L     +   LK+FG
Subjt:  GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFG

Query:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCL-TTDGKLLISRHVRFNENDYPFKTGKYG---SLEERED
          CYP LRPY  HK +  S +C ++G S     + CL    G+L  SRHV+F+E  +PF T  +G   S E+R D
Subjt:  SVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCL-TTDGKLLISRHVRFNENDYPFKTGKYG---SLEERED

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGATGTTTGTGGAGAATATCCAACTGTGGCTCATGTCTGCAAACACTTGGGTATTCAGATTCGTGTTTCCTGTCCATACACTTCAGCACAAAATGGTTGCGTTGAACG
CAAACACCGGCATCTCATTGAGACTGGCTTTACCATGCTTGCTCAAGCCTCTATGCCTCTGTCATTTTGGTGGTATGCTTTTCAAACAATTGTTTTTCTTATTAATGGTC
TTCCTTTTCCTTCACCTGTTTTACAGGACAAATCACCCATGGAGGTTCTTCTCTACTCAAAACTTGATGTTTCTTCTTTAAAAATCTTTGGGAGTGTTTGTTATCCTAAT
CTTCGACCATATCAGTCTCACAAATTTGATGTTCATAGCGTGCGATGTGTTTACCTTGGCCCGTCTCCAATCCACAAAGGCCATCGGTGTCTTACAACGGATGGTAAATT
ATTAATTTCTCGCCATGTTCGATTCAATGAAAATGATTATCCATTCAAGACAGGAAAATATGGTAGTTTGGAGGAGAGAGAAGATTGGATAGTGGAGAGAGGAAAAGGTC
GAGACCTCAACCTTCGACCAAAGGCAAGGTCGAGATCTCAACCCTCGACCAAAGCAGTCCTTCGACCAATGCAAGGCATTCGACCAATGCATGACCTGCTTCGACCAACG
CATGGCCTGCTTCGGCCAACGCATGGAGGCCTCAACCTTCGACCAACGCGAGGCCTACCTTCGACCAACACAAGGCCTGTTTCGACCAACGCAAGGAGGCCTCAAAGTTT
GACCAACGCAAGGCCTACTTTGACCAACGCAAGGCTACCTTCGACCAACGCATGA
mRNA sequence
ATGGATGTTTGTGGAGAATATCCAACTGTGGCTCATGTCTGCAAACACTTGGGTATTCAGATTCGTGTTTCCTGTCCATACACTTCAGCACAAAATGGTTGCGTTGAACG
CAAACACCGGCATCTCATTGAGACTGGCTTTACCATGCTTGCTCAAGCCTCTATGCCTCTGTCATTTTGGTGGTATGCTTTTCAAACAATTGTTTTTCTTATTAATGGTC
TTCCTTTTCCTTCACCTGTTTTACAGGACAAATCACCCATGGAGGTTCTTCTCTACTCAAAACTTGATGTTTCTTCTTTAAAAATCTTTGGGAGTGTTTGTTATCCTAAT
CTTCGACCATATCAGTCTCACAAATTTGATGTTCATAGCGTGCGATGTGTTTACCTTGGCCCGTCTCCAATCCACAAAGGCCATCGGTGTCTTACAACGGATGGTAAATT
ATTAATTTCTCGCCATGTTCGATTCAATGAAAATGATTATCCATTCAAGACAGGAAAATATGGTAGTTTGGAGGAGAGAGAAGATTGGATAGTGGAGAGAGGAAAAGGTC
GAGACCTCAACCTTCGACCAAAGGCAAGGTCGAGATCTCAACCCTCGACCAAAGCAGTCCTTCGACCAATGCAAGGCATTCGACCAATGCATGACCTGCTTCGACCAACG
CATGGCCTGCTTCGGCCAACGCATGGAGGCCTCAACCTTCGACCAACGCGAGGCCTACCTTCGACCAACACAAGGCCTGTTTCGACCAACGCAAGGAGGCCTCAAAGTTT
GACCAACGCAAGGCCTACTTTGACCAACGCAAGGCTACCTTCGACCAACGCATGA
Protein sequence
MDVCGEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFGSVCYPN
LRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTGKYGSLEEREDWIVERGKGRDLNLRPKARSRSQPSTKAVLRPMQGIRPMHDLLRPT
HGLLRPTHGGLNLRPTRGLPSTNTRPVSTNARRPQSLTNARPTLTNARLPSTNA
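
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The sketch below is a minimal Python example that assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly; `CODON_TABLE` and `translate` are hypothetical names, and only short fragments of the CDS are used in the demonstration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nucleotides and last 9 nucleotides of the CDS listed above.
    print(translate("ATGGATGTTTGTGGAGAATATCCAACTGTG"))
    print(translate("AACGCATGA"))
```

Passing the full CDS (with its line breaks, which `translate` strips) should let the result be compared directly with the protein sequence listed above.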