; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G16080 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G16080
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationClcChr09:24796602..24797776
RNA-Seq ExpressionClc09G16080
SyntenyClc09G16080
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033068.1 gag-pol polyprotein [Cucumis melo var. makuwa]4.6e-6151.16Show/hide
Query:  KNSIESE--------------FFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFS
        KNSIESE              FFRAEQKAESVTNYFMRLK+I A L LLLPFSPDVKVQQAQREKM V IFLNGLLPEFGM K QILSDSKIPSLD+AF+
Subjt:  KNSIESE--------------FFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFS

Query:  RVLRIESSQSN--------------------------------------SQEIVCNYCRKPGHLKRDCRKLLYKN-QRSQQAQIASTSDMTEKSVTTFTE
        RVLRIESS +                                       S +IVCNYC KPGH+KRDCRKLLYKN Q+SQ AQIAST D+ E SVT   +
Subjt:  RVLRIESSQSN--------------------------------------SQEIVCNYCRKPGHLKRDCRKLLYKN-QRSQQAQIASTSDMTEKSVTTFTE

Query:  EFAKFQ--------------------------------------ATKTRCKHHLHLILLP------PLLTQDRMMKKIIGRGYESGGLYLFDHQIPKVVA
        E+ KFQ                                      +  T    +  L   P      P +T DR+MKKIIGRGYESGGLYLFDHQ+ + VA
Subjt:  EFAKFQ--------------------------------------ATKTRCKHHLHLILLP------PLLTQDRMMKKIIGRGYESGGLYLFDHQIPKVVA

Query:  C
        C
Subjt:  C

KAA0054107.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.3e-4452.78Show/hide
Query:  EFFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFSRVLRIESSQSNSQEIVCNYC
        +FF  EQKAESVTNYFMRLK+I AEL LLLPFSPDVKVQQAQREKM V+IFLNGLLPEFGMAK QILSDSKIPSLD+AF+RVLRIESS +          
Subjt:  EFFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFSRVLRIESSQSNSQEIVCNYC

Query:  RKPGHLKRDCRKLLYKNQ-----RSQQAQIASTSDMTEKSVTTFTEEFAKFQATKTRCKHHLHLILLPPLLTQ-------------DRMMKKIIGRGYES
             + +    L+ KN      R+    IAST  + E SVT   + +AKFQ  +   +       + P +               DR+ KKIIG+G+ES
Subjt:  RKPGHLKRDCRKLLYKNQ-----RSQQAQIASTSDMTEKSVTTFTEEFAKFQATKTRCKHHLHLILLPPLLTQ-------------DRMMKKIIGRGYES

Query:  GGLYLFDHQIPKVVAC
         GLYLF+HQ+ + VAC
Subjt:  GGLYLFDHQIPKVVAC

TYK29397.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.9e-4452.31Show/hide
Query:  EFFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFSRVLRIESSQSNSQEIVCNYC
        +FF  EQK ESVTNYFMRLK+I AEL LLLPFSPDVKVQQAQREKM V+IFLNGLLPEFGMAK QILSDSKIPSLD+AF+RVLRIESS +          
Subjt:  EFFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFSRVLRIESSQSNSQEIVCNYC

Query:  RKPGHLKRDCRKLLYKNQ-----RSQQAQIASTSDMTEKSVTTFTEEFAKFQATKTRCKHHLHLILLPPLLTQ-------------DRMMKKIIGRGYES
             + +    L+ KN      R+    IAST  + E SVT   + +AKFQ  +   +       + P +               DR+ KKIIG+G+ES
Subjt:  RKPGHLKRDCRKLLYKNQ-----RSQQAQIASTSDMTEKSVTTFTEEFAKFQATKTRCKHHLHLILLPPLLTQ-------------DRMMKKIIGRGYES

Query:  GGLYLFDHQIPKVVAC
         GLYLF+HQ+ + VAC
Subjt:  GGLYLFDHQIPKVVAC

TYK30615.1 Copia protein [Cucumis melo var. makuwa]8.1e-4246.69Show/hide
Query:  EFFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFSRVLRIESS------------
        +F RAEQKAESVTNYFMRLK+I AEL LLLPFSPDV                          K QILSDSKIPSLD AF+RVLR ESS            
Subjt:  EFFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFSRVLRIESS------------

Query:  --------------------------QSNSQEIVCNYCRKPGHLKRDCRKLLYKN-QRSQQAQIASTSDMTEKSVTTFTEEFAKFQ--------------
                                  + +S EIVCNYCRKP H KRDCRKLLYKN Q+SQ AQIAST D+ E S+T    E AK Q              
Subjt:  --------------------------QSNSQEIVCNYCRKPGHLKRDCRKLLYKN-QRSQQAQIASTSDMTEKSVTTFTEEFAKFQ--------------

Query:  ------ATKTRCKHHLHLILLPPLLTQDRMMKKIIGRGYESGGLYLFDHQIPKVVAC
                 T+C       LL      DR+ KKIIG+GYESGGLYLFDHQ+ + VAC
Subjt:  ------ATKTRCKHHLHLILLPPLLTQDRMMKKIIGRGYESGGLYLFDHQIPKVVAC

XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]1.1e-5744.73Show/hide
Query:  NFIFRSKNSIE------SEFFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFSRV
        +F++  K  +        +FFRAEQKAESVT+YFMRLK+I AEL LLLPFSPDVKVQQ QREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLD+AF+RV
Subjt:  NFIFRSKNSIE------SEFFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFSRV

Query:  LRIESSQSN-------------------------------SQEIVCNYCRKPGHLKRDCRKLLYKN-QRSQQAQIASTSDMTEKSVTTFTEEFAKFQ---
        LRIESS ++                               S EIVCNYCRKPGH+KRDCRKLLYKN QRSQ AQIAST D+ E SVT   +EFAKFQ   
Subjt:  LRIESSQSN-------------------------------SQEIVCNYCRKPGHLKRDCRKLLYKN-QRSQQAQIASTSDMTEKSVTTFTEEFAKFQ---

Query:  ---------------------------ATK----------------------------------------------------------------------
                                   +TK                                                                      
Subjt:  ---------------------------ATK----------------------------------------------------------------------

Query:  TRCKHHLHLILL---PPLLTQDRMMKKIIGRGYESGGLYLFDHQIPKVVAC
        ++  H L+ +++      L QDR+ KKIIGRGYESGGLYLFDHQ+ + VAC
Subjt:  TRCKHHLHLILL---PPLLTQDRMMKKIIGRGYESGGLYLFDHQIPKVVAC

TrEMBL top hitse value%identityAlignment
A0A5A7SR90 Gag-pol polyprotein2.2e-6151.16Show/hide
Query:  KNSIESE--------------FFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFS
        KNSIESE              FFRAEQKAESVTNYFMRLK+I A L LLLPFSPDVKVQQAQREKM V IFLNGLLPEFGM K QILSDSKIPSLD+AF+
Subjt:  KNSIESE--------------FFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFS

Query:  RVLRIESSQSN--------------------------------------SQEIVCNYCRKPGHLKRDCRKLLYKN-QRSQQAQIASTSDMTEKSVTTFTE
        RVLRIESS +                                       S +IVCNYC KPGH+KRDCRKLLYKN Q+SQ AQIAST D+ E SVT   +
Subjt:  RVLRIESSQSN--------------------------------------SQEIVCNYCRKPGHLKRDCRKLLYKN-QRSQQAQIASTSDMTEKSVTTFTE

Query:  EFAKFQ--------------------------------------ATKTRCKHHLHLILLP------PLLTQDRMMKKIIGRGYESGGLYLFDHQIPKVVA
        E+ KFQ                                      +  T    +  L   P      P +T DR+MKKIIGRGYESGGLYLFDHQ+ + VA
Subjt:  EFAKFQ--------------------------------------ATKTRCKHHLHLILLP------PLLTQDRMMKKIIGRGYESGGLYLFDHQIPKVVA

Query:  C
        C
Subjt:  C

A0A5A7T406 Copia protein3.3e-4146.3Show/hide
Query:  EFFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFSRVLRIESS------------
        +F RAEQKAESVTNYFMRLK+I AEL LLLPFSPDV                          K QILSDSKIPSLD AF+RVL  ESS            
Subjt:  EFFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFSRVLRIESS------------

Query:  --------------------------QSNSQEIVCNYCRKPGHLKRDCRKLLYKN-QRSQQAQIASTSDMTEKSVTTFTEEFAKFQ--------------
                                  + +S EIVCNYCRKP H KRDCRKLLYKN Q+SQ AQIAST D+ E S+T    E AK Q              
Subjt:  --------------------------QSNSQEIVCNYCRKPGHLKRDCRKLLYKN-QRSQQAQIASTSDMTEKSVTTFTEEFAKFQ--------------

Query:  ------ATKTRCKHHLHLILLPPLLTQDRMMKKIIGRGYESGGLYLFDHQIPKVVAC
                 T+C       LL      DR+ KKIIG+GYESGGLYLFDHQ+ + VAC
Subjt:  ------ATKTRCKHHLHLILLPPLLTQDRMMKKIIGRGYESGGLYLFDHQIPKVVAC

A0A5A7UHS1 Retrovirus-related Pol polyprotein from transposon TNT 1-946.5e-4552.78Show/hide
Query:  EFFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFSRVLRIESSQSNSQEIVCNYC
        +FF  EQKAESVTNYFMRLK+I AEL LLLPFSPDVKVQQAQREKM V+IFLNGLLPEFGMAK QILSDSKIPSLD+AF+RVLRIESS +          
Subjt:  EFFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFSRVLRIESSQSNSQEIVCNYC

Query:  RKPGHLKRDCRKLLYKNQ-----RSQQAQIASTSDMTEKSVTTFTEEFAKFQATKTRCKHHLHLILLPPLLTQ-------------DRMMKKIIGRGYES
             + +    L+ KN      R+    IAST  + E SVT   + +AKFQ  +   +       + P +               DR+ KKIIG+G+ES
Subjt:  RKPGHLKRDCRKLLYKNQ-----RSQQAQIASTSDMTEKSVTTFTEEFAKFQATKTRCKHHLHLILLPPLLTQ-------------DRMMKKIIGRGYES

Query:  GGLYLFDHQIPKVVAC
         GLYLF+HQ+ + VAC
Subjt:  GGLYLFDHQIPKVVAC

A0A5D3DZU1 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-4452.31Show/hide
Query:  EFFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFSRVLRIESSQSNSQEIVCNYC
        +FF  EQK ESVTNYFMRLK+I AEL LLLPFSPDVKVQQAQREKM V+IFLNGLLPEFGMAK QILSDSKIPSLD+AF+RVLRIESS +          
Subjt:  EFFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFSRVLRIESSQSNSQEIVCNYC

Query:  RKPGHLKRDCRKLLYKNQ-----RSQQAQIASTSDMTEKSVTTFTEEFAKFQATKTRCKHHLHLILLPPLLTQ-------------DRMMKKIIGRGYES
             + +    L+ KN      R+    IAST  + E SVT   + +AKFQ  +   +       + P +               DR+ KKIIG+G+ES
Subjt:  RKPGHLKRDCRKLLYKNQ-----RSQQAQIASTSDMTEKSVTTFTEEFAKFQATKTRCKHHLHLILLPPLLTQ-------------DRMMKKIIGRGYES

Query:  GGLYLFDHQIPKVVAC
         GLYLF+HQ+ + VAC
Subjt:  GGLYLFDHQIPKVVAC

A0A5D3E5M8 Copia protein3.9e-4246.69Show/hide
Query:  EFFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFSRVLRIESS------------
        +F RAEQKAESVTNYFMRLK+I AEL LLLPFSPDV                          K QILSDSKIPSLD AF+RVLR ESS            
Subjt:  EFFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFSRVLRIESS------------

Query:  --------------------------QSNSQEIVCNYCRKPGHLKRDCRKLLYKN-QRSQQAQIASTSDMTEKSVTTFTEEFAKFQ--------------
                                  + +S EIVCNYCRKP H KRDCRKLLYKN Q+SQ AQIAST D+ E S+T    E AK Q              
Subjt:  --------------------------QSNSQEIVCNYCRKPGHLKRDCRKLLYKN-QRSQQAQIASTSDMTEKSVTTFTEEFAKFQ--------------

Query:  ------ATKTRCKHHLHLILLPPLLTQDRMMKKIIGRGYESGGLYLFDHQIPKVVAC
                 T+C       LL      DR+ KKIIG+GYESGGLYLFDHQ+ + VAC
Subjt:  ------ATKTRCKHHLHLILLPPLLTQDRMMKKIIGRGYESGGLYLFDHQIPKVVAC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGCACAACTTTATCTTCAGATCAAAAAATTCGATTGAGAGCGAGTTCTTTCGCGCCGAACAGAAAGCAGAGTCTGTGACCAACTACTTTATGAGACTTAAGAGAAT
AGCTGCCGAACTTACTTTGTTACTACCTTTCAGCCCAGATGTTAAGGTACAACAAGCTCAGCGAGAAAAGATGGCTGTTATGATCTTTCTGAATGGACTTTTACCTGAAT
TTGGAATGGCCAAAACACAAATTCTTTCTGACTCTAAAATCCCGTCATTAGACGAGGCTTTCAGTCGTGTTCTTCGTATTGAGAGTTCTCAATCCAATTCTCAGGAGATT
GTCTGTAACTACTGTCGTAAGCCTGGTCATTTGAAACGTGACTGTCGGAAATTGTTGTATAAGAATCAACGATCTCAGCAGGCTCAGATAGCTTCCACCAGTGATATGAC
AGAGAAGTCAGTTACCACTTTCACAGAGGAGTTTGCTAAATTTCAGGCTACGAAGACTCGTTGCAAGCATCATCTTCATCTAATCCTACTGCCACCATTGCTGACACAGG
ATCGTATGATGAAGAAGATTATTGGTAGAGGATATGAGTCAGGAGGCCTTTATCTTTTTGATCACCAGATACCGAAAGTTGTGGCTTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGCACAACTTTATCTTCAGATCAAAAAATTCGATTGAGAGCGAGTTCTTTCGCGCCGAACAGAAAGCAGAGTCTGTGACCAACTACTTTATGAGACTTAAGAGAAT
AGCTGCCGAACTTACTTTGTTACTACCTTTCAGCCCAGATGTTAAGGTACAACAAGCTCAGCGAGAAAAGATGGCTGTTATGATCTTTCTGAATGGACTTTTACCTGAAT
TTGGAATGGCCAAAACACAAATTCTTTCTGACTCTAAAATCCCGTCATTAGACGAGGCTTTCAGTCGTGTTCTTCGTATTGAGAGTTCTCAATCCAATTCTCAGGAGATT
GTCTGTAACTACTGTCGTAAGCCTGGTCATTTGAAACGTGACTGTCGGAAATTGTTGTATAAGAATCAACGATCTCAGCAGGCTCAGATAGCTTCCACCAGTGATATGAC
AGAGAAGTCAGTTACCACTTTCACAGAGGAGTTTGCTAAATTTCAGGCTACGAAGACTCGTTGCAAGCATCATCTTCATCTAATCCTACTGCCACCATTGCTGACACAGG
ATCGTATGATGAAGAAGATTATTGGTAGAGGATATGAGTCAGGAGGCCTTTATCTTTTTGATCACCAGATACCGAAAGTTGTGGCTTGCTGA
Protein sequenceShow/hide protein sequence
MMHNFIFRSKNSIESEFFRAEQKAESVTNYFMRLKRIAAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLDEAFSRVLRIESSQSNSQEI
VCNYCRKPGHLKRDCRKLLYKNQRSQQAQIASTSDMTEKSVTTFTEEFAKFQATKTRCKHHLHLILLPPLLTQDRMMKKIIGRGYESGGLYLFDHQIPKVVAC