; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G02110 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G02110
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationClcChr08:3889159..3889572
RNA-Seq ExpressionClc08G02110
SyntenyClc08G02110
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038882247.1 uncharacterized protein LOC120073473 [Benincasa hispida]4.0e-2645.21Show/hide
Query:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPNIAKAQE-----------------KLESLLATKDLPNKMFLGETFFTFKMDASKTY
        M+  R++I+ FDGK DF LWK K+K VLGQQKA  A+ + +  P    A E                 KL  +   KDL NK FL E FFT KMD +K+ 
Subjt:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPNIAKAQE-----------------KLESLLATKDLPNKMFLGETFFTFKMDASKTY

Query:  IENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK
         +NL+E+K++  +F+S+GD +G++NEA++LLNSLL+++KDVK A+K
Subjt:  IENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK

XP_038885928.1 uncharacterized protein LOC120076236 [Benincasa hispida]1.7e-2441.46Show/hide
Query:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPNIA-KAQE----------------------------------KLESLLATKDLPNK
        M+  R++I+ F+ K DF LWKAK+K VL +QKA  A+ D +  P I  KA++                                  KL  +   KDLPNK
Subjt:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPNIA-KAQE----------------------------------KLESLLATKDLPNK

Query:  MFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK
         FL E FFT+KMD +K+  +NL+E+K + S+F+S+GD +G++NEA++LLNSL +T+KDVK ALK
Subjt:  MFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK

XP_038887098.1 uncharacterized protein LOC120077280 [Benincasa hispida]6.4e-2441.46Show/hide
Query:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPNIAKAQE-----------------------------------KLESLLATKDLPNK
        M+  +++I+ FD K DF L KAK+KAVLGQQKA  A+ D S  P      E                                   KL  +   KD PNK
Subjt:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPNIAKAQE-----------------------------------KLESLLATKDLPNK

Query:  MFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK
         FL E FFT+KMD +K+  +NL+E+K++ SEF+S+GD +G++NEA++L NSL +T+KDVK ALK
Subjt:  MFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK

XP_038896323.1 uncharacterized protein LOC120084587 [Benincasa hispida]6.4e-4060.81Show/hide
Query:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPNIAKAQE-------------------KLESLLATKDLPNKMFLGETFFTFKMDASK
        MS+ RF+++ FDGKGDFGLWKAK+KA+L QQKAH+ALL+ STL     AQE                   KLE L ATKDLP+KM+L E FFTFKMD+SK
Subjt:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPNIAKAQE-------------------KLESLLATKDLPNKMFLGETFFTFKMDASK

Query:  TYIENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK
        T   N DE+KKIV+EFK+LG+KL DKNEAYVL NSL ++YK++KNALK
Subjt:  TYIENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK

XP_038904517.1 uncharacterized protein LOC120090894 [Benincasa hispida]2.9e-2443.15Show/hide
Query:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPN------------IAKAQ-----EKLESLLATKDLPNKMFLGETFFTFKMDASKTY
        M+  R++I+ F+G  DFGLWK K+KAVLGQQKA  A+ D +  P             + KA       KL  +   KDLPNK F  + F T+KMD  K+ 
Subjt:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPN------------IAKAQ-----EKLESLLATKDLPNKMFLGETFFTFKMDASKTY

Query:  IENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK
         +NL+++K++ SEFKS+GD +G++N+A++LLNSL +++KDV   +K
Subjt:  IENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK

TrEMBL top hitse value%identityAlignment
A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class1.7e-2239.63Show/hide
Query:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLP-NIAKAQ----------------------------------EKLESLLATKDLPNK
        M+  RF++  F+G GDF LW+ K++A+L Q K  K +LD   LP NI +++                                  +KLESL  TK LPNK
Subjt:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLP-NIAKAQ----------------------------------EKLESLLATKDLPNK

Query:  MFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK
        +++ E FF +KMD SK   ENLDE++KIV +  ++G+K+ D+N+A +LLNSL +TY++VK A+K
Subjt:  MFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK

A0A5A7U6R2 Retrovirus-related Pol polyprotein from transposon TNT 1-942.6e-2340.85Show/hide
Query:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLP-NIAKAQ----------------------------------EKLESLLATKDLPNK
        M+  +F+I+ FDG GDF LW  ++ A+LG QKA KAL D   LP  + K++                                  EKL+SL   KDLPNK
Subjt:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLP-NIAKAQ----------------------------------EKLESLLATKDLPNK

Query:  MFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK
        MF+ E  F+FK + +K   ENLDE+KK+ +     G+KLG +NEA +L+NS+ DTYK+VK  LK
Subjt:  MFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK

A0A5A7UB25 Putative gag-pol polyprotein1.4e-2139.02Show/hide
Query:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLP-NIAKAQ----------------------------------EKLESLLATKDLPNK
        M+  RF++  F+G GDF LW+ K++A+L Q K  K +LD   LP NI +++                                  +KLESL  TK L NK
Subjt:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLP-NIAKAQ----------------------------------EKLESLLATKDLPNK

Query:  MFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK
        +++ E FF +KMD SK+  ENLDE++KIV +  ++G+K+ D+N+A +LLNSL +TY++VK A+K
Subjt:  MFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK

A0A5C7HX22 Uncharacterized protein1.7e-2249.23Show/hide
Query:  QNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPN-------IAKAQEKLESLLATKDLPNKMFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLG
        + FDG GDFG+W+ KVKA+L QQK  +A+ D   LP+            +KLESL  TK L NK++L E  F+FKMDASK   +NLDE+KK++ E  + G
Subjt:  QNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPN-------IAKAQEKLESLLATKDLPNKMFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLG

Query:  --DKLGDKNEAYVLLNSLLDTYKDVKNALK
          +KL D+NEA +LLNSL +++ DVK A+K
Subjt:  --DKLGDKNEAYVLLNSLLDTYKDVKNALK

A0A5D3DNU1 Putative gag-pol polyprotein1.0e-2239.63Show/hide
Query:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLP-NIAKAQ----------------------------------EKLESLLATKDLPNK
        M+  RF++  F+G GDF LW+ K++A+L Q K  K +LD   LP NI +++                                  +KLESL  TK LPNK
Subjt:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLP-NIAKAQ----------------------------------EKLESLLATKDLPNK

Query:  MFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK
        +++ E FF +KMD SK+  ENLDE++KIV +  ++G+K+ D+N+A +LLNSL +TY++VK A+K
Subjt:  MFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNALK

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.6e-0922.7Show/hide
Query:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPNIAKAQE-----------------------------------KLESLLATKDLPNK
        MS +++++  F+G   F  W+ +++ +L QQ  HK L   S  P+  KA++                                   +LESL  +K L NK
Subjt:  MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPNIAKAQE-----------------------------------KLESLLATKDLPNK

Query:  MFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNAL
        ++L +  +   M     ++ +L+ +  ++++  +LG K+ ++++A +LLNSL  +Y ++   +
Subjt:  MFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLGDKLGDKNEAYVLLNSLLDTYKDVKNAL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTAATAAGATTCAAAATTCAAAATTTTGATGGAAAGGGTGATTTTGGTCTTTGGAAGGCCAAAGTCAAAGCAGTTCTTGGTCAACAAAAGGCTCACAAG
GCTCTTTTAGATTCATCTACTCTACCAAACATAGCCAAAGCTCAAGAAAAATTGGAAAGTCTTCTTGCAACCAAAGATCTTCCAAATAAAATGTTTTTGGGAGAA
ACTTTTTTCACATTTAAAATGGATGCCTCCAAGACTTACATAGAAAACTTGGATGAATACAAAAAGATAGTTTCAGAATTTAAAAGTCTTGGAGACAAATTGGGT
GACAAAAATGAAGCCTATGTTCTATTAAACTCACTATTGGATACCTACAAGGATGTGAAGAATGCTTTGAAAATATGGCAAAGATTCGATTACAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCATTAATAAGATTCAAAATTCAAAATTTTGATGGAAAGGGTGATTTTGGTCTTTGGAAGGCCAAAGTCAAAGCAGTTCTTGGTCAACAAAAGGCTCACAAG
GCTCTTTTAGATTCATCTACTCTACCAAACATAGCCAAAGCTCAAGAAAAATTGGAAAGTCTTCTTGCAACCAAAGATCTTCCAAATAAAATGTTTTTGGGAGAA
ACTTTTTTCACATTTAAAATGGATGCCTCCAAGACTTACATAGAAAACTTGGATGAATACAAAAAGATAGTTTCAGAATTTAAAAGTCTTGGAGACAAATTGGGT
GACAAAAATGAAGCCTATGTTCTATTAAACTCACTATTGGATACCTACAAGGATGTGAAGAATGCTTTGAAAATATGGCAAAGATTCGATTACAACTGA
Protein sequenceShow/hide protein sequence
MSLIRFKIQNFDGKGDFGLWKAKVKAVLGQQKAHKALLDSSTLPNIAKAQEKLESLLATKDLPNKMFLGETFFTFKMDASKTYIENLDEYKKIVSEFKSLGDKLG
DKNEAYVLLNSLLDTYKDVKNALKIWQRFDYN