CuGenDBv2

Clc11G13430 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc11G13430
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: ClcChr11:23996134..23997620
RNA-Seq Expression: Clc11G13430
Synteny: Clc11G13430
Gene Ontology terms:
GO:0006952 - defense response (biological process)
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0043167 - ion binding (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
GEV72701.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Tanacetum cinerariifolium] (e value: 1.3e-13; identity: 47.22%)
Query:  FCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKL
        FCD QSAL+L+ NP YH+RTKHID++ +++R++ ++   ++ KI+T HNPTD++T AL ++KFE+  +L+ +
Subjt:  FCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKL

KAG7554353.1 GAG-pre-integrase domain [Arabidopsis suecica] (e value: 4.8e-13; identity: 45.68%)
Query:  FTPRSAKAFCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKLE
        +  +S + FCD  SA+ALS N  +H+RTKHIDVK+H++RE+    ++ + KIST  NP D+ T  LA  KF+    LL++E
Subjt:  FTPRSAKAFCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKLE

KAG8487895.1 hypothetical protein CXB51_018354 [Gossypium anomalum] (e value: 2.5e-14; identity: 52%)
Query:  FCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKLEEV
        +CDGQSA+ L+NN  YH RTKHIDV+FH+VREI  + ++ L KI T  NP D+MTN + + KFE+  +L+ + +V
Subjt:  FCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKLEEV

KAG8498396.1 hypothetical protein CXB51_007029 [Gossypium anomalum] (e value: 3.7e-13; identity: 44.32%)
Query:  FCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKLEEVIVAM----ATLTE
        FCD QSA+ L  +  +H+RTKHIDV++H+VR+I  + ++ + KIST  NP D+MT +L   KFE+  DL+ LE+ ++ +    +TLT+
Subjt:  FCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKLEEVIVAM----ATLTE

XP_024006146.1 uncharacterized protein LOC112082836 [Eutrema salsugineum] (e value: 4.3e-14; identity: 50%)
Query:  FTPRSAKAFCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKL
        F+  S K  CD QSALAL+ N  YH+RTKH+  K+H++R+I  + ++TL KI T  NP D +T AL   KFE  CDLL +
Subjt:  FTPRSAKAFCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKL

TrEMBL top hits (e value, % identity, alignment)
A0A2G3B6L9 NB-ARC domain-containing protein (e value: 4.0e-13; identity: 47.37%)
Query:  FCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKLEEVI
        FCD QSA+ L+ +  +H+RTKHIDV++H+VREI  + ++ + KIST +NP D+MT  L S KF++  DL+ L +++
Subjt:  FCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKLEEVI

A0A6D2KFT3 Uncharacterized protein (e value: 5.2e-13; identity: 44.44%)
Query:  FTPRSAKAFCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKLE
        +  +S + FCD QSA+ALS N  +H+RTKHIDVK+H++R++    ++ + KIST  NP D+ T  LA  KF+   +LL+++
Subjt:  FTPRSAKAFCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKLE

O81903 Putative transposable element (e value: 2.3e-13; identity: 44.58%)
Query:  MWFTPRSAKAFCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKLE
        M   P+ A  +CD QSA+ LS N  +HDRTKH++VKF+++R+I +  E+ + KI T  NP D++T  +   KFE   D+LKL+
Subjt:  MWFTPRSAKAFCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKLE

Q9M1F5 Copia-like polyprotein (e value: 2.3e-13; identity: 45.68%)
Query:  FTPRSAKAFCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKLE
        +  ++ + FCD QSA+ALS N  +H+RTKHIDVKFH++REI    ++ + KIST  NP D+ T  L  +KF+   D L+++
Subjt:  FTPRSAKAFCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKLE

Q9SH77 Putative retroelement pol polyprotein (e value: 2.3e-13; identity: 46.91%)
Query:  FTPRSAKAFCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKLE
        +  +S + FCD QSA+ALS N  +H+RTKHIDVK+H++REI     + + KIST  NP D+ T  LA  KF+   +LL+++
Subjt:  FTPRSAKAFCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKLE

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 3.1e-07; identity: 36.92%)
Query:  KAFCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKF
        K + D Q  ++++NNP+ H R KHID+K+H+ RE  Q   + L  I T +   D+ T  L + +F
Subjt:  KAFCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.4e-12; identity: 50%)
Query:  FCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFE
        +CD QSA+ LS N  YH RTKHIDV++H++RE+     L + KIST  NP D++T  +  +KFE
Subjt:  FCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.5e-04; identity: 32.81%)
Query:  FCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFE
        +CD   A  L  NP +H R KHI + +H++R   Q   L +  +ST     D +T  L+   F+
Subjt:  FCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 4.2e-04; identity: 32.81%)
Query:  FCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFE
        +CD   A  L  NP +H R KHI + +H++R   Q   L +  +ST     D +T  L+   F+
Subjt:  FCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFE

Arabidopsis top hits (e value, % identity, alignment)
No hits found
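
The % identity values listed above appear to be the share of alignment columns whose middle-line character is a letter (an identical residue) rather than `+` or a blank. The Python sketch below illustrates that reading using the middle line of the GEV72701.1 alignment. The function name `percent_identity` is hypothetical, and the interpretation of the middle line is an assumption, not something the record states.

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Assumes BLAST-style midlines: a letter marks an identical residue,
    '+' a conservative substitution, and a space a mismatch or gap.
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Middle line of the GEV72701.1 alignment (72 columns), copied from the record.
MIDLINE = "FCD QSAL+L+ NP YH+RTKHID++ +++R++ ++   ++ KI+T HNPTD++T AL ++KFE+  +L+ +"

print(percent_identity(MIDLINE))
```

Under this assumption the sketch counts 34 identical positions out of 72 columns, which rounds to the 47.22 listed for this hit.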

Sequences
CDS sequence
ATGTGGTTTACACCTCGTTCTGCTAAAGCTTTTTGTGACGGTCAAAGTGCTCTTGCACTTTCTAACAATCCAACTTATCATGATCGAACTAAACATATAGATGTTAAGTT
CCATTATGTTCGAGAAATATTTCAGAAGCGGGAACTTACTTTACACAAAATCAGTACAATGCATAATCCAACAGATGTTATGACAAATGCTTTGGCAAGTGACAAGTTTG
AGTATCCATGTGACTTGCTAAAGTTAGAGGAGGTAATTGTTGCAATGGCGACTTTGACGGAGTACTGGCTAGGGGAATTTGACTCAACGGGAGGCGGCGAAAATGGAGTG
GAAGGAGAAGTCGGCAACAGTTGTTTAGGCGCCATAGTCGCCAGTGATATTAGTTGGTAG
mRNA sequence
ATGTGGTTTACACCTCGTTCTGCTAAAGCTTTTTGTGACGGTCAAAGTGCTCTTGCACTTTCTAACAATCCAACTTATCATGATCGAACTAAACATATAGATGTTAAGTT
CCATTATGTTCGAGAAATATTTCAGAAGCGGGAACTTACTTTACACAAAATCAGTACAATGCATAATCCAACAGATGTTATGACAAATGCTTTGGCAAGTGACAAGTTTG
AGTATCCATGTGACTTGCTAAAGTTAGAGGAGGTAATTGTTGCAATGGCGACTTTGACGGAGTACTGGCTAGGGGAATTTGACTCAACGGGAGGCGGCGAAAATGGAGTG
GAAGGAGAAGTCGGCAACAGTTGTTTAGGCGCCATAGTCGCCAGTGATATTAGTTGGTAG
Protein sequence
MWFTPRSAKAFCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKLEEVIVAMATLTEYWLGEFDSTGGGENGV
EGEVGNSCLGAIVASDISW
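
As a consistency check, the CDS above can be translated and compared with the listed protein sequence. This is a minimal Python sketch that assumes the standard nuclear genetic code (NCBI translation table 1). The names `translate`, `CDS`, and `PROTEIN` are illustrative and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# CDS and protein sequences copied from the record above.
CDS = (
    "ATGTGGTTTACACCTCGTTCTGCTAAAGCTTTTTGTGACGGTCAAAGTGCTCTTGCACTTTCTAACAATCCAACTTATCATGATCGAACTAAACATATAGATGTTAAGTT"
    "CCATTATGTTCGAGAAATATTTCAGAAGCGGGAACTTACTTTACACAAAATCAGTACAATGCATAATCCAACAGATGTTATGACAAATGCTTTGGCAAGTGACAAGTTTG"
    "AGTATCCATGTGACTTGCTAAAGTTAGAGGAGGTAATTGTTGCAATGGCGACTTTGACGGAGTACTGGCTAGGGGAATTTGACTCAACGGGAGGCGGCGAAAATGGAGTG"
    "GAAGGAGAAGTCGGCAACAGTTGTTTAGGCGCCATAGTCGCCAGTGATATTAGTTGGTAG"
)
PROTEIN = (
    "MWFTPRSAKAFCDGQSALALSNNPTYHDRTKHIDVKFHYVREIFQKRELTLHKISTMHNPTDVMTNALASDKFEYPCDLLKLEEVIVAMATLTEYWLGEFDSTGGGENGV"
    "EGEVGNSCLGAIVASDISW"
)

translated = translate(CDS)
print(len(CDS), len(PROTEIN))
print(translated == PROTEIN + "*")
```

For this record the 390-nt CDS translates to the 129-residue protein followed by a single stop codon.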