; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G11775 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G11775
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationClcChr02:22796721..22798318
RNA-Seq ExpressionClc02G11775
SyntenyClc02G11775
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_027066460.1 uncharacterized protein LOC113692270 [Coffea arabica]5.1e-2753.52Show/hide
Query:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDY-----------SNPQISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA
        +KR+FLEKFFPASR A IRKEI G+ Q  GETL++Y            + QI + LLIQYFYEG  P DRS +D A GGALVNKTP EA+ LIS I EN+
Subjt:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDY-----------SNPQISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA

Query:  QQFGTTAPKHATSTQEV--SEFRAELADLTTLVKQTSFAKAQ
        QQFGT +        EV  ++   +L +LTTLV+Q +  KAQ
Subjt:  QQFGTTAPKHATSTQEV--SEFRAELADLTTLVKQTSFAKAQ

XP_027166445.1 uncharacterized protein LOC113766451 [Coffea eugenioides]5.1e-2753.52Show/hide
Query:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDY-----------SNPQISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA
        +KR+FLEKFFPASR A IRKEI G+ Q  GETL++Y            + QI + LLIQYFYEG  P DRS +D A GGALVNKTP EA+ LIS I EN+
Subjt:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDY-----------SNPQISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA

Query:  QQFGTTAPKHATSTQEV--SEFRAELADLTTLVKQTSFAKAQ
        QQFGT +        EV  ++   +L +LTTLV+Q +  KAQ
Subjt:  QQFGTTAPKHATSTQEV--SEFRAELADLTTLVKQTSFAKAQ

XP_042410030.1 uncharacterized protein LOC121999411 [Zingiber officinale]1.7e-2755.71Show/hide
Query:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDY-----------SNPQISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA
        MKR FLEKFFPASR A IRK I GI Q  GETL+DY              QIS+ LL+QYFYEG +P DRS ID A GGALVNKTP +AR LIS + EN+
Subjt:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDY-----------SNPQISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA

Query:  QQFGTTA-------PKHATSTQEVSEFRAELADLTTLVKQ
        QQFG+ A         H  ST++  E R+ L +LT+LVKQ
Subjt:  QQFGTTA-------PKHATSTQEVSEFRAELADLTTLVKQ

XP_042456971.1 uncharacterized protein LOC122041379 [Zingiber officinale]8.7e-2753.38Show/hide
Query:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDY-----------SNPQISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA
        MK+ FLEKFFPASR A IRK I GI Q  GETL+DY              QIS  LL+QYFYEGF+P DRS ID A GGALVNKTP +AR LIS + EN+
Subjt:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDY-----------SNPQISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA

Query:  QQFGTTA--PKHATSTQEVS----EFRAELADLTTLVKQTSFAKAQES
        QQFG+ A   +     Q VS    E R+ L +LT+LVKQ +   A ++
Subjt:  QQFGTTA--PKHATSTQEVS----EFRAELADLTTLVKQTSFAKAQES

XP_042472344.1 uncharacterized protein LOC122055010 [Zingiber officinale]8.7e-2755Show/hide
Query:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDY-----------SNPQISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA
        MKR FLEK FPASR A IRK I GI Q  GETL+DY              QIS+ LL+QYFYEG +P DRS ID A GGALVNKTP +AR LIS + EN+
Subjt:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDY-----------SNPQISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA

Query:  QQFGTTA-------PKHATSTQEVSEFRAELADLTTLVKQ
        QQFG+ A         H  ST++  E R+ L +LT+LVKQ
Subjt:  QQFGTTA-------PKHATSTQEVSEFRAELADLTTLVKQ

TrEMBL top hitse value%identityAlignment
A0A5A7USL5 Retrotrans_gag domain-containing protein3.0e-2533.56Show/hide
Query:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDY-----------SNPQISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA
        +K+KFLEKFFPASR   IRKEI+GI Q  GE+L  Y            +  I +  LIQYFY G +  DR+T+D A GGAL +KTPTEAR LIS ++EN+
Subjt:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDY-----------SNPQISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA

Query:  QQFGTTAPKHATS-TQEVSEFRAELADLTTLVKQTSFAKAQESQKHEAEVNYAGNQQGGWTEQGQKPGRIYHADQAEPGAVPLVDVVSQLDTHLLNVRNI
        Q FG  A +   S T+EVSE ++++ ++TTL+  TSF +    +  +  V                 G + H +   P  +  V++V + D H  N+ +I
Subjt:  QQFGTTAPKHATS-TQEVSEFRAELADLTTLVKQTSFAKAQESQKHEAEVNYAGNQQGGWTEQGQKPGRIYHADQAEPGAVPLVDVVSQLDTHLLNVRNI

Query:  SDISVVSCIESFSTDVLEPMVEIVDAVG--------------AEVSEEKTTNSLRRRGVSFDPPLN-LNSYMHVASFPSKLVVAKRSVVKEE
              + + SF  ++ + M ++  A+               A    ++T + L  +  +   PLN    Y+    FPS+L   K+  +KEE
Subjt:  SDISVVSCIESFSTDVLEPMVEIVDAVG--------------AEVSEEKTTNSLRRRGVSFDPPLN-LNSYMHVASFPSKLVVAKRSVVKEE

A0A6J1DRS5 uncharacterized protein LOC1110232485.5e-2754.14Show/hide
Query:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDY-----------SNPQISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA
        MKR FL+KFFPASR A IRKEI+GI Q +GETL++Y            + QIS+  LIQYFYEG +P DRS ID A G ALV+KTP  A+ LI  + EN+
Subjt:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDY-----------SNPQISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA

Query:  QQFGTTAPKHATSTQEVSEFRAELADLTTLVKQ
        QQFG       T   EVSE +++++DLTTLV+Q
Subjt:  QQFGTTAPKHATSTQEVSEFRAELADLTTLVKQ

A0A6P6SJY7 uncharacterized protein LOC1136922702.5e-2753.52Show/hide
Query:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDY-----------SNPQISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA
        +KR+FLEKFFPASR A IRKEI G+ Q  GETL++Y            + QI + LLIQYFYEG  P DRS +D A GGALVNKTP EA+ LIS I EN+
Subjt:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDY-----------SNPQISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA

Query:  QQFGTTAPKHATSTQEV--SEFRAELADLTTLVKQTSFAKAQ
        QQFGT +        EV  ++   +L +LTTLV+Q +  KAQ
Subjt:  QQFGTTAPKHATSTQEV--SEFRAELADLTTLVKQTSFAKAQ

A0A6P6T124 uncharacterized protein LOC1136968401.3e-2549.66Show/hide
Query:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDYSNP-----------QISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA
        +K+KFL+K+FPASR A +RKEI GI Q  GE+L++Y  P           QIS  LLIQYFYEG +  DRS IDTA GGALVNKTP EAR LI  + EN+
Subjt:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDYSNP-----------QISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA

Query:  QQFGT--TAPKHATSTQEVSEFRAELADLTTLVKQTSFAKAQESQ
        QQFGT    P    +  E S  + +L  LT+ V+Q +   A +++
Subjt:  QQFGT--TAPKHATSTQEVSEFRAELADLTTLVKQTSFAKAQESQ

A0A6P6WZ36 uncharacterized protein LOC1137375807.1e-2752.82Show/hide
Query:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDY-----------SNPQISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA
        +KRKFLEKFFP SR A IRKEI G+ Q  GETL++Y            + QIS+ LLIQYFYE   P DRS +D A GGALVNKTP EA+ LIS I EN+
Subjt:  MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDY-----------SNPQISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENA

Query:  QQFGTTAPKHATSTQEV--SEFRAELADLTTLVKQTSFAKAQ
        QQFGT +        EV  ++   +L +LTTL++Q +  KAQ
Subjt:  QQFGTTAPKHATSTQEV--SEFRAELADLTTLVKQTSFAKAQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAGGAAATTCTTGGAAAAGTTCTTTCCAGCTTCAAGAGTAGCGAGGATTAGGAAGGAAATTTTTGGCATCATGCAGACAGCTGGTGAGACCTTGCATGACTACTC
CAACCCCCAGATCTCAAATGCTCTGCTCATCCAGTACTTCTACGAAGGGTTCGTCCCTAATGACAGAAGTACTATTGATACAGCTTATGGGGGTGCACTAGTGAACAAGA
CTCCAACTGAGGCCAGAAATCTCATATCCACTATAGTAGAGAATGCCCAGCAATTTGGGACGACAGCTCCCAAGCATGCCACATCGACACAAGAGGTAAGTGAATTTAGA
GCTGAGCTTGCCGACCTAACTACTCTTGTGAAACAAACTTCCTTTGCCAAGGCACAAGAGTCCCAAAAGCATGAAGCTGAAGTCAATTATGCAGGTAATCAGCAAGGTGG
ATGGACAGAACAAGGTCAAAAACCTGGTAGAATCTACCATGCAGATCAAGCAGAACCAGGAGCTGTTCCACTGGTTGATGTTGTCAGTCAATTGGACACCCACTTGCTCA
ATGTGCGTAACATCAGTGACATCTCTGTTGTGAGTTGCATAGAGAGCTTCTCGACGGATGTCCTCGAGCCAATGGTTGAAATCGTTGATGCTGTGGGTGCTGAGGTTTCT
GAAGAAAAGACGACCAATTCTCTAAGAAGACGCGGAGTAAGTTTTGATCCACCGCTTAACTTAAATTCTTACATGCATGTTGCTTCTTTCCCCAGCAAGTTAGTTGTTGC
TAAGAGAAGTGTGGTCAAGGAGGAGGTGACTTCTGTACATTCTTCTGGTGATGTAAGCTTGACTGGAAAGGGAAGGAAGGTTTGGATTGAGGTCCCTCCTGAGGAAAGGA
AGAGGAGGAGGTCAAACAAGAGGCAACTATTTCAGGGTAGTGGTCAACCTCCTAAGTTGGGGGTGTTGGATGGTGGCAAGGCATTCAACGCTAAACATGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAGGAAATTCTTGGAAAAGTTCTTTCCAGCTTCAAGAGTAGCGAGGATTAGGAAGGAAATTTTTGGCATCATGCAGACAGCTGGTGAGACCTTGCATGACTACTC
CAACCCCCAGATCTCAAATGCTCTGCTCATCCAGTACTTCTACGAAGGGTTCGTCCCTAATGACAGAAGTACTATTGATACAGCTTATGGGGGTGCACTAGTGAACAAGA
CTCCAACTGAGGCCAGAAATCTCATATCCACTATAGTAGAGAATGCCCAGCAATTTGGGACGACAGCTCCCAAGCATGCCACATCGACACAAGAGGTAAGTGAATTTAGA
GCTGAGCTTGCCGACCTAACTACTCTTGTGAAACAAACTTCCTTTGCCAAGGCACAAGAGTCCCAAAAGCATGAAGCTGAAGTCAATTATGCAGGTAATCAGCAAGGTGG
ATGGACAGAACAAGGTCAAAAACCTGGTAGAATCTACCATGCAGATCAAGCAGAACCAGGAGCTGTTCCACTGGTTGATGTTGTCAGTCAATTGGACACCCACTTGCTCA
ATGTGCGTAACATCAGTGACATCTCTGTTGTGAGTTGCATAGAGAGCTTCTCGACGGATGTCCTCGAGCCAATGGTTGAAATCGTTGATGCTGTGGGTGCTGAGGTTTCT
GAAGAAAAGACGACCAATTCTCTAAGAAGACGCGGAGTAAGTTTTGATCCACCGCTTAACTTAAATTCTTACATGCATGTTGCTTCTTTCCCCAGCAAGTTAGTTGTTGC
TAAGAGAAGTGTGGTCAAGGAGGAGGTGACTTCTGTACATTCTTCTGGTGATGTAAGCTTGACTGGAAAGGGAAGGAAGGTTTGGATTGAGGTCCCTCCTGAGGAAAGGA
AGAGGAGGAGGTCAAACAAGAGGCAACTATTTCAGGGTAGTGGTCAACCTCCTAAGTTGGGGGTGTTGGATGGTGGCAAGGCATTCAACGCTAAACATGGCTGA
Protein sequenceShow/hide protein sequence
MKRKFLEKFFPASRVARIRKEIFGIMQTAGETLHDYSNPQISNALLIQYFYEGFVPNDRSTIDTAYGGALVNKTPTEARNLISTIVENAQQFGTTAPKHATSTQEVSEFR
AELADLTTLVKQTSFAKAQESQKHEAEVNYAGNQQGGWTEQGQKPGRIYHADQAEPGAVPLVDVVSQLDTHLLNVRNISDISVVSCIESFSTDVLEPMVEIVDAVGAEVS
EEKTTNSLRRRGVSFDPPLNLNSYMHVASFPSKLVVAKRSVVKEEVTSVHSSGDVSLTGKGRKVWIEVPPEERKRRRSNKRQLFQGSGQPPKLGVLDGGKAFNAKHG