CuGenDBv2

ClCG05G011860 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G011860
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Dimer_Tnp_hAT domain-containing protein
Genome location: CG_Chr05:14731670..14739470
RNA-Seq Expression: ClCG05G011860
Synteny: ClCG05G011860
Gene Ontology terms: GO:0046983 - protein dimerization activity (molecular function)
InterPro domains: IPR008906 - HAT, C-terminal dimerisation domain
                  IPR012337 - Ribonuclease H-like superfamily
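
The genome location field above uses a `seqid:start..end` notation. As a minimal sketch, the string can be split into its parts and the gene span computed as follows. The names `Region` and `parse_region` are illustrative and not part of any database API, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates, so both endpoints count.
        return self.end - self.start + 1


def parse_region(text: str) -> Region:
    """Parse a 'seqid:start..end' location string (hypothetical helper)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Region(seqid, start, end)


region = parse_region("CG_Chr05:14731670..14739470")
print(region.seqid, region.start, region.end, region.length)
# CG_Chr05 14731670 14739470 7801
```

Under that assumption the locus spans 7,801 bp on CG_Chr05.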


Homology
GenBank top hits (with e-value, % identity, and alignment)
KAA8538184.1 hypothetical protein F0562_027792 [Nyssa sinensis] (e-value 4.8e-20, 45.6% identity)
Query:  SNMDFDDFNDYDLFV-SNFNKMNHKKQFDAYLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVASDN---------KLHPKTIEVLMCA
        SN+  D+ +DYDL+V S+ + +N K + D YL+E VLPR+ DF+IL+WWK NG KYP LQ I +DIL IP+S+VAS++         +LHP  +E LMC+
Subjt:  SNMDFDDFNDYDLFV-SNFNKMNHKKQFDAYLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVASDN---------KLHPKTIEVLMCA

Query:  QKWITTKVLTSTIKHLATILEDADN
        Q W+  ++  S  +   T + D DN
Subjt:  QKWITTKVLTSTIKHLATILEDADN

XP_004143736.1 L10-interacting MYB domain-containing protein [Cucumis sativus] (e-value 1.6e-20, 56.9% identity)
Query:  IGEGAWA--PSQGFVSKSVSARVEGFTDDTIGETEVNENEFNNVESTSNDVDTSETYSQGREKKRKKIGSGVGS-SKMSDKLDRLCDIIEYR-RDLPRCN
        I  GAWA  PSQ  + K      E + +DTIG+T+ NENEFNNVES +N+V+TS T++Q  EK+RK++ S VGS S++ D LDRL D+IEYR RDLP C+
Subjt:  IGEGAWA--PSQGFVSKSVSARVEGFTDDTIGETEVNENEFNNVESTSNDVDTSETYSQGREKKRKKIGSGVGS-SKMSDKLDRLCDIIEYR-RDLPRCN

Query:  ILEVMESLVSLLGIVE
         LEVME+L SL GI+E
Subjt:  ILEVMESLVSLLGIVE

XP_008436966.1 PREDICTED: L10-interacting MYB domain-containing protein-like [Cucumis melo] (e-value 1.3e-20, 57.76% identity)
Query:  IGEGAWA--PSQGFVSKSVSARVEGFTDDTIGETEVNENEFNNVESTSNDVDTSETYSQGREKKRKKIGSGVGS-SKMSDKLDRLCDIIEYR-RDLPRCN
        I  GAWA  PSQ  + K      + + DDTIG+T+ NENEFNNVES +N+V+TS TY+Q  EK+RK++ S VGS S++ D LDRL D IEYR RDLP C+
Subjt:  IGEGAWA--PSQGFVSKSVSARVEGFTDDTIGETEVNENEFNNVESTSNDVDTSETYSQGREKKRKKIGSGVGS-SKMSDKLDRLCDIIEYR-RDLPRCN

Query:  ILEVMESLVSLLGIVE
         LEVME+L SL GI+E
Subjt:  ILEVMESLVSLLGIVE

XP_022159819.1 L10-interacting MYB domain-containing protein-like [Momordica charantia] (e-value 1.1e-21, 57.14% identity)
Query:  DGIIGEGAWA--PSQGFVSKSVSARVEGFTDDTIGETEVNENEFNNVESTSNDVDTSET-YSQGREKKRKKIGSGVGSSKMSDKLDRLCDIIEY-RRDLP
        D  +  GAWA  PSQ  +  + S +VE + DDT+G+T     EFNNVES +N+V+TS T Y+QG EKKRKK+ S VGSSK+ D LDRLCD IEY RRDLP
Subjt:  DGIIGEGAWA--PSQGFVSKSVSARVEGFTDDTIGETEVNENEFNNVESTSNDVDTSET-YSQGREKKRKKIGSGVGSSKMSDKLDRLCDIIEY-RRDLP

Query:  RCNILEVMESLVSLLGIVE
         C+ LEVM++L  L GI+E
Subjt:  RCNILEVMESLVSLLGIVE

XP_038874429.1 L10-interacting MYB domain-containing protein-like [Benincasa hispida] (e-value 4.1e-19, 53.78% identity)
Query:  DGIIGEGAWA--PSQGFVSKSVSARVEGFTDDTIGETEVNENEFNNVESTSNDVDTSETYSQGREKKRKKIGSGVGS-SKMSDKLDRLCDIIEY-RRDLP
        D  +  GAWA  PSQ  + K+ S  +E + DDT G+T     +FNNVES +N+V+TS TY+QG EK+RKK+ S VGS  ++   LDRLCD+IEY RRDLP
Subjt:  DGIIGEGAWA--PSQGFVSKSVSARVEGFTDDTIGETEVNENEFNNVESTSNDVDTSETYSQGREKKRKKIGSGVGS-SKMSDKLDRLCDIIEY-RRDLP

Query:  RCNILEVMESLVSLLGIVE
         C+ LEVME+L  L GI+E
Subjt:  RCNILEVMESLVSLLGIVE

TrEMBL top hits (with e-value, % identity, and alignment)
A0A0A0KKV6 Myb_DNA-bind_3 domain-containing protein (e-value 8.0e-21, 56.9% identity)
Query:  IGEGAWA--PSQGFVSKSVSARVEGFTDDTIGETEVNENEFNNVESTSNDVDTSETYSQGREKKRKKIGSGVGS-SKMSDKLDRLCDIIEYR-RDLPRCN
        I  GAWA  PSQ  + K      E + +DTIG+T+ NENEFNNVES +N+V+TS T++Q  EK+RK++ S VGS S++ D LDRL D+IEYR RDLP C+
Subjt:  IGEGAWA--PSQGFVSKSVSARVEGFTDDTIGETEVNENEFNNVESTSNDVDTSETYSQGREKKRKKIGSGVGS-SKMSDKLDRLCDIIEYR-RDLPRCN

Query:  ILEVMESLVSLLGIVE
         LEVME+L SL GI+E
Subjt:  ILEVMESLVSLLGIVE

A0A1S3AT15 L10-interacting MYB domain-containing protein-like (e-value 6.1e-21, 57.76% identity)
Query:  IGEGAWA--PSQGFVSKSVSARVEGFTDDTIGETEVNENEFNNVESTSNDVDTSETYSQGREKKRKKIGSGVGS-SKMSDKLDRLCDIIEYR-RDLPRCN
        I  GAWA  PSQ  + K      + + DDTIG+T+ NENEFNNVES +N+V+TS TY+Q  EK+RK++ S VGS S++ D LDRL D IEYR RDLP C+
Subjt:  IGEGAWA--PSQGFVSKSVSARVEGFTDDTIGETEVNENEFNNVESTSNDVDTSETYSQGREKKRKKIGSGVGS-SKMSDKLDRLCDIIEYR-RDLPRCN

Query:  ILEVMESLVSLLGIVE
         LEVME+L SL GI+E
Subjt:  ILEVMESLVSLLGIVE

A0A5A7UN70 L10-interacting MYB domain-containing protein-like (e-value 6.1e-21, 57.76% identity)
Query:  IGEGAWA--PSQGFVSKSVSARVEGFTDDTIGETEVNENEFNNVESTSNDVDTSETYSQGREKKRKKIGSGVGS-SKMSDKLDRLCDIIEYR-RDLPRCN
        I  GAWA  PSQ  + K      + + DDTIG+T+ NENEFNNVES +N+V+TS TY+Q  EK+RK++ S VGS S++ D LDRL D IEYR RDLP C+
Subjt:  IGEGAWA--PSQGFVSKSVSARVEGFTDDTIGETEVNENEFNNVESTSNDVDTSETYSQGREKKRKKIGSGVGS-SKMSDKLDRLCDIIEYR-RDLPRCN

Query:  ILEVMESLVSLLGIVE
         LEVME+L SL GI+E
Subjt:  ILEVMESLVSLLGIVE

A0A5J5B6I7 Dimer_Tnp_hAT domain-containing protein (e-value 2.3e-20, 45.6% identity)
Query:  SNMDFDDFNDYDLFV-SNFNKMNHKKQFDAYLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVASDN---------KLHPKTIEVLMCA
        SN+  D+ +DYDL+V S+ + +N K + D YL+E VLPR+ DF+IL+WWK NG KYP LQ I +DIL IP+S+VAS++         +LHP  +E LMC+
Subjt:  SNMDFDDFNDYDLFV-SNFNKMNHKKQFDAYLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVASDN---------KLHPKTIEVLMCA

Query:  QKWITTKVLTSTIKHLATILEDADN
        Q W+  ++  S  +   T + D DN
Subjt:  QKWITTKVLTSTIKHLATILEDADN

A0A6J1DZU8 L10-interacting MYB domain-containing protein-like (e-value 5.5e-22, 57.14% identity)
Query:  DGIIGEGAWA--PSQGFVSKSVSARVEGFTDDTIGETEVNENEFNNVESTSNDVDTSET-YSQGREKKRKKIGSGVGSSKMSDKLDRLCDIIEY-RRDLP
        D  +  GAWA  PSQ  +  + S +VE + DDT+G+T     EFNNVES +N+V+TS T Y+QG EKKRKK+ S VGSSK+ D LDRLCD IEY RRDLP
Subjt:  DGIIGEGAWA--PSQGFVSKSVSARVEGFTDDTIGETEVNENEFNNVESTSNDVDTSET-YSQGREKKRKKIGSGVGSSKMSDKLDRLCDIIEY-RRDLP

Query:  RCNILEVMESLVSLLGIVE
         C+ LEVM++L  L GI+E
Subjt:  RCNILEVMESLVSLLGIVE

SwissProt top hits (with e-value, % identity, and alignment)
B9FJG3 Zinc finger BED domain-containing protein RICESLEEPER 1 (e-value 4.5e-13, 34.13% identity)
Query:  DDFNDYDLFVSNF-NKMNHKKQFDAYLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVASDNKL-----------------HPKTIEVL
        D   D+D+++S        K + + YLDE + PR  +F+IL+WWKLN +KYP L ++ RDIL IP+S V+S N +                  P+ +E L
Subjt:  DDFNDYDLFVSNF-NKMNHKKQFDAYLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVASDNKL-----------------HPKTIEVL

Query:  MCAQKWITTKVLTSTIKHLATILEDA
        +CA+ W+     T      A +  DA
Subjt:  MCAQKWITTKVLTSTIKHLATILEDA

Q0JMB2 Zinc finger BED domain-containing protein RICESLEEPER 4 (e-value 1.6e-10, 33.65% identity)
Query:  DYDLFVSNFNKMNH--KKQFDAYLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVA-------SDN----------KLHPKTIEVLMCA
        D+D+++S    M    K + + YL+E +  R+ DF++L WW+ N +KYP L  + RD+L IP+S+V         DN           L P+ +E L+CA
Subjt:  DYDLFVSNFNKMNH--KKQFDAYLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVA-------SDN----------KLHPKTIEVLMCA

Query:  QKWI
        + W+
Subjt:  QKWI

Q6AVI0 Zinc finger BED domain-containing protein RICESLEEPER 2 (e-value 1.3e-12, 34.65% identity)
Query:  DDFNDYDLFVSNF-NKMNHKKQFDAYLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVASDN-----------------KLHPKTIEVL
        D   D+D+++S        K + + YLDE + PR  +F+IL+WWKLN +K+P L  + RDIL IP+S V+S N                  L P+ +E L
Subjt:  DDFNDYDLFVSNF-NKMNHKKQFDAYLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVASDN-----------------KLHPKTIEVL

Query:  MCAQKWITTKVLTSTIKHLATILEDAD
        +CA+ W+  + L +T +  +T L   D
Subjt:  MCAQKWITTKVLTSTIKHLATILEDAD

Q75HY5 Zinc finger BED domain-containing protein RICESLEEPER 3 (e-value 5.0e-12, 32.26% identity)
Query:  DSRSKDLDGASSNMDFDDFNDYDLFVSNF-NKMNHKKQFDAYLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVASD------------
        D+   D   A +  + D+  D+D+++S          + + YL+E ++PR  DF IL WWKLN +K+P L ++ RD+L IP+S V+S             
Subjt:  DSRSKDLDGASSNMDFDDFNDYDLFVSNF-NKMNHKKQFDAYLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVASD------------

Query:  ------NKLHPKTIEVLMCAQKWI
              + L P+T+E L CA+ W+
Subjt:  ------NKLHPKTIEVLMCAQKWI

Q9M2N5 Zinc finger BED domain-containing protein DAYSLEEPER (e-value 1.7e-15, 38.83% identity)
Query:  DDFNDYDLFVSNFNKMNHKKQFDAYLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVASD--------------NKLHPKTIEVLMCAQ
        D  +D+D ++      N K + D YLDE +LPR  +F++L WWK N +KYP L ++ RDIL IP+S+ A D                L P+T+E L+CA+
Subjt:  DDFNDYDLFVSNFNKMNHKKQFDAYLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVASD--------------NKLHPKTIEVLMCAQ

Query:  KWI
        +W+
Subjt:  KWI

Arabidopsis top hits (with e-value, % identity, and alignment)
AT1G18560.1 BED zinc finger; hAT family dimerisation domain (e-value 6.7e-04, 29.11% identity)
Query:  YLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVASDNKLHPKTIEV--------------LMCAQKWI
        YL E ++P   D  +L WWK+N  +YP L  + RD L +  +S A +     K  E+              ++C + WI
Subjt:  YLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVASDNKLHPKTIEV--------------LMCAQKWI

AT3G42170.1 BED zinc finger; hAT family dimerisation domain (e-value 1.2e-16, 38.83% identity)
Query:  DDFNDYDLFVSNFNKMNHKKQFDAYLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVASD--------------NKLHPKTIEVLMCAQ
        D  +D+D ++      N K + D YLDE +LPR  +F++L WWK N +KYP L ++ RDIL IP+S+ A D                L P+T+E L+CA+
Subjt:  DDFNDYDLFVSNFNKMNHKKQFDAYLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVASD--------------NKLHPKTIEVLMCAQ

Query:  KWI
        +W+
Subjt:  KWI
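
The % identity values listed with the hits above can be related to the alignment blocks. In BLAST-style output the unlabelled middle line carries a letter where query and subject residues are identical, a `+` for a conservative substitution, and a space otherwise. The sketch below assumes that convention applies to this page and that identity is reported as identical columns divided by total alignment columns (gaps included). The function name `percent_identity` is hypothetical.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity of one alignment block.

    ASSUMPTIONS: `query` is the aligned query string (gaps as '-'),
    `midline` is the BLAST-style match line, and identity is
    identical columns / total alignment columns.
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment")
    # Only letters on the match line mark identical residues;
    # '+' (similar) and ' ' (mismatch or gap) are not counted.
    identities = sum(ch.isalpha() for ch in midline)
    return 100.0 * identities / columns


# The AT1G18560.1 alignment from the Arabidopsis hits; the run of
# 14 gap columns in the query is written as "-" * 14 for readability.
query = (
    "YLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVASDNKLHPKTIEV"
    + "-" * 14
    + "LMCAQKWI"
)
midline = (
    "YL E ++P   D  +L WWK+N  +YP L  + RD L +  +S A +     K  E+"
    "              ++C + WI"
)
print(round(percent_identity(query, midline), 2))  # 29.11
```

With 23 identical columns out of 79, this gives 29.11, which agrees with the value listed for that hit.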


Sequences
CDS sequence
ATGGCCCCATCTCAAGGTTTTGTATCTAAGTCTGTAAGTGCAATGGATGGAATAATTGGAGAAGGAGCATGGGCCCCATCTCAAGGTTTTGTATCTAAGTCTGTAAGTGC
AAGAGTTGAAGGTTTTACTGATGATACAATAGGAGAGACAGAAGTCAATGAGAATGAGTTTAACAATGTTGAAAGTACATCTAATGATGTTGATACCAGTGAAACTTACT
CTCAAGGGAGAGAGAAAAAGAGGAAGAAAATTGGTTCAGGAGTTGGATCTTCAAAGATGTCGGACAAATTAGATCGTCTATGTGATATAATCGAATACAGAAGAGATTTG
CCAAGATGCAATATTCTTGAAGTAATGGAGTCTTTAGTGAGTTTGCTTGGGATTGTGGAAGAGAATCCCTATTTGACCAGAGATGGGGGTCAAATCGGAGAGATTCCCCG
AGTGGCGTCTAGGGTCGGGGTGGGGATAGGAGGGTATCCTCGCCCCGTCCCCGACCTCGACCCTGCCTTCGCTCTCGCCCTCTCGCTGCCTCTGTCTCGTCTCTCTCCTG
CCCTCTCGTTGTCTTCAGATCTCGTCTTCTCGCTGGCTCTGTCTCTCGATCGGCCTCTATCTCGCTCTTTCTCTGTCTCACTCTTGGGCTGTCGGGGATATATTTACTGC
CCACAACCTACACCGACCCGTCCCCGACCCCATCCACATCGCTGTCGCGTCTCGCGAGTTGTTGTCGTGCCGTCGTCGGATTGGACTCGAGGTGAAGCTGCAAAAAGAGC
ATTAGCTAGGATGATAATCAAACATGAATATCCTCTGTCAATAGTTGAACATAAAGGTTTTAGAGAATTTGATTTGGAGAAAAAATATCAGTTAAAACGTGAAAATGAAG
ATGATGATTCTAGATCTAAAGACTTAGATGGAGCATCAAGTAATATGGATTTTGATGATTTTAATGATTATGACTTGTTTGTATCAAACTTCAATAAGATGAATCATAAG
AAACAATTTGATGCATATTTAGATGAACATGTCTTACCTCGATCATTTGATTTTAACATACTGAGTTGGTGGAAATTGAATGGGGTCAAGTATCCCATATTACAAGAAAT
TACTAGAGACATATTATTCATCCCGTTATCTAGTGTTGCATCAGATAACAAACTTCATCCGAAGACAATAGAGGTTTTGATGTGCGCTCAAAAATGGATTACGACTAAAG
TTTTGACCAGTACGATTAAGCATTTAGCAACTATTTTAGAAGATGCAGACAACTCTAATGGCTTGGTGACTGAGGAATGGGTACTTGCGGGGAACAATTTTCCCGACGGG
GGTGGGGATGGGGTAAATTCCCCCACTGGGGGAACCCCATCCGCCTCACCCCGCCCCCTGAAGATTGTTGTTCCATCAATGACACTTCATAATTTTATAAGAACTCATTT
TACAACTGACTTTGAGTTCAAGCCTTATGACGATAATGACGATTTGCTTCCCTCAAGTGATGACAGAAATGAGTTAAGTAATACTGAAGATAGAGCAAGTGAAGATTTGC
TCTGCTCAAGCGATGACAGAAATGAGTTAAGTGATATTGAAGATAGTGAAACTTCTTGTCATGGAAGGGAGATAGAAGATGAGCAGGAGAAAATTGCAAAACTTCTTATG
TATGGCATGTTGTTCTTTGCTATGCTGGAGGTCGTAGATCACGTAGAGTATAAACTAGCACTATGTCAATATGCTTTGTGCTCCTTGCTACGCTAG
mRNA sequence
ATGGCCCCATCTCAAGGTTTTGTATCTAAGTCTGTAAGTGCAATGGATGGAATAATTGGAGAAGGAGCATGGGCCCCATCTCAAGGTTTTGTATCTAAGTCTGTAAGTGC
AAGAGTTGAAGGTTTTACTGATGATACAATAGGAGAGACAGAAGTCAATGAGAATGAGTTTAACAATGTTGAAAGTACATCTAATGATGTTGATACCAGTGAAACTTACT
CTCAAGGGAGAGAGAAAAAGAGGAAGAAAATTGGTTCAGGAGTTGGATCTTCAAAGATGTCGGACAAATTAGATCGTCTATGTGATATAATCGAATACAGAAGAGATTTG
CCAAGATGCAATATTCTTGAAGTAATGGAGTCTTTAGTGAGTTTGCTTGGGATTGTGGAAGAGAATCCCTATTTGACCAGAGATGGGGGTCAAATCGGAGAGATTCCCCG
AGTGGCGTCTAGGGTCGGGGTGGGGATAGGAGGGTATCCTCGCCCCGTCCCCGACCTCGACCCTGCCTTCGCTCTCGCCCTCTCGCTGCCTCTGTCTCGTCTCTCTCCTG
CCCTCTCGTTGTCTTCAGATCTCGTCTTCTCGCTGGCTCTGTCTCTCGATCGGCCTCTATCTCGCTCTTTCTCTGTCTCACTCTTGGGCTGTCGGGGATATATTTACTGC
CCACAACCTACACCGACCCGTCCCCGACCCCATCCACATCGCTGTCGCGTCTCGCGAGTTGTTGTCGTGCCGTCGTCGGATTGGACTCGAGGTGAAGCTGCAAAAAGAGC
ATTAGCTAGGATGATAATCAAACATGAATATCCTCTGTCAATAGTTGAACATAAAGGTTTTAGAGAATTTGATTTGGAGAAAAAATATCAGTTAAAACGTGAAAATGAAG
ATGATGATTCTAGATCTAAAGACTTAGATGGAGCATCAAGTAATATGGATTTTGATGATTTTAATGATTATGACTTGTTTGTATCAAACTTCAATAAGATGAATCATAAG
AAACAATTTGATGCATATTTAGATGAACATGTCTTACCTCGATCATTTGATTTTAACATACTGAGTTGGTGGAAATTGAATGGGGTCAAGTATCCCATATTACAAGAAAT
TACTAGAGACATATTATTCATCCCGTTATCTAGTGTTGCATCAGATAACAAACTTCATCCGAAGACAATAGAGGTTTTGATGTGCGCTCAAAAATGGATTACGACTAAAG
TTTTGACCAGTACGATTAAGCATTTAGCAACTATTTTAGAAGATGCAGACAACTCTAATGGCTTGGTGACTGAGGAATGGGTACTTGCGGGGAACAATTTTCCCGACGGG
GGTGGGGATGGGGTAAATTCCCCCACTGGGGGAACCCCATCCGCCTCACCCCGCCCCCTGAAGATTGTTGTTCCATCAATGACACTTCATAATTTTATAAGAACTCATTT
TACAACTGACTTTGAGTTCAAGCCTTATGACGATAATGACGATTTGCTTCCCTCAAGTGATGACAGAAATGAGTTAAGTAATACTGAAGATAGAGCAAGTGAAGATTTGC
TCTGCTCAAGCGATGACAGAAATGAGTTAAGTGATATTGAAGATAGTGAAACTTCTTGTCATGGAAGGGAGATAGAAGATGAGCAGGAGAAAATTGCAAAACTTCTTATG
TATGGCATGTTGTTCTTTGCTATGCTGGAGGTCGTAGATCACGTAGAGTATAAACTAGCACTATGTCAATATGCTTTGTGCTCCTTGCTACGCTAG
Protein sequence
MAPSQGFVSKSVSAMDGIIGEGAWAPSQGFVSKSVSARVEGFTDDTIGETEVNENEFNNVESTSNDVDTSETYSQGREKKRKKIGSGVGSSKMSDKLDRLCDIIEYRRDL
PRCNILEVMESLVSLLGIVEENPYLTRDGGQIGEIPRVASRVGVGIGGYPRPVPDLDPAFALALSLPLSRLSPALSLSSDLVFSLALSLDRPLSRSFSVSLLGCRGYIYC
PQPTPTRPRPHPHRCRVSRVVVVPSSDWTRGEAAKRALARMIIKHEYPLSIVEHKGFREFDLEKKYQLKRENEDDDSRSKDLDGASSNMDFDDFNDYDLFVSNFNKMNHK
KQFDAYLDEHVLPRSFDFNILSWWKLNGVKYPILQEITRDILFIPLSSVASDNKLHPKTIEVLMCAQKWITTKVLTSTIKHLATILEDADNSNGLVTEEWVLAGNNFPDG
GGDGVNSPTGGTPSASPRPLKIVVPSMTLHNFIRTHFTTDFEFKPYDDNDDLLPSSDDRNELSNTEDRASEDLLCSSDDRNELSDIEDSETSCHGREIEDEQEKIAKLLM
YGMLFFAMLEVVDHVEYKLALCQYALCSLLR
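
The relationship between the CDS and protein sequences above can be checked by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard genetic code (the page does not name a translation table) and stops at the first in-frame stop codon. `CODON_TABLE` and `translate` are illustrative names, not functions from any particular library.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (ASSUMPTION: the record uses the standard table).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and other whitespace, normalise case.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 51 nucleotides of the CDS listed above.
print(translate("ATGGCCCCATCTCAAGGTTTTGTATCTAAGTCTGTAAGTGCAATGGATGGA"))
# MAPSQGFVSKSVSAMDG
```

The output matches the first 17 residues of the listed protein sequence. Passing the full CDS, with or without its line breaks, allows the whole translation to be compared in the same way.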