; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy06g014040 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy06g014040
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionDimer_Tnp_hAT domain-containing protein
Genome locationChr06:25577250..25578449
RNA-Seq ExpressionLcy06g014040
SyntenyLcy06g014040
Gene Ontology termsGO:0046983 - protein dimerization activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFP83826.1 hypothetical protein PHJA_000526100 [Phtheirospermum japonicum]6.4e-4269.83Show/hide
Query:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE
        MGF+YGAMD +KEEIAKNLGGEE SYKE+WNIIDEKWEFQ+H HLH+ATYFLNP FQY+D FS+H E+K GLY C++KLI +E++R KADLQ++ FR+R+
Subjt:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE

Query:  GFFGFRQAIASYKKRS
        G FG    I+S KKRS
Subjt:  GFFGFRQAIASYKKRS

GFP92431.1 hypothetical protein PHJA_001387300 [Phtheirospermum japonicum]7.8e-4067.23Show/hide
Query:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE
        MGF YG MD AKEEIAKNLG E+ SYK++WNIIDEKWEFQ+HRHLH+A YFLNP FQYDD FS+H E+K GLY C++KLI +E++R KADLQ+D FR+R+
Subjt:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE

Query:  GFFGFRQAIASYKKRSSDK
        G F    A +S KKRS  K
Subjt:  GFFGFRQAIASYKKRSSDK

XP_022159386.1 uncharacterized protein LOC111025802 [Momordica charantia]2.2e-5850.36Show/hide
Query:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE
        MGFIYGAMDSAKEEIAKN GGEEASYKEIWNIIDEKWEFQLHRHLH+A YFLNP FQYD+NFS HPEIKLGLYTC DK+I DE ER KADLQ D FRRRE
Subjt:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE

Query:  GFFGFRQAIASYKK--------------------------------------------------------------------------------------
        GFFGF+QAIAS KK                                                                                      
Subjt:  GFFGFRQAIASYKK--------------------------------------------------------------------------------------

Query:  ---------------------RSSDKGKRSMTLDEEEWLDIGSDNENQDETISYEDDSMEDPISDSDNDIIDGL
                             RS DKGK  MTLDE EW+DI SDNEN+D  I Y+DDSMEDPISDSDND++D L
Subjt:  ---------------------RSSDKGKRSMTLDEEEWLDIGSDNENQDETISYEDDSMEDPISDSDNDIIDGL

XP_030494946.1 uncharacterized protein LOC115710731 [Cannabis sativa]3.5e-4068.1Show/hide
Query:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE
        MGF+Y AMD AKE+IA NLGGEE  YKEIW IIDEKWEFQLHRHLH+A Y+LNP   Y ++FSNHPE+KLGL+ CMD+LI D  ER KADLQ  +F  +E
Subjt:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE

Query:  GFFGFRQAIASYKKRS
        GFFGF QA  +++KRS
Subjt:  GFFGFRQAIASYKKRS

XP_030502380.1 uncharacterized protein LOC115717535 [Cannabis sativa]3.5e-4068.1Show/hide
Query:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE
        MGF+Y AMD AKE+IA NLGGEE  YKEIW IIDEKWEFQLHRHLH+A Y+LNP   Y ++FSNHPE+KLGL+ CMD+LI D  ER KADLQ  +F  +E
Subjt:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE

Query:  GFFGFRQAIASYKKRS
        GFFGF QA  +++KRS
Subjt:  GFFGFRQAIASYKKRS

TrEMBL top hitse value%identityAlignment
A0A2I0V9L1 Uncharacterized protein3.0e-3765.52Show/hide
Query:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE
        MGFIY AMD AKE IA NLGG E S++EIWNIID++WE QLHRHLH+A Y+LNP +QY +N S +PEIKLGLY CMD+LI+D  ER+ ADLQ+  FR +E
Subjt:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE

Query:  GFFGFRQAIASYKKRS
        GFFG +QA  +  KRS
Subjt:  GFFGFRQAIASYKKRS

A0A2I0WAP2 Uncharacterized protein3.0e-3765.52Show/hide
Query:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE
        MGFIY AMD AKE IA NLGG E S++EIWNIID++WE QLHRHLH+A Y+LNP +QY +N S +PEIKLGLY CMD+LI+D  ER+ ADLQ+  FR +E
Subjt:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE

Query:  GFFGFRQAIASYKKRS
        GFFG +QA  +  KRS
Subjt:  GFFGFRQAIASYKKRS

A0A2I0XBC2 Dimer_Tnp_hAT domain-containing protein3.0e-3765.52Show/hide
Query:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE
        MGFIY AMD AKE IA NLGG E S++EIWNIID++WE QLHRHLH+A Y+LNP +QY +N S +PEIKLGLY CMD+LI+D  ER+ ADLQ+  FR +E
Subjt:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE

Query:  GFFGFRQAIASYKKRS
        GFFG +QA  +  KRS
Subjt:  GFFGFRQAIASYKKRS

A0A6J1E3R9 uncharacterized protein LOC1110258021.1e-5850.36Show/hide
Query:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE
        MGFIYGAMDSAKEEIAKN GGEEASYKEIWNIIDEKWEFQLHRHLH+A YFLNP FQYD+NFS HPEIKLGLYTC DK+I DE ER KADLQ D FRRRE
Subjt:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE

Query:  GFFGFRQAIASYKK--------------------------------------------------------------------------------------
        GFFGF+QAIAS KK                                                                                      
Subjt:  GFFGFRQAIASYKK--------------------------------------------------------------------------------------

Query:  ---------------------RSSDKGKRSMTLDEEEWLDIGSDNENQDETISYEDDSMEDPISDSDNDIIDGL
                             RS DKGK  MTLDE EW+DI SDNEN+D  I Y+DDSMEDPISDSDND++D L
Subjt:  ---------------------RSSDKGKRSMTLDEEEWLDIGSDNENQDETISYEDDSMEDPISDSDNDIIDGL

A0A803QF45 Uncharacterized protein3.0e-3755.17Show/hide
Query:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE
        MGF+Y AMD AKE+IA NLGGEE  YKEIW IIDEKWEFQL+R+LH+A Y+LNP   Y  +FSNHPE+KLG + CMD+LI D  ER KA+LQ  +F  +E
Subjt:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRRE

Query:  GFFGFRQAIASYKKRSSD----KGKRSMTLDEEEWL---DIGSDN
        GFFGF Q   +++KRS      +  + + L  + W+   D+GS N
Subjt:  GFFGFRQAIASYKKRSSD----KGKRSMTLDEEEWL---DIGSDN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G13020.1 hAT transposon superfamily protein8.1e-1138.89Show/hide
Query:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGL
        +G+IY  +D  K  I K    E+  Y  +W++ID+ W   LH  LH+A Y+LNP   Y  +F   PE+  GL
Subjt:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGL

AT3G13030.1 hAT transposon superfamily protein6.4e-1638.14Show/hide
Query:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFR
        +G++Y  MDS KE IA+    +   YK +W++ID+ W   LH  LH+A YFLNP   Y  NF    E+  GL + +  ++ D + + K   QID++R
Subjt:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFR

AT3G13030.2 hAT transposon superfamily protein6.4e-1638.14Show/hide
Query:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFR
        +G++Y  MDS KE IA+    +   YK +W++ID+ W   LH  LH+A YFLNP   Y  NF    E+  GL + +  ++ D + + K   QID++R
Subjt:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFR

AT3G13030.3 hAT transposon superfamily protein6.4e-1638.14Show/hide
Query:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFR
        +G++Y  MDS KE IA+    +   YK +W++ID+ W   LH  LH+A YFLNP   Y  NF    E+  GL + +  ++ D + + K   QID++R
Subjt:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFR

AT5G33406.1 hAT dimerisation domain-containing protein / transposase-related2.1e-1940.17Show/hide
Query:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQY-DDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRR
        MG+IYGAMD AKE I K+   +E +YK  + IID +W+ QLHR LH+A Y+LNP+F Y   +   + E+  G   C+ +L+     + K   ++D F++ 
Subjt:  MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQY-DDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRR

Query:  EGFFGFRQAIASYKKRS
         G FG   AI    K S
Subjt:  EGFFGFRQAIASYKKRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATTTATATATGGTGCCATGGATTCAGCAAAAGAGGAAATTGCCAAAAATCTAGGGGGAGAGGAAGCAAGCTACAAGGAGATATGGAACATTATTGATGAA
AAGTGGGAGTTTCAACTTCATCGACACTTACATTCCGCAACATATTTCTTGAACCCAGATTTTCAATATGATGATAATTTTTCCAATCATCCAGAGATCAAATTG
GGATTGTATACATGTATGGATAAATTGATAGCGGACGAAAACGAGAGAACAAAAGCTGATCTTCAAATTGATTTATTTCGAAGGAGGGAAGGATTTTTTGGGTTC
CGTCAAGCAATAGCATCTTACAAAAAACGATCTTCAGATAAAGGGAAGAGGTCGATGACGTTGGATGAAGAGGAATGGTTAGATATTGGGAGTGACAACGAGAAT
CAAGATGAAACTATTAGTTATGAAGATGACTCTATGGAAGATCCTATTAGTGATAGTGATAATGATATTATTGATGGACTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGATTTATATATGGTGCCATGGATTCAGCAAAAGAGGAAATTGCCAAAAATCTAGGGGGAGAGGAAGCAAGCTACAAGGAGATATGGAACATTATTGATGAA
AAGTGGGAGTTTCAACTTCATCGACACTTACATTCCGCAACATATTTCTTGAACCCAGATTTTCAATATGATGATAATTTTTCCAATCATCCAGAGATCAAATTG
GGATTGTATACATGTATGGATAAATTGATAGCGGACGAAAACGAGAGAACAAAAGCTGATCTTCAAATTGATTTATTTCGAAGGAGGGAAGGATTTTTTGGGTTC
CGTCAAGCAATAGCATCTTACAAAAAACGATCTTCAGATAAAGGGAAGAGGTCGATGACGTTGGATGAAGAGGAATGGTTAGATATTGGGAGTGACAACGAGAAT
CAAGATGAAACTATTAGTTATGAAGATGACTCTATGGAAGATCCTATTAGTGATAGTGATAATGATATTATTGATGGACTTTAG
Protein sequenceShow/hide protein sequence
MGFIYGAMDSAKEEIAKNLGGEEASYKEIWNIIDEKWEFQLHRHLHSATYFLNPDFQYDDNFSNHPEIKLGLYTCMDKLIADENERTKADLQIDLFRRREGFFGF
RQAIASYKKRSSDKGKRSMTLDEEEWLDIGSDNENQDETISYEDDSMEDPISDSDNDIIDGL