; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg027094 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg027094
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDimer_Tnp_hAT domain-containing protein
Genome locationscaffold8:2480078..2481746
RNA-Seq ExpressionSpg027094
SyntenySpg027094
Gene Ontology termsGO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PKU69514.1 hypothetical protein MA16_Dca022888 [Dendrobium catenatum]2.1e-4445.45Show/hide
Query:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLY-------------------------
        PAMGFIY AMD AKE IA NLGG E S+++IWNIID++WE +LHR+LH A Y+LNP +QY +N S +PEIKL LY                         
Subjt:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLY-------------------------

Query:  ------------------TFDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEEDD
                            +WW QFGDGTPEL +FA+KVL LTCS++GCERN S++NQ                           DR+LK+KGL +++D
Subjt:  ------------------TFDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEEDD

Query:  PLVVDDVASDDEWIVEDNNE
        PL+ DDVASDDEW V+D  E
Subjt:  PLVVDDVASDDEWIVEDNNE

PKU72727.1 hypothetical protein MA16_Dca007447 [Dendrobium catenatum]4.6e-4445Show/hide
Query:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLY-------------------------
        PAMGFIY AMD AKE IA NLGG E S+++IWNIID++WE +LHR+LH A Y+LNP +QY +N S +PEIKL LY                         
Subjt:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLY-------------------------

Query:  ------------------TFDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEEDD
                            +WW QFGDGTPEL +FA+KVL LTCS++GCERN S++NQ                           D++LK+KGL +++D
Subjt:  ------------------TFDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEEDD

Query:  PLVVDDVASDDEWIVEDNNE
        PL+ DDVASDDEW V+D  E
Subjt:  PLVVDDVASDDEWIVEDNNE

PKU85227.1 hypothetical protein MA16_Dca025470 [Dendrobium catenatum]4.6e-4441.6Show/hide
Query:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLY-------------------------
        PAMGFIY AMD AKE IA NLGG E S+++IWNIID++WE +LHR+LH A Y+LNP +QY +N S +PEIKL LY                         
Subjt:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLY-------------------------

Query:  ------------------TFDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEEDD
                            +WW QFGDGTPEL +FA+KVL LTCS++GCERN S++NQ                           DR+LK+KGL +++D
Subjt:  ------------------TFDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEEDD

Query:  PLVVDDVASDDEWIVEDNNESGVDFFVEHEDDSNINVYEHGEGSSTQQRSSGKTRQQNIKSG
        PL+ DDVASDDEW V+D     V+       D ++NV    +     + S+  T Q+ IK G
Subjt:  PLVVDDVASDDEWIVEDNNESGVDFFVEHEDDSNINVYEHGEGSSTQQRSSGKTRQQNIKSG

XP_022159386.1 uncharacterized protein LOC111025802 [Momordica charantia]1.8e-5148.3Show/hide
Query:  VPAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLYT-----------------------
        VPAMGFIYGAMD AKEEIAKN GGEEASYK+IWNIIDEKWEF+LHR+LH A YFLNPHFQYD+NFS HPEIKL LYT                       
Subjt:  VPAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLYT-----------------------

Query:  --------------------FDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEED
                             DWWTQFGDGTPELAKFAIKVLS TCSA+GC RN S+FNQ                           D+HLKRK LKEE+
Subjt:  --------------------FDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEED

Query:  DPLVVDDVASDDEWIVEDNNESGVDFFVEHEDDSNINVYEHGEGSSTQQRS-SGKTRQQNIKSGD
        DPL+                                     GEG+STQQR  S KT+Q   +S D
Subjt:  DPLVVDDVASDDEWIVEDNNESGVDFFVEHEDDSNINVYEHGEGSSTQQRS-SGKTRQQNIKSGD

XP_031121045.1 uncharacterized protein LOC116024293 [Ipomoea triloba]1.1e-4542.63Show/hide
Query:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLYTF-----------------------
        PAMGF+Y AMD AKE+IAKNLGGEE  YK+IW IID+KW+F++HR+LH AAY+LNP   Y  +FS HPEIKL L+ +                       
Subjt:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLYTF-----------------------

Query:  --------------------DWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEEDD
                            +WWTQ+GDGTPEL KFA+KVL LTCS++GCERN S FNQ                           DRHLK K L  E+D
Subjt:  --------------------DWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEEDD

Query:  PLVVDDVASDDEWIVEDNNESGVDFFVEHEDDSNINVYEHGEGSSTQQRSS
         L++D++ SDDEW+V +N E      +   DD +++V+E     +TQ  +S
Subjt:  PLVVDDVASDDEWIVEDNNESGVDFFVEHEDDSNINVYEHGEGSSTQQRSS

TrEMBL top hitse value%identityAlignment
A0A2I0W1H0 Dimer_Tnp_hAT domain-containing protein1.0e-4445.45Show/hide
Query:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLY-------------------------
        PAMGFIY AMD AKE IA NLGG E S+++IWNIID++WE +LHR+LH A Y+LNP +QY +N S +PEIKL LY                         
Subjt:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLY-------------------------

Query:  ------------------TFDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEEDD
                            +WW QFGDGTPEL +FA+KVL LTCS++GCERN S++NQ                           DR+LK+KGL +++D
Subjt:  ------------------TFDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEEDD

Query:  PLVVDDVASDDEWIVEDNNE
        PL+ DDVASDDEW V+D  E
Subjt:  PLVVDDVASDDEWIVEDNNE

A0A2I0WAP2 Uncharacterized protein2.2e-4445Show/hide
Query:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLY-------------------------
        PAMGFIY AMD AKE IA NLGG E S+++IWNIID++WE +LHR+LH A Y+LNP +QY +N S +PEIKL LY                         
Subjt:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLY-------------------------

Query:  ------------------TFDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEEDD
                            +WW QFGDGTPEL +FA+KVL LTCS++GCERN S++NQ                           D++LK+KGL +++D
Subjt:  ------------------TFDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEEDD

Query:  PLVVDDVASDDEWIVEDNNE
        PL+ DDVASDDEW V+D  E
Subjt:  PLVVDDVASDDEWIVEDNNE

A0A2I0X230 Dimer_Tnp_hAT domain-containing protein5.0e-4441.22Show/hide
Query:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLY-------------------------
        PAMGFIY AMD AKE IA NLGG E S+++IWNIID++WE +LHR+LH A Y+LNP +QY +N S +PEIKL LY                         
Subjt:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLY-------------------------

Query:  ------------------TFDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEEDD
                            +WW QFGDGTPEL +FA+KVL LTCS++GCERN S++NQ                           DR+LK+KGL +++D
Subjt:  ------------------TFDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEEDD

Query:  PLVVDDVASDDEWIVEDNNESGVDFFVEHEDDSNINVYEHGEGSSTQQRSSGKTRQQNIKSG
        PL+ DDVASDDEW V+D     V+       D ++NV    +   + +  +  T Q+ IK G
Subjt:  PLVVDDVASDDEWIVEDNNESGVDFFVEHEDDSNINVYEHGEGSSTQQRSSGKTRQQNIKSG

A0A2I0XBC2 Dimer_Tnp_hAT domain-containing protein2.2e-4441.6Show/hide
Query:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLY-------------------------
        PAMGFIY AMD AKE IA NLGG E S+++IWNIID++WE +LHR+LH A Y+LNP +QY +N S +PEIKL LY                         
Subjt:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLY-------------------------

Query:  ------------------TFDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEEDD
                            +WW QFGDGTPEL +FA+KVL LTCS++GCERN S++NQ                           DR+LK+KGL +++D
Subjt:  ------------------TFDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEEDD

Query:  PLVVDDVASDDEWIVEDNNESGVDFFVEHEDDSNINVYEHGEGSSTQQRSSGKTRQQNIKSG
        PL+ DDVASDDEW V+D     V+       D ++NV    +     + S+  T Q+ IK G
Subjt:  PLVVDDVASDDEWIVEDNNESGVDFFVEHEDDSNINVYEHGEGSSTQQRSSGKTRQQNIKSG

A0A6J1E3R9 uncharacterized protein LOC1110258028.5e-5248.3Show/hide
Query:  VPAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLYT-----------------------
        VPAMGFIYGAMD AKEEIAKN GGEEASYK+IWNIIDEKWEF+LHR+LH A YFLNPHFQYD+NFS HPEIKL LYT                       
Subjt:  VPAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLYT-----------------------

Query:  --------------------FDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEED
                             DWWTQFGDGTPELAKFAIKVLS TCSA+GC RN S+FNQ                           D+HLKRK LKEE+
Subjt:  --------------------FDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------------------------DRHLKRKGLKEED

Query:  DPLVVDDVASDDEWIVEDNNESGVDFFVEHEDDSNINVYEHGEGSSTQQRS-SGKTRQQNIKSGD
        DPL+                                     GEG+STQQR  S KT+Q   +S D
Subjt:  DPLVVDDVASDDEWIVEDNNESGVDFFVEHEDDSNINVYEHGEGSSTQQRS-SGKTRQQNIKSGD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G79740.1 hAT transposon superfamily4.1e-1428.28Show/hide
Query:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKL---------------------------R
        PA+G IY  M  AKE I      +E  +K   +I+D  W   LH  LH AA FLNP  QY      +PEIK                            +
Subjt:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKL---------------------------R

Query:  LYTFD----------------------WWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------DRHLKRK------------GLKEEDD
        ++TF                       WW QFGD  P L + AI++LS  CS    ER  S+F Q         DR +  K             +  E D
Subjt:  LYTFD----------------------WWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQ---------DRHLKRK------------GLKEEDD

Query:  PLVVDDVASDDEWIVEDNNES---GVDFFVEHEDDSNINVYEHG
        P+ ++D+    EW+ E  N S    +D F    D  ++N  + G
Subjt:  PLVVDDVASDDEWIVEDNNES---GVDFFVEHEDDSNINVYEHG

AT3G13020.1 hAT transposon superfamily protein5.9e-1329.79Show/hide
Query:  MGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRL----------------------------
        +G+IY  +D  K  I K    E+  Y  +W++ID+ W   LH  LH A Y+LNP   Y  +F   PE+   L                            
Subjt:  MGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRL----------------------------

Query:  -------------YTFDWWTQFGDGTPELAKFAIKVLSLTC
                        DWWT+     PEL  FAIK+LS TC
Subjt:  -------------YTFDWWTQFGDGTPELAKFAIKVLSLTC

AT3G13030.1 hAT transposon superfamily protein2.2e-1227.81Show/hide
Query:  MGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEI-----------------------KLRLYTF--
        +G++Y  MD  KE IA+    +   YK +W++ID+ W   LH  LH A YFLNP   Y  NF    E+                       ++ +Y    
Subjt:  MGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEI-----------------------KLRLYTF--

Query:  ------------------DWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQDRHLKRKGLKEE
                          +WW       PEL   AIK+LS TC         S +   R L  K L  E
Subjt:  ------------------DWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQDRHLKRKGLKEE

AT3G13030.2 hAT transposon superfamily protein2.2e-1227.81Show/hide
Query:  MGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEI-----------------------KLRLYTF--
        +G++Y  MD  KE IA+    +   YK +W++ID+ W   LH  LH A YFLNP   Y  NF    E+                       ++ +Y    
Subjt:  MGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEI-----------------------KLRLYTF--

Query:  ------------------DWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQDRHLKRKGLKEE
                          +WW       PEL   AIK+LS TC         S +   R L  K L  E
Subjt:  ------------------DWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQDRHLKRKGLKEE

AT5G33406.1 hAT dimerisation domain-containing protein / transposase-related2.5e-2432.05Show/hide
Query:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYD-----------------------------------DNFS
        P MG+IYGAMD AKE I K+   +E +YK  + IID +W+ +LHR LH A Y+LNP F Y                                    D F 
Subjt:  PAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYD-----------------------------------DNFS

Query:  NHP---------EIKLRLYTFDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQDRHLKRKG----------------------LKEED--DPL
                     ++ ++   +WW+ +G  TP L  FAIKVLSLTCSATGCERN   F Q  H KR+                        K  D  DP+
Subjt:  NHP---------EIKLRLYTFDWWTQFGDGTPELAKFAIKVLSLTCSATGCERNCSSFNQDRHLKRKG----------------------LKEED--DPL

Query:  VVDDVASDDEWIV----EDNNESGVDFFVEHEDD
        +++++   +EW+     E+++++  D  V   DD
Subjt:  VVDDVASDDEWIV----EDNNESGVDFFVEHEDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCCTGCAATGGGATTTATATATGGTGCCATGGATTTAGCAAAAGAGGAAATTGCCAAAAATCTAGGGGGAGAGGAAGCAAGCTACAAGAAGATATGGAACATTAT
TGATGAAAAGTGGGAGTTTGAACTTCATCGATACTTACATGTCGCAGCATATTTCTTGAACCCACATTTTCAATATGATGATAATTTTTCCAATCATCCAGAGATCAAAT
TGAGATTGTATACATTTGATTGGTGGACTCAATTTGGCGATGGAACACCAGAACTAGCTAAATTTGCCATTAAAGTTCTAAGTCTTACATGTTCAGCAACTGGTTGTGAG
CGTAATTGTAGTTCATTTAATCAGGATAGACACTTGAAGCGAAAGGGTCTCAAGGAAGAAGATGATCCATTAGTAGTAGATGATGTGGCATCTGATGATGAGTGGATTGT
TGAAGACAATAATGAGTCTGGAGTTGATTTCTTTGTTGAACATGAGGATGATTCTAACATCAATGTTTATGAGCATGGAGAAGGAAGTAGTACTCAACAAAGGAGTAGTG
GTAAGACAAGACAACAAAATATTAAAAGTGGAGATCTGAATAGCCTCAACGTCGCGTCGGAGCACCGATCTGAACCAGAGCGGCGAGTCAGATCTAGAGGCCGGGGGAAT
CCATTTTTAGATCCAACGATGAAGATTGTCTGGAACGACAAAGGGCGTGTGTCAGATCTGGAGGTCGAGCGGAGATCGAAGATGGGTTTCGAGTGGTGGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGCCTGCAATGGGATTTATATATGGTGCCATGGATTTAGCAAAAGAGGAAATTGCCAAAAATCTAGGGGGAGAGGAAGCAAGCTACAAGAAGATATGGAACATTAT
TGATGAAAAGTGGGAGTTTGAACTTCATCGATACTTACATGTCGCAGCATATTTCTTGAACCCACATTTTCAATATGATGATAATTTTTCCAATCATCCAGAGATCAAAT
TGAGATTGTATACATTTGATTGGTGGACTCAATTTGGCGATGGAACACCAGAACTAGCTAAATTTGCCATTAAAGTTCTAAGTCTTACATGTTCAGCAACTGGTTGTGAG
CGTAATTGTAGTTCATTTAATCAGGATAGACACTTGAAGCGAAAGGGTCTCAAGGAAGAAGATGATCCATTAGTAGTAGATGATGTGGCATCTGATGATGAGTGGATTGT
TGAAGACAATAATGAGTCTGGAGTTGATTTCTTTGTTGAACATGAGGATGATTCTAACATCAATGTTTATGAGCATGGAGAAGGAAGTAGTACTCAACAAAGGAGTAGTG
GTAAGACAAGACAACAAAATATTAAAAGTGGAGATCTGAATAGCCTCAACGTCGCGTCGGAGCACCGATCTGAACCAGAGCGGCGAGTCAGATCTAGAGGCCGGGGGAAT
CCATTTTTAGATCCAACGATGAAGATTGTCTGGAACGACAAAGGGCGTGTGTCAGATCTGGAGGTCGAGCGGAGATCGAAGATGGGTTTCGAGTGGTGGAGATGA
Protein sequenceShow/hide protein sequence
MVPAMGFIYGAMDLAKEEIAKNLGGEEASYKKIWNIIDEKWEFELHRYLHVAAYFLNPHFQYDDNFSNHPEIKLRLYTFDWWTQFGDGTPELAKFAIKVLSLTCSATGCE
RNCSSFNQDRHLKRKGLKEEDDPLVVDDVASDDEWIVEDNNESGVDFFVEHEDDSNINVYEHGEGSSTQQRSSGKTRQQNIKSGDLNSLNVASEHRSEPERRVRSRGRGN
PFLDPTMKIVWNDKGRVSDLEVERRSKMGFEWWR