CuGenDBv2

Tan0015865 (gene) of Snake gourd v1 genome

Gene ID: Tan0015865
Organism: Trichosanthes anguina (Snake gourd v1)
Description: CACTA en-spm transposon protein
Genome location: LG02:56307001..56308433
RNA-Seq Expression: Tan0015865
Synteny: Tan0015865
Gene Ontology terms: NA
InterPro domains: NA
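
The genome location field can be split into a sequence name and two coordinates. The sketch below is only an illustration: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG02:56307001..56308433")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 1433 bp on LG02.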


Homology
GenBank top hits | e value | % identity
XP_022156286.1 uncharacterized protein LOC111023212 [Momordica charantia] | 6.0e-30 | 44.57
Query:  RDTIPLHYKTWSDVPKQVRDSIKDRLSKIQKGHEVGQVDLFHESHFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLSTRSGHIKGLG
        R  +P +++  S    Q++  +     KI++G ++G VDLF ESH++ KDG VND+A DAY  MQ L++A +QEG EP++Q E C+ VL  R  H+KGLG
Subjt:  RDTIPLHYKTWSDVPKQVRDSIKDRLSKIQKGHEVGQVDLFHESHFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLSTRSGHIKGLG

Query:  WDPNSS-----SSSSVTSSSQHEKELEKKVEQMQAEIGTLTTK-------LSSWEERWAEFTKYMDERQGEGSSN
        + P  +     SSS+VTSS+ +EKELEKKVE M+ E+  + T+       +S+WE+RW E +++M  RQG+G SN
Subjt:  WDPNSS-----SSSSVTSSSQHEKELEKKVEQMQAEIGTLTTK-------LSSWEERWAEFTKYMDERQGEGSSN

XP_038887408.1 poly [ADP-ribose] polymerase 1-like isoform X1 [Benincasa hispida] | 8.1e-43 | 36.31
Query:  EARGREGIARRIELERYVNAHGRIPIEIDEKVGKPVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRL-----------------------
        E R   G +R +EL+R+VN HGRI IEIDE+VGKPVC+ AT FS AIGTI R+TIPL  K WSDV K+VRD + D+L                       
Subjt:  EARGREGIARRIELERYVNAHGRIPIEIDEKVGKPVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRL-----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------SKIQKGHEVGQVDLFHESHFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLSTRSGHIKGLGWDPNSSSSSSVTSSSQHEKE
               KI++G +V QVDLF +SHF  KDGWVN++A+DAYL+MQ+L+EAS QE   P+S  EVCK VL  RSG+IKGLG +P  SSSSSVTS  Q +KE
Subjt:  ------SKIQKGHEVGQVDLFHESHFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLSTRSGHIKGLGWDPNSSSSSSVTSSSQHEKE

Query:  LEKKVEQMQAEI--------------GTLTTKLSSWEERWAEFTKYMDERQG-EGSSN
        LEKK+E+M+ E+                LT++LS WE RWAE    +   QG +G SN
Subjt:  LEKKVEQMQAEI--------------GTLTTKLSSWEERWAEFTKYMDERQG-EGSSN

XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida] | 6.0e-46 | 39.27
Query:  EARGREGIARRIELERYVNAHGRIPIEIDEKVGKPVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRL-----------------------
        E R   G +R +EL+R+VN HGRI IEIDE+VGKPVC+ AT FS AIGTI R+TIPL  K WSDV K+VRD + D+L                       
Subjt:  EARGREGIARRIELERYVNAHGRIPIEIDEKVGKPVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRL-----------------------

Query:  -------------------------------------------------------------------------------SKIQKGHEVGQVDLFHESHFS
                                                                                        KI++G +V QVDLF +SHF 
Subjt:  -------------------------------------------------------------------------------SKIQKGHEVGQVDLFHESHFS

Query:  IKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLSTRSGHIKGLGWDPNSSSSSSVTSSSQHEKELEKKVEQMQAEI--------------G
         KDGWVN++A+DAYL+MQ+L+EAS QE   P+S  EVCK VL  RSG+IKGLG +P  SSSSSVTS  Q +KELEKK+E+M+ E+               
Subjt:  IKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLSTRSGHIKGLGWDPNSSSSSSVTSSSQHEKELEKKVEQMQAEI--------------G

Query:  TLTTKLSSWEERWAEFTKYMDERQG-EGSSN
         LT++LS WE RWAE    +   QG +G SN
Subjt:  TLTTKLSSWEERWAEFTKYMDERQG-EGSSN

XP_038887410.1 poly [ADP-ribose] polymerase 1-like isoform X3 [Benincasa hispida] | 3.1e-26 | 29.89
Query:  EARGREGIARRIELERYVNAHGRIPIEIDEKVGKPVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRL-----------------------
        E R   G +R +EL+R+VN HGRI IEIDE+VGKPVC+ AT FS AIGTI R+TIPL  K WSDV K+VRD + D+L                       
Subjt:  EARGREGIARRIELERYVNAHGRIPIEIDEKVGKPVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRL-----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------SKIQKGHEVGQVDLFHESHFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLSTRSGHIKGLGWDPNSSSSSSVTSSSQHEKE
               KI++G +V QVDLF +SHF  KDGWVN++A+DAYL+MQ+L+EAS QE   P+                                 SS + +KE
Subjt:  ------SKIQKGHEVGQVDLFHESHFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLSTRSGHIKGLGWDPNSSSSSSVTSSSQHEKE

Query:  LEKKVEQMQAEI--------------GTLTTKLSSWEERWAEFTKYMDERQG-EGSSN
        LEKK+E+M+ E+                LT++LS WE RWAE    +   QG +G SN
Subjt:  LEKKVEQMQAEI--------------GTLTTKLSSWEERWAEFTKYMDERQG-EGSSN

XP_038887413.1 uncharacterized protein LOC120077557 isoform X5 [Benincasa hispida] | 8.1e-43 | 36.31
Query:  EARGREGIARRIELERYVNAHGRIPIEIDEKVGKPVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRL-----------------------
        E R   G +R +EL+R+VN HGRI IEIDE+VGKPVC+ AT FS AIGTI R+TIPL  K WSDV K+VRD + D+L                       
Subjt:  EARGREGIARRIELERYVNAHGRIPIEIDEKVGKPVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRL-----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------SKIQKGHEVGQVDLFHESHFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLSTRSGHIKGLGWDPNSSSSSSVTSSSQHEKE
               KI++G +V QVDLF +SHF  KDGWVN++A+DAYL+MQ+L+EAS QE   P+S  EVCK VL  RSG+IKGLG +P  SSSSSVTS  Q +KE
Subjt:  ------SKIQKGHEVGQVDLFHESHFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLSTRSGHIKGLGWDPNSSSSSSVTSSSQHEKE

Query:  LEKKVEQMQAEI--------------GTLTTKLSSWEERWAEFTKYMDERQG-EGSSN
        LEKK+E+M+ E+                LT++LS WE RWAE    +   QG +G SN
Subjt:  LEKKVEQMQAEI--------------GTLTTKLSSWEERWAEFTKYMDERQG-EGSSN

TrEMBL top hits | e value | % identity
A0A5A7TFG0 Transposon protein, putative, CACTA, En/Spm sub-class | 6.5e-22 | 29.96
Query:  EREARGREGIARRIELERYVNAHGRIPIEIDEKVGKPVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRL------------------SKI
        ++  RG  G  R IEL+++V  HG+I IEI+E+ GKPV + A   +  IGT  R+TIPL  + W  VP  VR+ + DRL                   K+
Subjt:  EREARGREGIARRIELERYVNAHGRIPIEIDEKVGKPVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRL------------------SKI

Query:  Q--------------------------------------------------KGHEVGQVDLFHESHFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPI
        Q                                                  KG +V ++++FHE+HF  K+GW+ND A+DAYL+MQ+++  S++ G + I
Subjt:  Q--------------------------------------------------KGHEVGQVDLFHESHFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPI

Query:  SQSEVCKMVLSTRSGHIKGLGWDPNSSSSSSVTSSSQHEKELEKKVEQMQAEIGTLTTKLSSWEERW
        S ++ C+ VL +RS        +P S  S     SS  EKE + ++  ++     LT +L+ WE+ +
Subjt:  SQSEVCKMVLSTRSGHIKGLGWDPNSSSSSSVTSSSQHEKELEKKVEQMQAEIGTLTTKLSSWEERW

A0A5A7UJI4 Uncharacterized protein | 4.5e-23 | 35.68
Query:  EREARGREGIARRIELERYVNAHGRIPIEIDEKVGKPVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRLSKIQKGHEVGQVDLFHESHFS
        ++  RG  G  R IEL+++V  HG+I  EI+E+ GKP+ + A     +IGT  R+TIPL  + W  VP  VR  + DRL K +KG +V ++++FHE+HF 
Subjt:  EREARGREGIARRIELERYVNAHGRIPIEIDEKVGKPVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRLSKIQKGHEVGQVDLFHESHFS

Query:  IKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLSTRSGHIKGLGWDPNSSSSSSVTSSSQHEKELEKKVEQMQAEIGTLTTKLSSWEERW
         K+GW+ND  R            S++ G + IS ++ C+ VL +RS        +P S  S     SS  EKE + ++  ++     LT +L+ WE+ +
Subjt:  IKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLSTRSGHIKGLGWDPNSSSSSSVTSSSQHEKELEKKVEQMQAEIGTLTTKLSSWEERW

A0A5D3BKN7 DUF4218 domain-containing protein | 4.1e-24 | 37.19
Query:  EREARGREGIARRIELERYVNAHGRIPIEIDEKVGKPVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRLSKIQKGHEVGQVDLFHESHFS
        ++ ARG  G  R IEL+++V  HG+I IEI+E+ GKPV + A      IGT  R+TIPL  + W  VP  VR  + DRL K +KG +V ++++FHE+HF 
Subjt:  EREARGREGIARRIELERYVNAHGRIPIEIDEKVGKPVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRLSKIQKGHEVGQVDLFHESHFS

Query:  IKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLSTRSGHIKGLGWDPNSSSSSSVTSSSQHEKELEKKVEQMQAEIGTLTTKLSSWEERW
         K+GW+ND  R            S++ G + IS ++ C+ VL +RS        +P S  S     SS  EKE + ++  ++     LT +L+ WE+ +
Subjt:  IKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLSTRSGHIKGLGWDPNSSSSSSVTSSSQHEKELEKKVEQMQAEIGTLTTKLSSWEERW

A0A5D3E0V3 Uncharacterized protein | 2.8e-25 | 33.48
Query:  EREARGREGIARRIELERYVNAHGRIPIEIDEKVGKPVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRLS--------------------
        ++  RG  G  R IEL+++V  HG+I IEI+E+ GKPV +     +  IGT  R+TIPL  + W  VP  VR+ + D L                     
Subjt:  EREARGREGIARRIELERYVNAHGRIPIEIDEKVGKPVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRLS--------------------

Query:  -----------KIQKGHEVGQVDLFHESHFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLSTRSGHIKGLGWDPNSSSSSSVTSSSQ
                   K +KG +V ++++FHE+HF  K+GW+ND A+DAYL+MQ+++  S++ G + IS ++ C+ VL +RS        +P S  S     SS 
Subjt:  -----------KIQKGHEVGQVDLFHESHFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLSTRSGHIKGLGWDPNSSSSSSVTSSSQ

Query:  HEKELEKKVEQMQAEIGTLTTKLSSWEERW
         EKE + ++  ++     LT +L+ WE+ +
Subjt:  HEKELEKKVEQMQAEIGTLTTKLSSWEERW

A0A6J1DUH3 uncharacterized protein LOC111023212 | 2.9e-30 | 44.57
Query:  RDTIPLHYKTWSDVPKQVRDSIKDRLSKIQKGHEVGQVDLFHESHFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLSTRSGHIKGLG
        R  +P +++  S    Q++  +     KI++G ++G VDLF ESH++ KDG VND+A DAY  MQ L++A +QEG EP++Q E C+ VL  R  H+KGLG
Subjt:  RDTIPLHYKTWSDVPKQVRDSIKDRLSKIQKGHEVGQVDLFHESHFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLSTRSGHIKGLG

Query:  WDPNSS-----SSSSVTSSSQHEKELEKKVEQMQAEIGTLTTK-------LSSWEERWAEFTKYMDERQGEGSSN
        + P  +     SSS+VTSS+ +EKELEKKVE M+ E+  + T+       +S+WE+RW E +++M  RQG+G SN
Subjt:  WDPNSS-----SSSSVTSSSQHEKELEKKVEQMQAEIGTLTTK-------LSSWEERWAEFTKYMDERQGEGSSN

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGCTCTGGAGGTTAATCGTTCGACAGCGCAACATCGTACACGAGCTTCAGCCTCTAGAACGCGAGGCGAGAGGCCGAGAGGGCATAGCCAGAAGGATTGAGTTAGAGCG
CTATGTCAATGCACATGGTAGAATACCCATCGAGATCGATGAGAAGGTCGGCAAACCAGTGTGTAGTAAGGCCACTACGTTCAGTGGAGCCATTGGTACCATCACCCGAG
ATACAATTCCACTGCATTATAAAACGTGGAGCGACGTCCCAAAGCAAGTTCGAGACAGCATAAAAGATCGACTCTCTAAGATACAAAAGGGTCATGAAGTAGGCCAAGTT
GATTTGTTCCATGAAAGTCACTTCAGCATAAAGGACGGATGGGTGAACGACCATGCGAGGGATGCATATTTGAAAATGCAACAACTTCTTGAGGCATCATCACAGGAAGG
ATCTGAGCCAATCTCACAGTCAGAAGTTTGTAAAATGGTTTTAAGTACTCGATCAGGCCACATAAAAGGTCTTGGTTGGGATCCAAATTCTAGTTCGTCGTCTAGCGTCA
CATCTTCTTCCCAACATGAAAAAGAGCTTGAAAAGAAGGTGGAGCAGATGCAAGCTGAGATTGGTACCTTAACGACGAAGTTGTCCTCATGGGAAGAAAGATGGGCTGAA
TTCACAAAGTACATGGATGAAAGGCAGGGTGAAGGTTCTTCAAACCCCTAG
mRNA sequence
ATGCTCTGGAGGTTAATCGTTCGACAGCGCAACATCGTACACGAGCTTCAGCCTCTAGAACGCGAGGCGAGAGGCCGAGAGGGCATAGCCAGAAGGATTGAGTTAGAGCG
CTATGTCAATGCACATGGTAGAATACCCATCGAGATCGATGAGAAGGTCGGCAAACCAGTGTGTAGTAAGGCCACTACGTTCAGTGGAGCCATTGGTACCATCACCCGAG
ATACAATTCCACTGCATTATAAAACGTGGAGCGACGTCCCAAAGCAAGTTCGAGACAGCATAAAAGATCGACTCTCTAAGATACAAAAGGGTCATGAAGTAGGCCAAGTT
GATTTGTTCCATGAAAGTCACTTCAGCATAAAGGACGGATGGGTGAACGACCATGCGAGGGATGCATATTTGAAAATGCAACAACTTCTTGAGGCATCATCACAGGAAGG
ATCTGAGCCAATCTCACAGTCAGAAGTTTGTAAAATGGTTTTAAGTACTCGATCAGGCCACATAAAAGGTCTTGGTTGGGATCCAAATTCTAGTTCGTCGTCTAGCGTCA
CATCTTCTTCCCAACATGAAAAAGAGCTTGAAAAGAAGGTGGAGCAGATGCAAGCTGAGATTGGTACCTTAACGACGAAGTTGTCCTCATGGGAAGAAAGATGGGCTGAA
TTCACAAAGTACATGGATGAAAGGCAGGGTGAAGGTTCTTCAAACCCCTAG
Protein sequence
MLWRLIVRQRNIVHELQPLEREARGREGIARRIELERYVNAHGRIPIEIDEKVGKPVCSKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRLSKIQKGHEVGQV
DLFHESHFSIKDGWVNDHARDAYLKMQQLLEASSQEGSEPISQSEVCKMVLSTRSGHIKGLGWDPNSSSSSSVTSSSQHEKELEKKVEQMQAEIGTLTTKLSSWEERWAE
FTKYMDERQGEGSSNP
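
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene; the function name `translate` is hypothetical and not part of CuGenDBv2.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 nucleotides of the CDS listed above.
print(translate("ATGCTCTGGAGGTTAATC"))  # MLWRLI
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick consistency check; whitespace and line breaks in the pasted text are ignored.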