CuGenDBv2

Tan0011558 (gene) of Snake gourd v1 genome

Gene ID: Tan0011558
Organism: Trichosanthes anguina (Snake gourd v1)
Description: CACTA en-spm transposon protein
Genome location: LG10:58650257..58651740
RNA-Seq Expression: Tan0011558
Synteny: Tan0011558
Gene Ontology terms: NA
InterPro domains: NA
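
The genome location above follows a `SEQID:START..END` pattern. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tooling, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("LG10:58650257..58651740")
print(seqid, start, end, span_length(start, end))
# prints: LG10 58650257 58651740 1484
```

Under that assumption the gene spans 1484 bp on LG10. If the coordinates were 0-based and half-open instead, the length would be 1483 bp.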


Homology
GenBank top hits (e value, % identity, alignment)
XP_038887408.1 poly [ADP-ribose] polymerase 1-like isoform X1 [Benincasa hispida], e value: 7.4e-27, % identity: 29.46
Query:  MVHPSQDHDVVVVLEEENALEVKRSTAQHRTRASG-------------RRARGHSRRIELERYVNAHGRIPIEIDEKVDKPVCTKATTFSGAIGTITRDT
        + H  Q+ +  ++L+ ++   +  ST +  T ASG             RR RGHSR +EL+R+VN HGRI IEIDE+V KPVC  AT FS AIGTI R+T
Subjt:  MVHPSQDHDVVVVLEEENALEVKRSTAQHRTRASG-------------RRARGHSRRIELERYVNAHGRIPIEIDEKVDKPVCTKATTFSGAIGTITRDT

Query:  IPLHYKTWSDVPKQVRDSIKDRL-----------------------------------------------------------------------------
        IPL  K WSDV K+VRD + D+L                                                                             
Subjt:  IPLHYKTWSDVPKQVRDSIKDRL-----------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------SLKMQQLLEASSQE
                                                                                               L+MQ+L+EAS QE
Subjt:  --------------------------------------------------------------------------------------SLKMQQLLEASSQE

Query:  GSEPISQSEVCKMVLGTRSGHIKGLGWDPNSSSSSSVTSSSQHEKELEKKVEHMQAEIGN--------------LTTKLSSWEERWTEFTKYMDERQG-E
           P+S  EVCK VLG RSG+IKGLG +P  SSSSSVTS  Q +KELEKK+E M+ E+                LT++LS WE RW E    +   QG +
Subjt:  GSEPISQSEVCKMVLGTRSGHIKGLGWDPNSSSSSSVTSSSQHEKELEKKVEHMQAEIGN--------------LTTKLSSWEERWTEFTKYMDERQG-E

Query:  GSSN
        G SN
Subjt:  GSSN

XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida], e value: 5.5e-30, % identity: 31.56
Query:  MVHPSQDHDVVVVLEEENALEVKRSTAQHRTRASG-------------RRARGHSRRIELERYVNAHGRIPIEIDEKVDKPVCTKATTFSGAIGTITRDT
        + H  Q+ +  ++L+ ++   +  ST +  T ASG             RR RGHSR +EL+R+VN HGRI IEIDE+V KPVC  AT FS AIGTI R+T
Subjt:  MVHPSQDHDVVVVLEEENALEVKRSTAQHRTRASG-------------RRARGHSRRIELERYVNAHGRIPIEIDEKVDKPVCTKATTFSGAIGTITRDT

Query:  IPLHYKTWSDVPKQVRDSIKDRL-----------------------------------------------------------------------------
        IPL  K WSDV K+VRD + D+L                                                                             
Subjt:  IPLHYKTWSDVPKQVRDSIKDRL-----------------------------------------------------------------------------

Query:  -----------------------------------------------------------SLKMQQLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGW
                                                                    L+MQ+L+EAS QE   P+S  EVCK VLG RSG+IKGLG 
Subjt:  -----------------------------------------------------------SLKMQQLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGW

Query:  DPNSSSSSSVTSSSQHEKELEKKVEHMQAEIGN--------------LTTKLSSWEERWTEFTKYMDERQG-EGSSN
        +P  SSSSSVTS  Q +KELEKK+E M+ E+                LT++LS WE RW E    +   QG +G SN
Subjt:  DPNSSSSSSVTSSSQHEKELEKKVEHMQAEIGN--------------LTTKLSSWEERWTEFTKYMDERQG-EGSSN

XP_038887410.1 poly [ADP-ribose] polymerase 1-like isoform X3 [Benincasa hispida], e value: 1.4e-20, % identity: 46.88
Query:  MVHPSQDHDVVVVLEEENALEVKRSTAQHRTRASG-------------RRARGHSRRIELERYVNAHGRIPIEIDEKVDKPVCTKATTFSGAIGTITRDT
        + H  Q+ +  ++L+ ++   +  ST +  T ASG             RR RGHSR +EL+R+VN HGRI IEIDE+V KPVC  AT FS AIGTI R+T
Subjt:  MVHPSQDHDVVVVLEEENALEVKRSTAQHRTRASG-------------RRARGHSRRIELERYVNAHGRIPIEIDEKVDKPVCTKATTFSGAIGTITRDT

Query:  IPLHYKTWSDVPKQVRDSIKDRLSLKMQ
        IPL  K WSDV K+VRD + D+L + ++
Subjt:  IPLHYKTWSDVPKQVRDSIKDRLSLKMQ

XP_038887411.1 poly [ADP-ribose] polymerase 1-like isoform X4 [Benincasa hispida], e value: 1.4e-20, % identity: 46.88
Query:  MVHPSQDHDVVVVLEEENALEVKRSTAQHRTRASG-------------RRARGHSRRIELERYVNAHGRIPIEIDEKVDKPVCTKATTFSGAIGTITRDT
        + H  Q+ +  ++L+ ++   +  ST +  T ASG             RR RGHSR +EL+R+VN HGRI IEIDE+V KPVC  AT FS AIGTI R+T
Subjt:  MVHPSQDHDVVVVLEEENALEVKRSTAQHRTRASG-------------RRARGHSRRIELERYVNAHGRIPIEIDEKVDKPVCTKATTFSGAIGTITRDT

Query:  IPLHYKTWSDVPKQVRDSIKDRLSLKMQ
        IPL  K WSDV K+VRD + D+L + ++
Subjt:  IPLHYKTWSDVPKQVRDSIKDRLSLKMQ

XP_038887413.1 uncharacterized protein LOC120077557 isoform X5 [Benincasa hispida], e value: 7.4e-27, % identity: 29.46
Query:  MVHPSQDHDVVVVLEEENALEVKRSTAQHRTRASG-------------RRARGHSRRIELERYVNAHGRIPIEIDEKVDKPVCTKATTFSGAIGTITRDT
        + H  Q+ +  ++L+ ++   +  ST +  T ASG             RR RGHSR +EL+R+VN HGRI IEIDE+V KPVC  AT FS AIGTI R+T
Subjt:  MVHPSQDHDVVVVLEEENALEVKRSTAQHRTRASG-------------RRARGHSRRIELERYVNAHGRIPIEIDEKVDKPVCTKATTFSGAIGTITRDT

Query:  IPLHYKTWSDVPKQVRDSIKDRL-----------------------------------------------------------------------------
        IPL  K WSDV K+VRD + D+L                                                                             
Subjt:  IPLHYKTWSDVPKQVRDSIKDRL-----------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------SLKMQQLLEASSQE
                                                                                               L+MQ+L+EAS QE
Subjt:  --------------------------------------------------------------------------------------SLKMQQLLEASSQE

Query:  GSEPISQSEVCKMVLGTRSGHIKGLGWDPNSSSSSSVTSSSQHEKELEKKVEHMQAEIGN--------------LTTKLSSWEERWTEFTKYMDERQG-E
           P+S  EVCK VLG RSG+IKGLG +P  SSSSSVTS  Q +KELEKK+E M+ E+                LT++LS WE RW E    +   QG +
Subjt:  GSEPISQSEVCKMVLGTRSGHIKGLGWDPNSSSSSSVTSSSQHEKELEKKVEHMQAEIGN--------------LTTKLSSWEERWTEFTKYMDERQG-E

Query:  GSSN
        G SN
Subjt:  GSSN

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T4Q4 CACTA en-spm transposon protein, e value: 6.6e-13, % identity: 35.59
Query:  STAQHRTRASGRRARGHSRRIELERYVNAHGRIPIEIDEKVDKPVCTKATTFSGAIGTITRDTIPL----HYKTWSDVPKQVRDSIKDRL-------SLK
        S++Q  T    RRA+  SR +ELER+V  +GRI + I  + +KP+   A  FS AIG   R T P+    H++ +SD P++ R ++ + L          
Subjt:  STAQHRTRASGRRARGHSRRIELERYVNAHGRIPIEIDEKVDKPVCTKATTFSGAIGTITRDTIPL----HYKTWSDVPKQVRDSIKDRL-------SLK

Query:  MQQLLEASSQ---EGSEPISQSEVCKMVLGTRSGHIKGLGWDPN-------SSSSSSVTSSSQHEKELEKKVEHMQA
            +  + Q   EGS+P S+ E+C  VLG R G+ KGLGW P        S+S+SS + S   +KE+E +V+  +A
Subjt:  MQQLLEASSQ---EGSEPISQSEVCKMVLGTRSGHIKGLGWDPN-------SSSSSSVTSSSQHEKELEKKVEHMQA

A0A5A7TEQ7 CACTA en-spm transposon protein, e value: 3.3e-12, % identity: 36.81
Query:  RGHSRRIELERYVNAHGRIPIEIDEKVDKPVCTKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRLSLKMQQLLEASSQEGSEPISQSEVCKMV
        R  SR +EL+ YV+A+GRI + I   V+KP+   A  FS AIG   R+T  +    W+D+    R  I              +  +GS+P S+ E+C+ +
Subjt:  RGHSRRIELERYVNAHGRIPIEIDEKVDKPVCTKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRLSLKMQQLLEASSQEGSEPISQSEVCKMV

Query:  LGTRSGHIKGLGWDPNSSSSSSV------TSSSQHEKELEKKVE
        LG R G+ KGLGW P S S          TS SQ   EL+ +VE
Subjt:  LGTRSGHIKGLGWDPNSSSSSSV------TSSSQHEKELEKKVE

A0A5A7TT86 CACTA en-spm transposon protein, e value: 2.5e-12, % identity: 34.01
Query:  STAQHRTRASGRRARGHSRRIELERYVNAHGRIPIEIDEKVDKPVCTKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIK-------------DRLS
        S++Q  T    RRA+  SR +ELE YV  + RIP+ I    +KP+   A  FS  IG   R T P     W+DV ++  + +K             DR+ 
Subjt:  STAQHRTRASGRRARGHSRRIELERYVNAHGRIPIEIDEKVDKPVCTKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIK-------------DRLS

Query:  LKMQ------------------QLLEASSQ---EGSEPISQSEVCKMVLGTRSGHIKGLGWDPN-------SSSSSSVTSSSQHEKELEKKVEHMQA
        L  +                  Q+LE  SQ   +GS+P S+ E+C  VLG R G+ KGLGW P        S+SSSS + S   EKE+E + +  +A
Subjt:  LKMQ------------------QLLEASSQ---EGSEPISQSEVCKMVLGTRSGHIKGLGWDPN-------SSSSSSVTSSSQHEKELEKKVEHMQA

A0A5D3BKN7 DUF4218 domain-containing protein, e value: 3.3e-12, % identity: 31.66
Query:  KRSTAQHRTRASGRRARGHSRRIELERYVNAHGRIPIEIDEKVDKPVCTKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRLSLKMQ-------
        +R+ +Q  T+   R  RG+ R IEL+++V  HG+I IEI+E+  KPV T A      IGT  R+TIPL  + W  VP  VR  + DRL  K +       
Subjt:  KRSTAQHRTRASGRRARGHSRRIELERYVNAHGRIPIEIDEKVDKPVCTKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRLSLKMQ-------

Query:  ------------------QLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGWDPNSSSSSSVTSSSQHEKELEKKVEHMQAEIGNLTTKLSSWEERW
                          +    S++ G + IS ++ C+ VLG+RS        +P S  S     SS  EKE + ++ +++     LT +L+ WE+ +
Subjt:  ------------------QLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGWDPNSSSSSSVTSSSQHEKELEKKVEHMQAEIGNLTTKLSSWEERW

A0A6J1DUH3 uncharacterized protein LOC111023212, e value: 2.3e-18, % identity: 45.97
Query:  VRDSIKDRLSLKMQQLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGWDPNSS-----SSSSVTSSSQHEKELEKKVEHMQAEIGNLTTK-------L
        V D+ +D     MQ L++A +QEG EP++Q E C+ VLG R  H+KGLG+ P  +     SSS+VTSS+ +EKELEKKVE M+ E+  + T+       +
Subjt:  VRDSIKDRLSLKMQQLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGWDPNSS-----SSSSVTSSSQHEKELEKKVEHMQAEIGNLTTK-------L

Query:  SSWEERWTEFTKYMDERQGEGSSN
        S+WE+RW E +++M  RQG+G SN
Subjt:  SSWEERWTEFTKYMDERQGEGSSN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGTGCATCCAAGTCAAGATCATGATGTAGTTGTTGTGTTAGAAGAGGAGAATGCTTTGGAGGTTAAACGTTCGACAGCGCAACATCGTACACGAGCTTCAGGCAGGAG
GGCTAGAGGGCATAGCCGAAGGATTGAGTTAGAGCGCTATGTCAATGCACATGGTAGAATACCCATCGAGATCGATGAGAAGGTCGACAAACCAGTGTGTACTAAGGCCA
CTACGTTCAGTGGAGCCATTGGTACCATCACCCGAGATACAATTCCACTGCATTATAAAACGTGGAGCGACGTCCCAAAGCAAGTTCGAGACAGCATAAAAGATCGACTC
TCTTTGAAAATGCAACAACTTCTTGAAGCATCATCACAAGAAGGATCTGAGCCAATCTCACAGTCAGAAGTTTGTAAAATGGTTTTGGGTACTCGATCAGGCCACATAAA
AGGTCTTGGTTGGGACCCAAATTCTAGTTCGTCGTCTAGCGTCACATCTTCTTCCCAACATGAAAAAGAGCTTGAAAAGAAGGTGGAGCATATGCAAGCTGAGATTGGTA
ACTTAACGACGAAGTTGTCCTCATGGGAAGAAAGATGGACTGAATTCACAAAGTACATGGATGAAAGGCAGGGTGAAGGTTCTTCAAACCCCTAG
mRNA sequence
ATGGTGCATCCAAGTCAAGATCATGATGTAGTTGTTGTGTTAGAAGAGGAGAATGCTTTGGAGGTTAAACGTTCGACAGCGCAACATCGTACACGAGCTTCAGGCAGGAG
GGCTAGAGGGCATAGCCGAAGGATTGAGTTAGAGCGCTATGTCAATGCACATGGTAGAATACCCATCGAGATCGATGAGAAGGTCGACAAACCAGTGTGTACTAAGGCCA
CTACGTTCAGTGGAGCCATTGGTACCATCACCCGAGATACAATTCCACTGCATTATAAAACGTGGAGCGACGTCCCAAAGCAAGTTCGAGACAGCATAAAAGATCGACTC
TCTTTGAAAATGCAACAACTTCTTGAAGCATCATCACAAGAAGGATCTGAGCCAATCTCACAGTCAGAAGTTTGTAAAATGGTTTTGGGTACTCGATCAGGCCACATAAA
AGGTCTTGGTTGGGACCCAAATTCTAGTTCGTCGTCTAGCGTCACATCTTCTTCCCAACATGAAAAAGAGCTTGAAAAGAAGGTGGAGCATATGCAAGCTGAGATTGGTA
ACTTAACGACGAAGTTGTCCTCATGGGAAGAAAGATGGACTGAATTCACAAAGTACATGGATGAAAGGCAGGGTGAAGGTTCTTCAAACCCCTAG
Protein sequence
MVHPSQDHDVVVVLEEENALEVKRSTAQHRTRASGRRARGHSRRIELERYVNAHGRIPIEIDEKVDKPVCTKATTFSGAIGTITRDTIPLHYKTWSDVPKQVRDSIKDRL
SLKMQQLLEASSQEGSEPISQSEVCKMVLGTRSGHIKGLGWDPNSSSSSSVTSSSQHEKELEKKVEHMQAEIGNLTTKLSSWEERWTEFTKYMDERQGEGSSNP
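
The CDS and protein sequences above are related by codon-wise translation. The sketch below illustrates this on the first nine and last three codons of the CDS. It assumes the standard genetic code (NCBI translation table 1), which is the usual choice for a nuclear plant gene but is not stated on the page. `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1) in the compact
# TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon (stop shown as '*')."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 27 nt and last 9 nt of the CDS listed above.
print(translate("ATGGTGCATCCAAGTCAAGATCATGAT"))  # prints: MVHPSQDHD
print(translate("AACCCCTAG"))                    # prints: NP*
```

Pasting the full CDS into `translate` and stripping the trailing `*` gives a string that can be compared against the listed protein sequence as a consistency check.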