; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011822 (gene) of Snake gourd v1 genome

Gene IDTan0011822
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTransposase
Genome locationLG04:47087092..47089303
RNA-Seq ExpressionTan0011822
SyntenyTan0011822
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649224.1 hypothetical protein Csa_014966 [Cucumis sativus]9.7e-4247.62Show/hide
Query:  VSTMYTSDAQFSTVHGVPLGVENVRV-------------------------------------VVASPAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRY
        +  M+ SD Q  T+HG+PLG +N+RV                                     V+ +  K    ++    T+ S+KYTD HVTIKLLNRY
Subjt:  VSTMYTSDAQFSTVHGVPLGVENVRV-------------------------------------VVASPAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRY

Query:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD
        A+ +MQ  D IQ+ L+EH+FG+EK IYL  DDI+ YCGM EIGYSCI+ YIA LW  CD EI  +F+LVDQ TIS+  KSQE R  NL +RLEM N  LD
Subjt:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD

Query:  QQVFIPYNTG
        Q V IPYNTG
Subjt:  QQVFIPYNTG

XP_008451868.1 PREDICTED: uncharacterized protein LOC103493028 isoform X1 [Cucumis melo]2.5e-4551.9Show/hide
Query:  VSTMYTSDAQFSTVHGVPLGVENVRVV--------VASPAKHKSDV-----------------------------SVHVPTSHSTKYTDAHVTIKLLNRY
        V  M+ SD Q  T+HG+PLG EN+RV         VA P   K D+                             +    T+ S+KYTD HVTIKLLNRY
Subjt:  VSTMYTSDAQFSTVHGVPLGVENVRVV--------VASPAKHKSDV-----------------------------SVHVPTSHSTKYTDAHVTIKLLNRY

Query:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD
        AM +MQ +D IQ+ LSEH+FG+EK IYL RDDI+ YCGM EIGYSCI+ YIA LW VC+ EI  +F+LVDQ TIS+  KSQE R  NL NRLEM N  LD
Subjt:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD

Query:  QQVFIPYNTG
        Q V IPYNTG
Subjt:  QQVFIPYNTG

XP_016901190.1 PREDICTED: uncharacterized protein LOC103493028 isoform X2 [Cucumis melo]2.5e-4551.9Show/hide
Query:  VSTMYTSDAQFSTVHGVPLGVENVRVV--------VASPAKHKSDV-----------------------------SVHVPTSHSTKYTDAHVTIKLLNRY
        V  M+ SD Q  T+HG+PLG EN+RV         VA P   K D+                             +    T+ S+KYTD HVTIKLLNRY
Subjt:  VSTMYTSDAQFSTVHGVPLGVENVRVV--------VASPAKHKSDV-----------------------------SVHVPTSHSTKYTDAHVTIKLLNRY

Query:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD
        AM +MQ +D IQ+ LSEH+FG+EK IYL RDDI+ YCGM EIGYSCI+ YIA LW VC+ EI  +F+LVDQ TIS+  KSQE R  NL NRLEM N  LD
Subjt:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD

Query:  QQVFIPYNTG
        Q V IPYNTG
Subjt:  QQVFIPYNTG

XP_031740251.1 uncharacterized protein LOC101213947 [Cucumis sativus]9.7e-4247.62Show/hide
Query:  VSTMYTSDAQFSTVHGVPLGVENVRV-------------------------------------VVASPAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRY
        +  M+ SD Q  T+HG+PLG +N+RV                                     V+ +  K    ++    T+ S+KYTD HVTIKLLNRY
Subjt:  VSTMYTSDAQFSTVHGVPLGVENVRV-------------------------------------VVASPAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRY

Query:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD
        A+ +MQ  D IQ+ L+EH+FG+EK IYL  DDI+ YCGM EIGYSCI+ YIA LW  CD EI  +F+LVDQ TIS+  KSQE R  NL +RLEM N  LD
Subjt:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD

Query:  QQVFIPYNTG
        Q V IPYNTG
Subjt:  QQVFIPYNTG

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]3.7e-4150.95Show/hide
Query:  VSTMYTSDAQFSTVHGVPLGVENVRVV--------VASPAKHKSDV------------------------SVHVPT-----SHSTKYTDAHVTIKLLNRY
        V TM+ SDAQ  +++ +PLG +NVR +        VA P   K  +                            PT     + S+KYTD HVTIKLLNRY
Subjt:  VSTMYTSDAQFSTVHGVPLGVENVRVV--------VASPAKHKSDV------------------------SVHVPT-----SHSTKYTDAHVTIKLLNRY

Query:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD
        AM SMQ DD IQ+ LSE + G+EK IYL RDDI+ YCGM EIGYSCI+AYIA LW  CD EI  KF++VDQ TIS+  K QE R  NL NRLEMV+  LD
Subjt:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD

Query:  QQVFIPYNTG
        Q V IPYNTG
Subjt:  QQVFIPYNTG

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X11.2e-4551.9Show/hide
Query:  VSTMYTSDAQFSTVHGVPLGVENVRVV--------VASPAKHKSDV-----------------------------SVHVPTSHSTKYTDAHVTIKLLNRY
        V  M+ SD Q  T+HG+PLG EN+RV         VA P   K D+                             +    T+ S+KYTD HVTIKLLNRY
Subjt:  VSTMYTSDAQFSTVHGVPLGVENVRVV--------VASPAKHKSDV-----------------------------SVHVPTSHSTKYTDAHVTIKLLNRY

Query:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD
        AM +MQ +D IQ+ LSEH+FG+EK IYL RDDI+ YCGM EIGYSCI+ YIA LW VC+ EI  +F+LVDQ TIS+  KSQE R  NL NRLEM N  LD
Subjt:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD

Query:  QQVFIPYNTG
        Q V IPYNTG
Subjt:  QQVFIPYNTG

A0A1S4DZN2 uncharacterized protein LOC103493028 isoform X21.2e-4551.9Show/hide
Query:  VSTMYTSDAQFSTVHGVPLGVENVRVV--------VASPAKHKSDV-----------------------------SVHVPTSHSTKYTDAHVTIKLLNRY
        V  M+ SD Q  T+HG+PLG EN+RV         VA P   K D+                             +    T+ S+KYTD HVTIKLLNRY
Subjt:  VSTMYTSDAQFSTVHGVPLGVENVRVV--------VASPAKHKSDV-----------------------------SVHVPTSHSTKYTDAHVTIKLLNRY

Query:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD
        AM +MQ +D IQ+ LSEH+FG+EK IYL RDDI+ YCGM EIGYSCI+ YIA LW VC+ EI  +F+LVDQ TIS+  KSQE R  NL NRLEM N  LD
Subjt:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD

Query:  QQVFIPYNTG
        Q V IPYNTG
Subjt:  QQVFIPYNTG

A0A5D3CYL9 ULP_PROTEASE domain-containing protein1.2e-4551.9Show/hide
Query:  VSTMYTSDAQFSTVHGVPLGVENVRVV--------VASPAKHKSDV-----------------------------SVHVPTSHSTKYTDAHVTIKLLNRY
        V  M+ SD Q  T+HG+PLG EN+RV         VA P   K D+                             +    T+ S+KYTD HVTIKLLNRY
Subjt:  VSTMYTSDAQFSTVHGVPLGVENVRVV--------VASPAKHKSDV-----------------------------SVHVPTSHSTKYTDAHVTIKLLNRY

Query:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD
        AM +MQ +D IQ+ LSEH+FG+EK IYL RDDI+ YCGM EIGYSCI+ YIA LW VC+ EI  +F+LVDQ TIS+  KSQE R  NL NRLEM N  LD
Subjt:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD

Query:  QQVFIPYNTG
        Q V IPYNTG
Subjt:  QQVFIPYNTG

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X42.0e-4048.1Show/hide
Query:  VSTMYTSDAQFSTVHGVPLGVENVRV-------------------------------------VVASPAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRY
        V T++ ++ Q  TVHGVPLGV+NVRV                                     V+ S  K+ S        +  +K+TD HV+IKLLNRY
Subjt:  VSTMYTSDAQFSTVHGVPLGVENVRV-------------------------------------VVASPAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRY

Query:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD
         MLSMQ +DT+++ LS+ +FG+EK IYL R+DI+ YC M+EIGYSCI+ YIAYLW V +YEI  KFL+VD  TIS + KSQE R  NLANRLEMVN  L+
Subjt:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD

Query:  QQVFIPYNTG
        Q V IPY +G
Subjt:  QQVFIPYNTG

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X41.3e-0445.76Show/hide
Query:  IKAKVLFPQELKDKIFDSVE---ELGDESSTRATLWIQARKGKNNEYFDEATKQCVGRI
        I+AK L+   +  K + ++    +L  + S RA LW +ARKGKNNEYFD+AT++C  RI
Subjt:  IKAKVLFPQELKDKIFDSVE---ELGDESSTRATLWIQARKGKNNEYFDEATKQCVGRI

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X42.0e-4048.1Show/hide
Query:  VSTMYTSDAQFSTVHGVPLGVENVRV-------------------------------------VVASPAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRY
        V T++ ++ Q  TVHGVPLGV+NVRV                                     V+ S  K+ S        +  +K+TD HV+IKLLNRY
Subjt:  VSTMYTSDAQFSTVHGVPLGVENVRV-------------------------------------VVASPAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRY

Query:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD
         MLSMQ +DT+++ LS+ +FG+EK IYL R+DI+ YC M+EIGYSCI+ YIAYLW V +YEI  KFL+VD  TIS + KSQE R  NLANRLEMVN  L+
Subjt:  AMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITISNFFKSQETRCINLANRLEMVNLDLD

Query:  QQVFIPYNTG
        Q V IPY +G
Subjt:  QQVFIPYNTG

A0A6J1C398 uncharacterized protein LOC111007859 isoform X31.3e-0445.76Show/hide
Query:  IKAKVLFPQELKDKIFDSVE---ELGDESSTRATLWIQARKGKNNEYFDEATKQCVGRI
        I+AK L+   +  K + ++    +L  + S RA LW +ARKGKNNEYFD+AT++C  RI
Subjt:  IKAKVLFPQELKDKIFDSVE---ELGDESSTRATLWIQARKGKNNEYFDEATKQCVGRI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATCGATCCAAATGGAGGTTAGGCATACTAATCGACGTGGTCTCACTACTATGTAACGCTTGGTCATCCAATACAACAATCAAGGCCAAAGTGTTGTTCCCCCAAGA
ATTGAAAGATAAAATTTTTGATTCTGTAGAGGAATTGGGGGATGAGTCTTCCACTCGTGCCACCTTATGGATACAAGCACGAAAAGGAAAAAATAATGAATACTTCGATG
AAGCCACCAAACAATGTGTTGGTCGAATCGTAAGCACAATGTACACGTCTGACGCTCAATTTTCCACAGTCCATGGAGTTCCCTTAGGAGTCGAAAATGTTAGAGTGGTA
GTGGCTTCTCCAGCAAAACATAAGTCGGATGTTTCTGTACATGTGCCTACTTCACATTCTACAAAGTACACAGATGCTCATGTGACTATTAAGCTTCTGAATCGTTATGC
AATGTTATCGATGCAAGAAGATGATACGATTCAAGTCAAGTTGAGCGAGCACATGTTCGGGGAGGAGAAGTTAATTTATTTACATCGCGATGATATCCTGCATTACTGTG
GGATGGTGGAGATAGGGTACTCCTGCATAGTTGCATACATTGCGTATCTTTGGACTGTATGTGACTATGAAATAATCGTCAAGTTCTTGCTAGTTGATCAAATAACCATT
TCTAATTTTTTTAAAAGTCAAGAAACACGTTGTATAAATCTGGCTAACAGGTTAGAAATGGTTAATTTGGACTTGGATCAACAAGTTTTCATCCCATATAATACTGGGTA
A
mRNA sequenceShow/hide mRNA sequence
ATGTATCGATCCAAATGGAGGTTAGGCATACTAATCGACGTGGTCTCACTACTATGTAACGCTTGGTCATCCAATACAACAATCAAGGCCAAAGTGTTGTTCCCCCAAGA
ATTGAAAGATAAAATTTTTGATTCTGTAGAGGAATTGGGGGATGAGTCTTCCACTCGTGCCACCTTATGGATACAAGCACGAAAAGGAAAAAATAATGAATACTTCGATG
AAGCCACCAAACAATGTGTTGGTCGAATCGTAAGCACAATGTACACGTCTGACGCTCAATTTTCCACAGTCCATGGAGTTCCCTTAGGAGTCGAAAATGTTAGAGTGGTA
GTGGCTTCTCCAGCAAAACATAAGTCGGATGTTTCTGTACATGTGCCTACTTCACATTCTACAAAGTACACAGATGCTCATGTGACTATTAAGCTTCTGAATCGTTATGC
AATGTTATCGATGCAAGAAGATGATACGATTCAAGTCAAGTTGAGCGAGCACATGTTCGGGGAGGAGAAGTTAATTTATTTACATCGCGATGATATCCTGCATTACTGTG
GGATGGTGGAGATAGGGTACTCCTGCATAGTTGCATACATTGCGTATCTTTGGACTGTATGTGACTATGAAATAATCGTCAAGTTCTTGCTAGTTGATCAAATAACCATT
TCTAATTTTTTTAAAAGTCAAGAAACACGTTGTATAAATCTGGCTAACAGGTTAGAAATGGTTAATTTGGACTTGGATCAACAAGTTTTCATCCCATATAATACTGGGTA
A
Protein sequenceShow/hide protein sequence
MYRSKWRLGILIDVVSLLCNAWSSNTTIKAKVLFPQELKDKIFDSVEELGDESSTRATLWIQARKGKNNEYFDEATKQCVGRIVSTMYTSDAQFSTVHGVPLGVENVRVV
VASPAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIVKFLLVDQITI
SNFFKSQETRCINLANRLEMVNLDLDQQVFIPYNTG