CuGenDBv2

Tan0014371 (gene) of Snake gourd v1 genome

Gene ID: Tan0014371
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Unknown protein
Genome location: LG11:33950545..33952807
RNA-Seq Expression: Tan0014371
Synteny: Tan0014371
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG4141194.1 hypothetical protein ERO13_D06G060950v2 [Gossypium hirsutum] (e value: 4.6e-20; % identity: 38.06)
Query:  VLVRGISIDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGARWIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWML
        V++R  ++  S ++IN ++ LPD+    +  M+ N +   L   L ++   G++WI+   G+ S +  YL   ANVWL F+    +PT+H ST+S E ML
Subjt:  VLVRGISIDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGARWIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWML

Query:  LTYAILKVLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLCLIAGVDVDPSEQR
        L YAIL   SI+VG II +EI  + KK  G  + P+ IT LCL A V    + +R
Subjt:  LTYAILKVLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLCLIAGVDVDPSEQR

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] (e value: 4.2e-21; % identity: 41.03)
Query:  VLVRGISIDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGARWIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWML
        V VRG+ + WS EAIN ++ L D P   ++  + N +   L   L  + + GA W VSA G  +   + LT  A VW  FLK+ LLPTTH  TVS + ML
Subjt:  VLVRGISIDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGARWIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWML

Query:  LTYAILKVLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLCLIAGVDVDPSEQRL
        L +++L   SI+VGR+I  EI+  A +  G LF P+ IT+LC  A      +E++L
Subjt:  LTYAILKVLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLCLIAGVDVDPSEQRL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] (e value: 1.4e-21; % identity: 41.67)
Query:  VLVRGISIDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGARWIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWML
        V VRG+ + WS EAIN ++ L D P   ++  + N +   L   L  +   GA W VSA G  +   + LT  A VW  FLK+RLLPTTH  TVS + ML
Subjt:  VLVRGISIDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGARWIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWML

Query:  LTYAILKVLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLCLIAGVDVDPSEQRL
        L +++L   SI+VGR+I  EI+  A +  G LF P+ IT+LC  A      +E++L
Subjt:  LTYAILKVLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLCLIAGVDVDPSEQRL

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] (e value: 2.1e-20; % identity: 42.25)
Query:  VLVRGISIDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGARWIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWML
        + VRG+ + WS EAIN ++ L D P   ++  + N +   L   L  +   GA W VSA G  +   + LT  A VW  FLK+RLLPTTH   VS + ML
Subjt:  VLVRGISIDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGARWIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWML

Query:  LTYAILKVLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLC
        L +++L   SI+VGR+I  EI+  A +  G LF P+ IT+LC
Subjt:  LTYAILKVLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLC

TYH88163.1 hypothetical protein ES332_D01G168900v1 [Gossypium tomentosum] (e value: 2.7e-20; % identity: 31.67)
Query:  RKEQPSHVRSPQTPPHFITEQARERF-AILQSKELFLERGF----------------------------ATPFAD--LPHFIKENIINHGVFRVLVRGIS
        RK   S   +P+  P  I E+ +ERF +I + + +  E+GF                            A  F+D  L      ++       V+VR   
Subjt:  RKEQPSHVRSPQTPPHFITEQARERF-AILQSKELFLERGF----------------------------ATPFAD--LPHFIKENIINHGVFRVLVRGIS

Query:  IDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGARWIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWMLLTYAILK
        +  + ++IN ++ LPD+    Y  M+ N +   L   L ++   G++WI+   G+ S +  YL   ANVW  F++   +P +H  T+S E MLL YAIL 
Subjt:  IDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGARWIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWMLLTYAILK

Query:  VLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLCLIAGV
          SI+VG+II +EI   AKK AG ++ P+ IT LCL A V
Subjt:  VLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLCLIAGV

TrEMBL top hits (e value, % identity, alignment)
A0A2P5AGA5 Uncharacterized protein (Fragment) (e value: 2.0e-21; % identity: 41.03)
Query:  VLVRGISIDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGARWIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWML
        V VRG+ + WS EAIN ++ L D P   ++  + N +   L   L  + + GA W VSA G  +   + LT  A VW  FLK+ LLPTTH  TVS + ML
Subjt:  VLVRGISIDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGARWIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWML

Query:  LTYAILKVLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLCLIAGVDVDPSEQRL
        L +++L   SI+VGR+I  EI+  A +  G LF P+ IT+LC  A      +E++L
Subjt:  LTYAILKVLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLCLIAGVDVDPSEQRL

A0A2P5BCG4 Uncharacterized protein (Fragment) (e value: 6.9e-22; % identity: 41.67)
Query:  VLVRGISIDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGARWIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWML
        V VRG+ + WS EAIN ++ L D P   ++  + N +   L   L  +   GA W VSA G  +   + LT  A VW  FLK+RLLPTTH  TVS + ML
Subjt:  VLVRGISIDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGARWIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWML

Query:  LTYAILKVLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLCLIAGVDVDPSEQRL
        L +++L   SI+VGR+I  EI+  A +  G LF P+ IT+LC  A      +E++L
Subjt:  LTYAILKVLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLCLIAGVDVDPSEQRL

A0A2P5DXM3 Uncharacterized protein (e value: 1.0e-20; % identity: 42.25)
Query:  VLVRGISIDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGARWIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWML
        + VRG+ + WS EAIN ++ L D P   ++  + N +   L   L  +   GA W VSA G  +   + LT  A VW  FLK+RLLPTTH   VS + ML
Subjt:  VLVRGISIDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGARWIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWML

Query:  LTYAILKVLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLC
        L +++L   SI+VGR+I  EI+  A +  G LF P+ IT+LC
Subjt:  LTYAILKVLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLC

A0A5D2MA47 Uncharacterized protein (e value: 1.3e-20; % identity: 31.67)
Query:  RKEQPSHVRSPQTPPHFITEQARERF-AILQSKELFLERGF----------------------------ATPFAD--LPHFIKENIINHGVFRVLVRGIS
        RK   S   +P+  P  I E+ +ERF +I + + +  E+GF                            A  F+D  L      ++       V+VR   
Subjt:  RKEQPSHVRSPQTPPHFITEQARERF-AILQSKELFLERGF----------------------------ATPFAD--LPHFIKENIINHGVFRVLVRGIS

Query:  IDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGARWIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWMLLTYAILK
        +  + ++IN ++ LPD+    Y  M+ N +   L   L ++   G++WI+   G+ S +  YL   ANVW  F++   +P +H  T+S E MLL YAIL 
Subjt:  IDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGARWIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWMLLTYAILK

Query:  VLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLCLIAGV
          SI+VG+II +EI   AKK AG ++ P+ IT LCL A V
Subjt:  VLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLCLIAGV

A0A803NKF3 Uncharacterized protein (e value: 3.4e-21; % identity: 26.08)
Query:  MTVLLSKYNVKHKFSTPYHPQANGQAEISNREIKKILERVVNPSRKDCLLQL-CYLPLLHLRHRG------------KPKQRVKRRVPKLQHLQSLPL--
        +  LL+KY VKHK +  YHPQ NGQA+ISNRE+  ILE+VVNPS KD   +L   L    L H+                +  K R+ ++  L+      
Subjt:  MTVLLSKYNVKHKFSTPYHPQANGQAEISNREIKKILERVVNPSRKDCLLQL-CYLPLLHLRHRG------------KPKQRVKRRVPKLQHLQSLPL--

Query:  ---------NWRKG-----TLLRDHPSNRPSSKRKEQPSHVRSPQTPPHFIT------------EQARERFAILQSK--------ELFLERGFATPFAD-
                  WR G      L+++      +S+ K  P  ++S  + P  +T            E+    F   ++K        + ++ERG      D 
Subjt:  ---------NWRKG-----TLLRDHPSNRPSSKRKEQPSHVRSPQTPPHFIT------------EQARERFAILQSK--------ELFLERGFATPFAD-

Query:  --LPHFIKENI-INHGVF---------------------------RVLVRGISIDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGAR
          LP  + E + + H                               V+VR + + +S +AIN ++ L  +   +Y  +    S   L   +  +    + 
Subjt:  --LPHFIKENI-INHGVF---------------------------RVLVRGISIDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGAR

Query:  WIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWMLLTYAILKVLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLCLIAGVDV
        W +  +G   F+      +     +F++  L PT+HD TVS +   + + I K + +DVG ++A++I   A +  GKLF P+TIT LC  AGV +
Subjt:  WIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWMLLTYAILKVLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLCLIAGVDV

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGACTGTACTCTTGAGTAAATATAATGTCAAACACAAGTTCTCAACTCCATATCACCCGCAGGCCAATGGACAAGCTGAAATATCTAATAGGGAAATCAAGAAGATTTT
GGAAAGAGTAGTAAATCCCAGCAGGAAGGATTGCCTACTCCAGCTCTGCTACCTCCCCCTCCTCCACCTAAGGCATCGAGGCAAACCAAAGCAAAGAGTAAAAAGAAGAG
TGCCAAAGCTTCAGCACCTGCAAAGCCTACCCCTCAATTGGCGGAAGGGGACATTGTTGAGGGACCACCCGTCGAATCGCCCGTCCTCTAAGAGGAAAGAACAACCTTCC
CATGTCCGCTCCCCTCAGACTCCACCACATTTCATCACCGAGCAGGCTCGCGAACGGTTTGCCATATTACAATCCAAGGAGCTCTTTTTGGAACGAGGTTTTGCTACTCC
TTTCGCCGATCTCCCTCATTTTATTAAGGAAAACATCATAAATCATGGTGTGTTTCGTGTCCTTGTTCGTGGGATATCGATAGATTGGTCTCCTGAAGCCATCAACAAGA
TGTATGAGCTACCCGACATTCCTCGGGCCAACTACAATAGGATGGTGTTGAATCCTTCCACTACTCATCTAAATGCAGCCCTGCGGCTGATAGGATTGAAAGGTGCTCGT
TGGATTGTCTCCGCCACCGGTACGAGGTCTTTTCAAGCAGCTTACTTGACGGATGAGGCGAATGTGTGGCTTTCCTTCCTCAAGAACAGACTACTTCCAACAACCCATGA
TTCTACTGTCTCTAGTGAATGGATGTTGCTGACTTATGCAATCTTGAAGGTTCTGAGCATTGATGTGGGACGCATTATTGCGGAGGAAATTCAAACTCATGCCAAGAAAA
CTGCTGGCAAACTCTTTTCCCCTAACACTATCACTAAGCTTTGTTTGATTGCAGGTGTTGATGTTGATCCTTCCGAGCAGCGGTTGACCCGACTCTTCCATGTTTGA
mRNA sequence
ATGACTGTACTCTTGAGTAAATATAATGTCAAACACAAGTTCTCAACTCCATATCACCCGCAGGCCAATGGACAAGCTGAAATATCTAATAGGGAAATCAAGAAGATTTT
GGAAAGAGTAGTAAATCCCAGCAGGAAGGATTGCCTACTCCAGCTCTGCTACCTCCCCCTCCTCCACCTAAGGCATCGAGGCAAACCAAAGCAAAGAGTAAAAAGAAGAG
TGCCAAAGCTTCAGCACCTGCAAAGCCTACCCCTCAATTGGCGGAAGGGGACATTGTTGAGGGACCACCCGTCGAATCGCCCGTCCTCTAAGAGGAAAGAACAACCTTCC
CATGTCCGCTCCCCTCAGACTCCACCACATTTCATCACCGAGCAGGCTCGCGAACGGTTTGCCATATTACAATCCAAGGAGCTCTTTTTGGAACGAGGTTTTGCTACTCC
TTTCGCCGATCTCCCTCATTTTATTAAGGAAAACATCATAAATCATGGTGTGTTTCGTGTCCTTGTTCGTGGGATATCGATAGATTGGTCTCCTGAAGCCATCAACAAGA
TGTATGAGCTACCCGACATTCCTCGGGCCAACTACAATAGGATGGTGTTGAATCCTTCCACTACTCATCTAAATGCAGCCCTGCGGCTGATAGGATTGAAAGGTGCTCGT
TGGATTGTCTCCGCCACCGGTACGAGGTCTTTTCAAGCAGCTTACTTGACGGATGAGGCGAATGTGTGGCTTTCCTTCCTCAAGAACAGACTACTTCCAACAACCCATGA
TTCTACTGTCTCTAGTGAATGGATGTTGCTGACTTATGCAATCTTGAAGGTTCTGAGCATTGATGTGGGACGCATTATTGCGGAGGAAATTCAAACTCATGCCAAGAAAA
CTGCTGGCAAACTCTTTTCCCCTAACACTATCACTAAGCTTTGTTTGATTGCAGGTGTTGATGTTGATCCTTCCGAGCAGCGGTTGACCCGACTCTTCCATGTTTGA
Protein sequence
MTVLLSKYNVKHKFSTPYHPQANGQAEISNREIKKILERVVNPSRKDCLLQLCYLPLLHLRHRGKPKQRVKRRVPKLQHLQSLPLNWRKGTLLRDHPSNRPSSKRKEQPS
HVRSPQTPPHFITEQARERFAILQSKELFLERGFATPFADLPHFIKENIINHGVFRVLVRGISIDWSPEAINKMYELPDIPRANYNRMVLNPSTTHLNAALRLIGLKGAR
WIVSATGTRSFQAAYLTDEANVWLSFLKNRLLPTTHDSTVSSEWMLLTYAILKVLSIDVGRIIAEEIQTHAKKTAGKLFSPNTITKLCLIAGVDVDPSEQRLTRLFHV
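
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, not part of the database record: it assumes the standard genetic code, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for illustration.

```python
from itertools import product

# Standard genetic code (assumed here), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 13 codons and last 9 codons of the CDS listed above.
cds_start = "ATGACTGTACTCTTGAGTAAATATAATGTCAAACACAAG"
cds_end = "CGGTTGACCCGACTCTTCCATGTTTGA"
print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS (with its line breaks joined) to `translate` and comparing the result with the protein sequence is the intended use; the two short fragments only keep the example compact.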