; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006013 (gene) of Snake gourd v1 genome

Gene IDTan0006013
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationLG11:61771499..61773327
RNA-Seq ExpressionTan0006013
SyntenyTan0006013
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0044237 - cellular metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0005488 - binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.2e-4342.25Show/hide
Query:  IDKDFVSSSLPKFINPGSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSDRLS--NPKYDYWIRQDNLIIAWFLGSMSMA
        ++    SS + +    G+K +  +L++D +L+WKFQ+L  L  + L+ +++ + + PSK+L ST  S++      NP Y  W RQD LI +W LGSMS  
Subjt:  IDKDFVSSSLPKFINPGSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSDRLS--NPKYDYWIRQDNLIIAWFLGSMSMA

Query:  IVSELLEYTTSREIWKHLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPS
        I++++L   +++EIW+ L   FSS+++A+ ++ K KL  +KKG++ L+EYF KI   VDALA   KP+S DDH+L+I +GLG +Y   ISV++   D+PS
Subjt:  IVSELLEYTTSREIWKHLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPS

Query:  LQRVYSMLLTQES
        +Q V S+LLTQES
Subjt:  LQRVYSMLLTQES

KAA0053143.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]1.3e-4243.65Show/hide
Query:  GSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSD--RLSNPKYDYWIRQDNLIIAWFLGSMSMAIVSELLEYTTSREIWK
        G+K +  +L +DN+L+WKFQ+L  L  + L+ + + + + PSK+L ST  S++   R  NP+Y  W R + LI  W LGSMS  I+++++   +++EIW 
Subjt:  GSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSD--RLSNPKYDYWIRQDNLIIAWFLGSMSMAIVSELLEYTTSREIWK

Query:  HLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPSLQRVYSMLLTQES
         L   FSS+++A+ ++ K KL  +KKG++ L+EYF KI+  VDALA   KP+S DDH+L+I  GLG +Y   IS+++   D+PS+Q V S+LLTQES
Subjt:  HLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPSLQRVYSMLLTQES

KAA0067213.1 keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]3.7e-3739.8Show/hide
Query:  IDKDFVSSSLPKFINPGSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSD--RLSNPKYDYWIRQDNLIIAWFLGSMSMA
        ++    SS + +     +K +  +L +DN+L+WKFQ+L  L  + L+ +++ + + PSK+L ST  S++   R  NP Y  W RQD LI +W LGSMS  
Subjt:  IDKDFVSSSLPKFINPGSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSD--RLSNPKYDYWIRQDNLIIAWFLGSMSMA

Query:  IVSELLEYTTSREIWKHLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPS
        I++++L   +++EIW  L   FSS+++A+ ++ K KL  +KK ++ L+EYF KI+  VDALA   KP+S DDH+L+I +GLG +Y   ISV+    ++PS
Subjt:  IVSELLEYTTSREIWKHLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPS

Query:  L
        +
Subjt:  L

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.2e-4342.25Show/hide
Query:  IDKDFVSSSLPKFINPGSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSDRLS--NPKYDYWIRQDNLIIAWFLGSMSMA
        ++    SS + +    G+K +  +L++D +L+WKFQ+L  L  + L+ +++ + + PSK+L ST  S++      NP Y  W RQD LI +W LGSMS  
Subjt:  IDKDFVSSSLPKFINPGSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSDRLS--NPKYDYWIRQDNLIIAWFLGSMSMA

Query:  IVSELLEYTTSREIWKHLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPS
        I++++L   +++EIW+ L   FSS+++A+ ++ K KL  +KKG++ L+EYF KI   VDALA   KP+S DDH+L+I +GLG +Y   ISV++   D+PS
Subjt:  IVSELLEYTTSREIWKHLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPS

Query:  LQRVYSMLLTQES
        +Q V S+LLTQES
Subjt:  LQRVYSMLLTQES

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]1.1e-4950.25Show/hide
Query:  KFINPGSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAE--STSDRLSNPKYDYWIRQDNLIIAWFLGSMSMAIVSELLEYTTS
        K INPGSK +  +L++DN L+WKFQ+   L+G+GL+ YID + D P++F+ +T +  S+S    NP Y  WI+QD LI AW LGSM+  I+S++L+  ++
Subjt:  KFINPGSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAE--STSDRLSNPKYDYWIRQDNLIIAWFLGSMSMAIVSELLEYTTS

Query:  REIWKHLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPSLQRVYSMLLTQ
        REIW  L   F+S+ +ARV++LK KLE  KKGNL L++YF KIK +VD+LA+AGK +S +DH++HI +GLG E+D  ISV+T  +   +LQ V S+LL Q
Subjt:  REIWKHLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPSLQRVYSMLLTQ

Query:  E
        E
Subjt:  E

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-945.8e-4442.25Show/hide
Query:  IDKDFVSSSLPKFINPGSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSDRLS--NPKYDYWIRQDNLIIAWFLGSMSMA
        ++    SS + +    G+K +  +L++D +L+WKFQ+L  L  + L+ +++ + + PSK+L ST  S++      NP Y  W RQD LI +W LGSMS  
Subjt:  IDKDFVSSSLPKFINPGSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSDRLS--NPKYDYWIRQDNLIIAWFLGSMSMA

Query:  IVSELLEYTTSREIWKHLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPS
        I++++L   +++EIW+ L   FSS+++A+ ++ K KL  +KKG++ L+EYF KI   VDALA   KP+S DDH+L+I +GLG +Y   ISV++   D+PS
Subjt:  IVSELLEYTTSREIWKHLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPS

Query:  LQRVYSMLLTQES
        +Q V S+LLTQES
Subjt:  LQRVYSMLLTQES

A0A5A7UB21 Keratin, type II cytoskeletal 1-like6.4e-4343.65Show/hide
Query:  GSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSD--RLSNPKYDYWIRQDNLIIAWFLGSMSMAIVSELLEYTTSREIWK
        G+K +  +L +DN+L+WKFQ+L  L  + L+ + + + + PSK+L ST  S++   R  NP+Y  W R + LI  W LGSMS  I+++++   +++EIW 
Subjt:  GSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSD--RLSNPKYDYWIRQDNLIIAWFLGSMSMAIVSELLEYTTSREIWK

Query:  HLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPSLQRVYSMLLTQES
         L   FSS+++A+ ++ K KL  +KKG++ L+EYF KI+  VDALA   KP+S DDH+L+I  GLG +Y   IS+++   D+PS+Q V S+LLTQES
Subjt:  HLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPSLQRVYSMLLTQES

A0A5A7VGJ8 Keratin, type II cytoskeletal 1-like1.8e-3739.8Show/hide
Query:  IDKDFVSSSLPKFINPGSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSD--RLSNPKYDYWIRQDNLIIAWFLGSMSMA
        ++    SS + +     +K +  +L +DN+L+WKFQ+L  L  + L+ +++ + + PSK+L ST  S++   R  NP Y  W RQD LI +W LGSMS  
Subjt:  IDKDFVSSSLPKFINPGSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSD--RLSNPKYDYWIRQDNLIIAWFLGSMSMA

Query:  IVSELLEYTTSREIWKHLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPS
        I++++L   +++EIW  L   FSS+++A+ ++ K KL  +KK ++ L+EYF KI+  VDALA   KP+S DDH+L+I +GLG +Y   ISV+    ++PS
Subjt:  IVSELLEYTTSREIWKHLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPS

Query:  L
        +
Subjt:  L

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-945.8e-4442.25Show/hide
Query:  IDKDFVSSSLPKFINPGSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSDRLS--NPKYDYWIRQDNLIIAWFLGSMSMA
        ++    SS + +    G+K +  +L++D +L+WKFQ+L  L  + L+ +++ + + PSK+L ST  S++      NP Y  W RQD LI +W LGSMS  
Subjt:  IDKDFVSSSLPKFINPGSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSDRLS--NPKYDYWIRQDNLIIAWFLGSMSMA

Query:  IVSELLEYTTSREIWKHLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPS
        I++++L   +++EIW+ L   FSS+++A+ ++ K KL  +KKG++ L+EYF KI   VDALA   KP+S DDH+L+I +GLG +Y   ISV++   D+PS
Subjt:  IVSELLEYTTSREIWKHLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPS

Query:  LQRVYSMLLTQES
        +Q V S+LLTQES
Subjt:  LQRVYSMLLTQES

A0A6J1DLT9 uncharacterized protein LOC1110217575.4e-5050.25Show/hide
Query:  KFINPGSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAE--STSDRLSNPKYDYWIRQDNLIIAWFLGSMSMAIVSELLEYTTS
        K INPGSK +  +L++DN L+WKFQ+   L+G+GL+ YID + D P++F+ +T +  S+S    NP Y  WI+QD LI AW LGSM+  I+S++L+  ++
Subjt:  KFINPGSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAE--STSDRLSNPKYDYWIRQDNLIIAWFLGSMSMAIVSELLEYTTS

Query:  REIWKHLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPSLQRVYSMLLTQ
        REIW  L   F+S+ +ARV++LK KLE  KKGNL L++YF KIK +VD+LA+AGK +S +DH++HI +GLG E+D  ISV+T  +   +LQ V S+LL Q
Subjt:  REIWKHLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPSLQRVYSMLLTQ

Query:  E
        E
Subjt:  E

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.3e-2331.38Show/hide
Query:  QLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSDRLSNPKYDYWIRQDNLIIAWFLGSMSMAIVSELLEYTTSREIWKHLATRFSSK
        +L   NYL+W  QV     G+ L  ++D    +P    P+T  + +    NP Y  W RQD LI +  LG++SM++   +   TT+ +IW+ L   +++ 
Subjt:  QLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSDRLSNPKYDYWIRQDNLIIAWFLGSMSMAIVSELLEYTTSREIWKHLATRFSSK

Query:  HVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPSLQRVYSMLLTQES
            V +L+T+L+   KG   + +Y   +    D LA+ GKP+ HD+ V  +   L  EY P I  +   D  P+L  ++  LL  ES
Subjt:  HVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPSLQRVYSMLLTQES

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-1728.95Show/hide
Query:  QLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSDRLSNPKYDYWIRQDNLIIAWFLGSMSMAIVSELLEYTTSREIWKHLATRFSSK
        +L   NYL+W  QV     G+ L  ++D    +P    P+T  + +    NP Y  W RQD LI +  LG++SM++   +   TT+ +IW+ L   +++ 
Subjt:  QLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSDRLSNPKYDYWIRQDNLIIAWFLGSMSMAIVSELLEYTTSREIWKHLATRFSSK

Query:  HVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPSLQRVYSMLLTQESCL
            V +L+               + T+     D LA+ GKP+ HD+ V  +   L  +Y P I  +   D  PSL  ++  L+ +ES L
Subjt:  HVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPSLQRVYSMLLTQESCL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATCTTCCATCATTGATAAAGATTTTGTATCCTCTTCGCTTCCGAAGTTCATCAATCCAGGGAGCAAGACTGCTACGCCACAGTTAGATGAAGACAATTACCTGGT
GTGGAAGTTTCAGGTCTTAATTACTCTTCGTGGCCATGGATTGCAGAAATATATTGATGAAGATTATGATGTTCCTTCGAAGTTTCTTCCCTCGACTGCAGAATCGACCT
CTGATCGACTTTCAAATCCAAAGTATGATTACTGGATTCGCCAAGACAACCTCATAATTGCTTGGTTCTTAGGTTCGATGTCTATGGCGATTGTTTCTGAATTGCTTGAA
TACACAACCTCTCGTGAGATTTGGAAGCATCTGGCTACTCGTTTTTCATCGAAGCATGTTGCTAGGGTTTTGGAATTGAAGACCAAATTGGAATTGATGAAGAAAGGAAA
TCTAGGGCTTCAGGAATATTTTACGAAAATTAAGGGTGTAGTTGATGCTTTGGCTGTTGCTGGCAAGCCTATTTCTCACGATGATCATGTGCTACATATTTTTTCTGGTC
TTGGCCTGGAGTATGATCCGACGATTTCGGTTCTCACTGGTGGTGATGACACTCCTTCATTGCAGCGGGTGTATTCTATGCTTTTGACTCAGGAGAGTTGTCTTCACCGT
CACTCGTCCGTTCATTATTAA
mRNA sequenceShow/hide mRNA sequence
ATCAGGTCAATTGTTTTAAGACCATTTCCTAGTTCTTCTCTCTCATTCTCTTTCATGGTATCAAAGACTCTTAGGTTTCACAATCGTTCGAGATCCTGATTTTTTTTCCG
ATGGAATCTTCCATCATTGATAAAGATTTTGTATCCTCTTCGCTTCCGAAGTTCATCAATCCAGGGAGCAAGACTGCTACGCCACAGTTAGATGAAGACAATTACCTGGT
GTGGAAGTTTCAGGTCTTAATTACTCTTCGTGGCCATGGATTGCAGAAATATATTGATGAAGATTATGATGTTCCTTCGAAGTTTCTTCCCTCGACTGCAGAATCGACCT
CTGATCGACTTTCAAATCCAAAGTATGATTACTGGATTCGCCAAGACAACCTCATAATTGCTTGGTTCTTAGGTTCGATGTCTATGGCGATTGTTTCTGAATTGCTTGAA
TACACAACCTCTCGTGAGATTTGGAAGCATCTGGCTACTCGTTTTTCATCGAAGCATGTTGCTAGGGTTTTGGAATTGAAGACCAAATTGGAATTGATGAAGAAAGGAAA
TCTAGGGCTTCAGGAATATTTTACGAAAATTAAGGGTGTAGTTGATGCTTTGGCTGTTGCTGGCAAGCCTATTTCTCACGATGATCATGTGCTACATATTTTTTCTGGTC
TTGGCCTGGAGTATGATCCGACGATTTCGGTTCTCACTGGTGGTGATGACACTCCTTCATTGCAGCGGGTGTATTCTATGCTTTTGACTCAGGAGAGTTGTCTTCACCGT
CACTCGTCCGTTCATTATTAATCGATGGCACATCGCCTCTGTTAATTTGACGATTCGAGTGCTCACAAATCTTCGCAAGCTCCTCCTTATCCTTCGCCTAGCGGCAATGA
TAGTAATCGAAAGGGGAAGAATCAACATCATCAGAATAATAGTGGTAATTGGCAAAATCGTCGTCCTTGGAACAATAATGGTGGTAAGCCTCAATGTCAGTTATGTGGTC
GTTTTGGTCACACTGCCCTGCGTTGTTACTTTCGATTTGAGCGCTGGTTTCAAGGACCAAATTCTACTCCCTCTGGTTCTCATTCGTTTGGTGTGAATAATGCTCCTGCT
CTTTTGCCGCCACCTCAACCTCCGTATCAAGCTTACACTTTGCAACATGACATGAACAGGGAGAATCAGTGGTATCCTGATTTAGGGGCCTCCAATCATGTTACTCCTGA
TTTGTCCAATTTGTCTTTTGGAAATGAGTATAATGGCGATAACAAGGTTCATGTAGGCAATGGTACTGGTTTGCCTATTCAAAACATTGGACATTTCAACTGACAAACTT
CTTCTCCAAGGTCAATTGGTTGATGGACTTTATAGGTTCACTTTGGCAAAAGCGGATCATCCTTCCTCTTATCTTTCCTCTGATGAGTGTTCCCTTCCCTCTACTTCTCA
TGTTAATTCTATTACTCTGTCTTCTGATATGAATTCTTGTACTGTTCCTGCTGTTGTTAAACCATATACTTTACATGATTTGTGGCATAATAGGCAAGGGATTCTGCTCA
TTCCATTGTTCAGCAAGTTCTTAATCGTTGTAATATTCCTGTTCATTCAAATAAAACTCATCGTTTATGTCATGCTTGTTCTATTGGGAAAGCTCACAATTTGCCTTTTT
CTGATTCAACCACTGTTTATGATGCTC
Protein sequenceShow/hide protein sequence
MESSIIDKDFVSSSLPKFINPGSKTATPQLDEDNYLVWKFQVLITLRGHGLQKYIDEDYDVPSKFLPSTAESTSDRLSNPKYDYWIRQDNLIIAWFLGSMSMAIVSELLE
YTTSREIWKHLATRFSSKHVARVLELKTKLELMKKGNLGLQEYFTKIKGVVDALAVAGKPISHDDHVLHIFSGLGLEYDPTISVLTGGDDTPSLQRVYSMLLTQESCLHR
HSSVHY