; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019087 (gene) of Snake gourd v1 genome

Gene IDTan0019087
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG04:40440235..40441347
RNA-Seq ExpressionTan0019087
SyntenyTan0019087
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]8.9e-3955.23Show/hide
Query:  PKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKSTRVVDKVG---------------PSQELRMPRRSGRDVRQ-------
        PK+ +    +DPQE++V VSTNATFLEEDHMR+H+PRS++VLSE     TD+STRVVD+VG               PSQ LRMPRRSGR V Q       
Subjt:  PKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKSTRVVDKVG---------------PSQELRMPRRSGRDVRQ-------

Query:  -----LWPLHG---------AMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA
             + P  G         AMND+D+D+W+KAMDLEMESMYFNS  +LVD P+G +PIGCKWIYKRKRD A
Subjt:  -----LWPLHG---------AMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]8.9e-3955.23Show/hide
Query:  PKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKSTRVVDKVG---------------PSQELRMPRRSGRDVRQ-------
        PK+ +    +DPQE++V VSTNATFLEEDHMR+H+PRS++VLSE     TD+STRVVD+VG               PSQ LRMPRRSGR V Q       
Subjt:  PKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKSTRVVDKVG---------------PSQELRMPRRSGRDVRQ-------

Query:  -----LWPLHG---------AMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA
             + P  G         AMND+D+D+W+KAMDLEMESMYFNS  +LVD P+G +PIGCKWIYKRKRD A
Subjt:  -----LWPLHG---------AMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA

TYK15984.1 gag/pol protein [Cucumis melo var. makuwa]8.9e-3955.23Show/hide
Query:  PKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKSTRVVDKVG---------------PSQELRMPRRSGRDVRQ-------
        PK+ +    +DPQE++V VSTNATFLEEDHMR+H+PRS++VLSE     TD+STRVVD+VG               PSQ LRMPRRSGR V Q       
Subjt:  PKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKSTRVVDKVG---------------PSQELRMPRRSGRDVRQ-------

Query:  -----LWPLHG---------AMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA
             + P  G         AMND+D+D+W+KAMDLEMESMYFNS  +LVD P+G +PIGCKWIYKRKRD A
Subjt:  -----LWPLHG---------AMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA

TYK20422.1 gag/pol protein [Cucumis melo var. makuwa]2.3e-3955.81Show/hide
Query:  PKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKSTRVVDKVG---------------PSQELRMPRRSGRDVRQ-------
        PK+ +    +DPQE++V VSTNATFLEEDHMRDH+PRS++VL+E     TD+STRVVDKVG               PSQ LRMPRRSGR V Q       
Subjt:  PKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKSTRVVDKVG---------------PSQELRMPRRSGRDVRQ-------

Query:  -----LWPLHG---------AMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA
             + P  G         AMND+D+D+W+KAMDLEMESMYFNS  +LVD P+G +PIGCKWIYKRKRD A
Subjt:  -----LWPLHG---------AMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA

TYK29165.1 gag/pol protein [Cucumis melo var. makuwa]6.8e-3950.76Show/hide
Query:  MTKRPFSEKVIEPKSP----WNSCIGPLWSYECQGTRR------DTPKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKST
        MTKRPF+ K    K P     +   GP+ +   +G  +      D  ++ K    +DPQE++V VSTNATFLEEDHMRDH+PRS++VL+E     T  ST
Subjt:  MTKRPFSEKVIEPKSP----WNSCIGPLWSYECQGTRR------DTPKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKST

Query:  RVVDKVGP---------------SQELRMPRRSGRDVRQLWPLHGAMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA
        RVVD+VGP               SQ LRMPRR GR          AMND+D+D+W+KAMDLEMESMYFNS  +LVD P+  +PIGCKWIYKRKRD A
Subjt:  RVVDKVGP---------------SQELRMPRRSGRDVRQLWPLHGAMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA

TrEMBL top hitse value%identityAlignment
A0A5A7TZD0 Gag/pol protein4.3e-3955.23Show/hide
Query:  PKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKSTRVVDKVG---------------PSQELRMPRRSGRDVRQ-------
        PK+ +    +DPQE++V VSTNATFLEEDHMR+H+PRS++VLSE     TD+STRVVD+VG               PSQ LRMPRRSGR V Q       
Subjt:  PKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKSTRVVDKVG---------------PSQELRMPRRSGRDVRQ-------

Query:  -----LWPLHG---------AMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA
             + P  G         AMND+D+D+W+KAMDLEMESMYFNS  +LVD P+G +PIGCKWIYKRKRD A
Subjt:  -----LWPLHG---------AMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA

A0A5D3BUN8 Gag/pol protein4.3e-3955.23Show/hide
Query:  PKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKSTRVVDKVG---------------PSQELRMPRRSGRDVRQ-------
        PK+ +    +DPQE++V VSTNATFLEEDHMR+H+PRS++VLSE     TD+STRVVD+VG               PSQ LRMPRRSGR V Q       
Subjt:  PKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKSTRVVDKVG---------------PSQELRMPRRSGRDVRQ-------

Query:  -----LWPLHG---------AMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA
             + P  G         AMND+D+D+W+KAMDLEMESMYFNS  +LVD P+G +PIGCKWIYKRKRD A
Subjt:  -----LWPLHG---------AMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA

A0A5D3CYF4 Gag/pol protein4.3e-3955.23Show/hide
Query:  PKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKSTRVVDKVG---------------PSQELRMPRRSGRDVRQ-------
        PK+ +    +DPQE++V VSTNATFLEEDHMR+H+PRS++VLSE     TD+STRVVD+VG               PSQ LRMPRRSGR V Q       
Subjt:  PKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKSTRVVDKVG---------------PSQELRMPRRSGRDVRQ-------

Query:  -----LWPLHG---------AMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA
             + P  G         AMND+D+D+W+KAMDLEMESMYFNS  +LVD P+G +PIGCKWIYKRKRD A
Subjt:  -----LWPLHG---------AMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA

A0A5D3DA25 Gag/pol protein1.1e-3955.81Show/hide
Query:  PKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKSTRVVDKVG---------------PSQELRMPRRSGRDVRQ-------
        PK+ +    +DPQE++V VSTNATFLEEDHMRDH+PRS++VL+E     TD+STRVVDKVG               PSQ LRMPRRSGR V Q       
Subjt:  PKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKSTRVVDKVG---------------PSQELRMPRRSGRDVRQ-------

Query:  -----LWPLHG---------AMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA
             + P  G         AMND+D+D+W+KAMDLEMESMYFNS  +LVD P+G +PIGCKWIYKRKRD A
Subjt:  -----LWPLHG---------AMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA

A0A5D3E133 Gag/pol protein3.3e-3950.76Show/hide
Query:  MTKRPFSEKVIEPKSP----WNSCIGPLWSYECQGTRR------DTPKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKST
        MTKRPF+ K    K P     +   GP+ +   +G  +      D  ++ K    +DPQE++V VSTNATFLEEDHMRDH+PRS++VL+E     T  ST
Subjt:  MTKRPFSEKVIEPKSP----WNSCIGPLWSYECQGTRR------DTPKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKST

Query:  RVVDKVGP---------------SQELRMPRRSGRDVRQLWPLHGAMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA
        RVVD+VGP               SQ LRMPRR GR          AMND+D+D+W+KAMDLEMESMYFNS  +LVD P+  +PIGCKWIYKRKRD A
Subjt:  RVVDKVGP---------------SQELRMPRRSGRDVRQLWPLHGAMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQA

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-0640Show/hide
Query:  DVRQLWPLHGAMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRD
        D R+   L   ++  ++++ +KAM  EMES+  N   KLV+ P G+RP+ CKW++K K+D
Subjt:  DVRQLWPLHGAMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRD

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTAAAAGACCTTTTTCCGAAAAGGTTATAGAGCCAAAGAGCCCTTGGAACTCGTGCATTGGACCTTTGTGGTCCTATGAATGTCAAGGCACGAGGAGGGATACTCC
AAAGAAACGAAAGGTGGTCTGCATTTATGATCCTCAAGAGGACAAGGTGCTTGTGTCGACAAATGCCACGTTCTTAGAGGAAGACCACATGAGAGATCATCAACCTCGTA
GTAGGATTGTCTTAAGTGAAATTTCCGGGAAGCTTACGGATAAATCAACAAGAGTTGTTGATAAAGTTGGTCCTTCTCAAGAGTTGAGAATGCCTCGGCGTAGTGGGAGG
GATGTTAGACAACTTTGGCCGTTACATGGGGCAATGAATGATATAGATAGGGACCGATGGATTAAAGCCATGGACCTTGAAATGGAGTCAATGTACTTCAATTCACCTGC
GAAACTTGTAGATCAACCGGATGGTGAAAGACCTATCGGTTGCAAGTGGATCTACAAGAGGAAACGAGATCAAGCGGTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTAAAAGACCTTTTTCCGAAAAGGTTATAGAGCCAAAGAGCCCTTGGAACTCGTGCATTGGACCTTTGTGGTCCTATGAATGTCAAGGCACGAGGAGGGATACTCC
AAAGAAACGAAAGGTGGTCTGCATTTATGATCCTCAAGAGGACAAGGTGCTTGTGTCGACAAATGCCACGTTCTTAGAGGAAGACCACATGAGAGATCATCAACCTCGTA
GTAGGATTGTCTTAAGTGAAATTTCCGGGAAGCTTACGGATAAATCAACAAGAGTTGTTGATAAAGTTGGTCCTTCTCAAGAGTTGAGAATGCCTCGGCGTAGTGGGAGG
GATGTTAGACAACTTTGGCCGTTACATGGGGCAATGAATGATATAGATAGGGACCGATGGATTAAAGCCATGGACCTTGAAATGGAGTCAATGTACTTCAATTCACCTGC
GAAACTTGTAGATCAACCGGATGGTGAAAGACCTATCGGTTGCAAGTGGATCTACAAGAGGAAACGAGATCAAGCGGTGTGA
Protein sequenceShow/hide protein sequence
MTKRPFSEKVIEPKSPWNSCIGPLWSYECQGTRRDTPKKRKVVCIYDPQEDKVLVSTNATFLEEDHMRDHQPRSRIVLSEISGKLTDKSTRVVDKVGPSQELRMPRRSGR
DVRQLWPLHGAMNDIDRDRWIKAMDLEMESMYFNSPAKLVDQPDGERPIGCKWIYKRKRDQAV