; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000930 (gene) of Snake gourd v1 genome

Gene IDTan0000930
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG02:13574033..13574620
RNA-Seq ExpressionTan0000930
SyntenyTan0000930
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.8e-4955.24Show/hide
Query:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRK---------------------------------GYSKETKGDMFYDSQEDKVH
        MVRSM+SYAQL SSFWGYAVE A +ILN VPSKSVSETP+ELW+GRK                                 GY KET+G +F+D QE++V 
Subjt:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRK---------------------------------GYSKETKGDMFYDSQEDKVH

Query:  VSTNATFLEEDHMRDHQPRSIIVLSEISKEATDKSTKVVDQVG---------------PSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTY
        VSTNATFLEEDHMR+H+PRS +VLS    EATD+ST+VVD+VG               PSQ LR+ R SG+ V Q +RY+GL E QVVI DDGV+DPL+Y
Subjt:  VSTNATFLEEDHMRDHQPRSIIVLSEISKEATDKSTKVVDQVG---------------PSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTY

Query:  KHAMNDIDRD
        K AMND+D+D
Subjt:  KHAMNDIDRD

KAA0046206.1 gag/pol protein [Cucumis melo var. makuwa]5.9e-5366.86Show/hide
Query:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRKGYSKETKGDMFYDSQEDKVHVSTNATFLEEDHMRDHQPRSIIVLSEISKEATD
        MVRSM+S+AQL +SFWGYA+E A YILN V SKSVSETPYELWKGRKGY KE+KG +FYD QE+KV VSTNATFL+EDH+R+HQ RS +VL EISK  TD
Subjt:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRKGYSKETKGDMFYDSQEDKVHVSTNATFLEEDHMRDHQPRSIIVLSEISKEATD

Query:  K---STKVVD-------QVGPSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTYKHAMNDIDRD
        +   STKVVD       Q  PSQ L   R SG+ V Q DRY+GL EAQ++I DDG++DPLTYK AMND+D D
Subjt:  K---STKVVD-------QVGPSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTYKHAMNDIDRD

KAA0048693.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-4964.88Show/hide
Query:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRKGYSKETKGDMFYDSQEDKVHVSTNATFLEEDHMRDHQPRSIIVLSEISKEATD
        MVR M+S+AQL  SFWGYA+E A YILN VPSKSVSETPYELWKGRKGY KE+KG +FYD QE+KV VSTNA FLEE+H+ +HQ  S +VL EISK  TD
Subjt:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRKGYSKETKGDMFYDSQEDKVHVSTNATFLEEDHMRDHQPRSIIVLSEISKEATD

Query:  K---STKVVD------QVGPSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTYKHAMNDI
        K   STKV+D      Q  PSQ L   R S + V Q DRY+GL EAQ++I +DG++DPLTYKHAMND+
Subjt:  K---STKVVD------QVGPSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTYKHAMNDI

KAA0049915.1 gag/pol protein [Cucumis melo var. makuwa]4.7e-5055.24Show/hide
Query:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRK---------------------------------GYSKETKGDMFYDSQEDKVH
        MVRSM+SYAQL SSFWGYAVE A +ILN VPSKSVSETP+ELW+GRK                                 GY KET+G +F+D QE++V 
Subjt:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRK---------------------------------GYSKETKGDMFYDSQEDKVH

Query:  VSTNATFLEEDHMRDHQPRSIIVLSEISKEATDKSTKVVDQVG---------------PSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTY
        VSTNATFLEEDHMRDH+PRS +VL+E+    TD+ST+VVD+VG               PSQ LR+ R SG+ V Q +RY+GL E QVVI DDGV+DPL+Y
Subjt:  VSTNATFLEEDHMRDHQPRSIIVLSEISKEATDKSTKVVDQVG---------------PSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTY

Query:  KHAMNDIDRD
        KHAMND+D+D
Subjt:  KHAMNDIDRD

TYK14918.1 gag/pol protein [Cucumis melo var. makuwa]4.3e-5166.07Show/hide
Query:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRKGYSKETKGDMFYDSQEDKVHVSTNATFLEEDHMRDHQPRSIIVLSEISKEATD
        MV  M+S+AQL  SFWGYA+E A YILN VPSKSVSETPYELWKGRKGY KE+KG +FYD QE+KV VSTNA FLEEDH+R+HQ RS +VL EISK  TD
Subjt:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRKGYSKETKGDMFYDSQEDKVHVSTNATFLEEDHMRDHQPRSIIVLSEISKEATD

Query:  K---STKVVD------QVGPSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTYKHAMNDI
        K   STKV+D      Q  PSQ L   R SG+ V Q DRY+GL EAQ++I +DG++DPLTYK AMND+
Subjt:  K---STKVVD------QVGPSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTYKHAMNDI

TrEMBL top hitse value%identityAlignment
A0A5A7TYM5 Gag/pol protein5.1e-5064.88Show/hide
Query:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRKGYSKETKGDMFYDSQEDKVHVSTNATFLEEDHMRDHQPRSIIVLSEISKEATD
        MVR M+S+AQL  SFWGYA+E A YILN VPSKSVSETPYELWKGRKGY KE+KG +FYD QE+KV VSTNA FLEE+H+ +HQ  S +VL EISK  TD
Subjt:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRKGYSKETKGDMFYDSQEDKVHVSTNATFLEEDHMRDHQPRSIIVLSEISKEATD

Query:  K---STKVVD------QVGPSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTYKHAMNDI
        K   STKV+D      Q  PSQ L   R S + V Q DRY+GL EAQ++I +DG++DPLTYKHAMND+
Subjt:  K---STKVVD------QVGPSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTYKHAMNDI

A0A5A7TZD0 Gag/pol protein8.7e-5055.24Show/hide
Query:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRK---------------------------------GYSKETKGDMFYDSQEDKVH
        MVRSM+SYAQL SSFWGYAVE A +ILN VPSKSVSETP+ELW+GRK                                 GY KET+G +F+D QE++V 
Subjt:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRK---------------------------------GYSKETKGDMFYDSQEDKVH

Query:  VSTNATFLEEDHMRDHQPRSIIVLSEISKEATDKSTKVVDQVG---------------PSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTY
        VSTNATFLEEDHMR+H+PRS +VLS    EATD+ST+VVD+VG               PSQ LR+ R SG+ V Q +RY+GL E QVVI DDGV+DPL+Y
Subjt:  VSTNATFLEEDHMRDHQPRSIIVLSEISKEATDKSTKVVDQVG---------------PSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTY

Query:  KHAMNDIDRD
        K AMND+D+D
Subjt:  KHAMNDIDRD

A0A5A7U8G7 Gag/pol protein2.3e-5055.24Show/hide
Query:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRK---------------------------------GYSKETKGDMFYDSQEDKVH
        MVRSM+SYAQL SSFWGYAVE A +ILN VPSKSVSETP+ELW+GRK                                 GY KET+G +F+D QE++V 
Subjt:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRK---------------------------------GYSKETKGDMFYDSQEDKVH

Query:  VSTNATFLEEDHMRDHQPRSIIVLSEISKEATDKSTKVVDQVG---------------PSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTY
        VSTNATFLEEDHMRDH+PRS +VL+E+    TD+ST+VVD+VG               PSQ LR+ R SG+ V Q +RY+GL E QVVI DDGV+DPL+Y
Subjt:  VSTNATFLEEDHMRDHQPRSIIVLSEISKEATDKSTKVVDQVG---------------PSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTY

Query:  KHAMNDIDRD
        KHAMND+D+D
Subjt:  KHAMNDIDRD

A0A5D3CS08 Gag/pol protein2.9e-5366.86Show/hide
Query:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRKGYSKETKGDMFYDSQEDKVHVSTNATFLEEDHMRDHQPRSIIVLSEISKEATD
        MVRSM+S+AQL +SFWGYA+E A YILN V SKSVSETPYELWKGRKGY KE+KG +FYD QE+KV VSTNATFL+EDH+R+HQ RS +VL EISK  TD
Subjt:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRKGYSKETKGDMFYDSQEDKVHVSTNATFLEEDHMRDHQPRSIIVLSEISKEATD

Query:  K---STKVVD-------QVGPSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTYKHAMNDIDRD
        +   STKVVD       Q  PSQ L   R SG+ V Q DRY+GL EAQ++I DDG++DPLTYK AMND+D D
Subjt:  K---STKVVD-------QVGPSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTYKHAMNDIDRD

A0A5D3CUL5 Gag/pol protein2.1e-5166.07Show/hide
Query:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRKGYSKETKGDMFYDSQEDKVHVSTNATFLEEDHMRDHQPRSIIVLSEISKEATD
        MV  M+S+AQL  SFWGYA+E A YILN VPSKSVSETPYELWKGRKGY KE+KG +FYD QE+KV VSTNA FLEEDH+R+HQ RS +VL EISK  TD
Subjt:  MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRKGYSKETKGDMFYDSQEDKVHVSTNATFLEEDHMRDHQPRSIIVLSEISKEATD

Query:  K---STKVVD------QVGPSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTYKHAMNDI
        K   STKV+D      Q  PSQ L   R SG+ V Q DRY+GL EAQ++I +DG++DPLTYK AMND+
Subjt:  K---STKVVD------QVGPSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTYKHAMNDI

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-0645.45Show/hide
Query:  RSMISYAQLRSSFWGYAVEAAAYILNMVPSKSV---SETPYELWKGRKGYSKETK
        R+M+S A+L  SFWG AV  A Y++N +PS+++   S+TPYE+W  +K Y K  +
Subjt:  RSMISYAQLRSSFWGYAVEAAAYILNMVPSKSV---SETPYELWKGRKGYSKETK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGATCCATGATAAGCTATGCTCAGCTGCGTAGTTCGTTTTGGGGATATGCAGTAGAAGCTGCTGCATATATTTTGAACATGGTTCCCTCTAAGAGTGTTTCAGA
AACACCCTATGAGTTATGGAAAGGGCGTAAAGGATACTCCAAAGAAACGAAAGGTGATATGTTTTACGATTCTCAAGAAGACAAGGTGCATGTGTCGACAAACGCCACGT
TCTTAGAGGAAGACCACATGAGAGATCATCAGCCTCGTAGCATAATTGTCTTGAGCGAAATTTCCAAGGAAGCTACAGATAAATCAACAAAAGTTGTTGATCAAGTTGGT
CCTTCTCAACATTTGAGAATATCTCGACATAGTGGGAAGGATGTTGGACAACTCGACCGTTACATGGGTTTGATTGAAGCCCAGGTCGTCATACTTGATGATGGAGTTAA
GGATCCATTGACCTATAAACATGCAATGAATGACATAGACAGGGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTCGATCCATGATAAGCTATGCTCAGCTGCGTAGTTCGTTTTGGGGATATGCAGTAGAAGCTGCTGCATATATTTTGAACATGGTTCCCTCTAAGAGTGTTTCAGA
AACACCCTATGAGTTATGGAAAGGGCGTAAAGGATACTCCAAAGAAACGAAAGGTGATATGTTTTACGATTCTCAAGAAGACAAGGTGCATGTGTCGACAAACGCCACGT
TCTTAGAGGAAGACCACATGAGAGATCATCAGCCTCGTAGCATAATTGTCTTGAGCGAAATTTCCAAGGAAGCTACAGATAAATCAACAAAAGTTGTTGATCAAGTTGGT
CCTTCTCAACATTTGAGAATATCTCGACATAGTGGGAAGGATGTTGGACAACTCGACCGTTACATGGGTTTGATTGAAGCCCAGGTCGTCATACTTGATGATGGAGTTAA
GGATCCATTGACCTATAAACATGCAATGAATGACATAGACAGGGACTAG
Protein sequenceShow/hide protein sequence
MVRSMISYAQLRSSFWGYAVEAAAYILNMVPSKSVSETPYELWKGRKGYSKETKGDMFYDSQEDKVHVSTNATFLEEDHMRDHQPRSIIVLSEISKEATDKSTKVVDQVG
PSQHLRISRHSGKDVGQLDRYMGLIEAQVVILDDGVKDPLTYKHAMNDIDRD