; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012148 (gene) of Snake gourd v1 genome

Gene IDTan0012148
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG07:54953920..54955106
RNA-Seq ExpressionTan0012148
SyntenyTan0012148
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]5.6e-5977.92Show/hide
Query:  TGYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR
        TGY DS    DKDS     G VFTLN G VVWRS+KQGCIADSTMEAEYVAACEAAKE VWLRKF+ +LEVVPNM L IT+YCDNS AVANS+EPRSHKR
Subjt:  TGYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR

Query:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQ
         KHIE+KYHLI EIV RGDVIVTKIASEHN+ DPFTK LTAKVFE  LE LGL+
Subjt:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQ

KAA0042496.1 gag/pol protein [Cucumis melo var. makuwa]5.6e-5977.92Show/hide
Query:  TGYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR
        TGY DS    DKDS     G VFTLN G VVWRS+KQGCIADSTMEAEYVAACEAAKE VWLRKF+ +LEVVPNM L IT+YCDNS AVANS+EPRSHKR
Subjt:  TGYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR

Query:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQ
         KHIE+KYHLI EIV RGDVIVTKIASEHN+ DPFTK LTAKVFE  LE LGL+
Subjt:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQ

KAA0058279.1 gag/pol protein [Cucumis melo var. makuwa]8.7e-6078.57Show/hide
Query:  TGYIDSRLSIDKDS-LNLHRGVFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR
        TGY DS+   DKDS  +  R VFTLN G VVWRS+KQGCIADSTMEAEYVAACEAAKE VWLRKF+ +LEVVPNM L IT+YCDNS AVANS+EPRSHKR
Subjt:  TGYIDSRLSIDKDS-LNLHRGVFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR

Query:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQ
         KHIE+KYHLIWEIV RGDVIVTKIASEHN+ DPFTK LTAKVFE  LE LGL+
Subjt:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQ

KAA0061170.1 gag/pol protein [Cucumis melo var. makuwa]3.3e-5977.92Show/hide
Query:  TGYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR
        TGY DS    DKDS     G VFTLN+G VVWRS+KQGCIADSTMEAEYVAACEAAKE VWLRKF+ +LEVVPNM L IT+YCDNS AVANS+EPRSHKR
Subjt:  TGYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR

Query:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQ
         KHIE+KYHLI EIV RGDVIVTKIASEHN+ DPFTK LTAKVFE  LE LGL+
Subjt:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQ

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa]5.6e-5975.32Show/hide
Query:  TGYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR
        TGY DS    D+DS     G VFTLN G VVWRS+KQGCIADSTMEAEYVAACEAAKE VWLR F+++LEVVPNM+  IT+YCDNS AVANSREPRSHKR
Subjt:  TGYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR

Query:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQVFPN
         KHIE+KYHLI EIVHRGDVIVT+IAS HNV DPFTK LTAKVFE  LE LGL+  P+
Subjt:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQVFPN

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein2.7e-5975.32Show/hide
Query:  TGYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR
        TGY DS    D+DS     G VFTLN G VVWRS+KQGCIADSTMEAEYVAACEAAKE VWLR F+++LEVVPNM+  IT+YCDNS AVANSREPRSHKR
Subjt:  TGYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR

Query:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQVFPN
         KHIE+KYHLI EIVHRGDVIVT+IAS HNV DPFTK LTAKVFE  LE LGL+  P+
Subjt:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQVFPN

A0A5A7TZD0 Gag/pol protein2.7e-5977.92Show/hide
Query:  TGYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR
        TGY DS    DKDS     G VFTLN G VVWRS+KQGCIADSTMEAEYVAACEAAKE VWLRKF+ +LEVVPNM L IT+YCDNS AVANS+EPRSHKR
Subjt:  TGYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR

Query:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQ
         KHIE+KYHLI EIV RGDVIVTKIASEHN+ DPFTK LTAKVFE  LE LGL+
Subjt:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQ

A0A5A7UXM9 Gag/pol protein4.2e-6078.57Show/hide
Query:  TGYIDSRLSIDKDS-LNLHRGVFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR
        TGY DS+   DKDS  +  R VFTLN G VVWRS+KQGCIADSTMEAEYVAACEAAKE VWLRKF+ +LEVVPNM L IT+YCDNS AVANS+EPRSHKR
Subjt:  TGYIDSRLSIDKDS-LNLHRGVFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR

Query:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQ
         KHIE+KYHLIWEIV RGDVIVTKIASEHN+ DPFTK LTAKVFE  LE LGL+
Subjt:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQ

A0A5A7V1F5 Gag/pol protein1.6e-5977.92Show/hide
Query:  TGYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR
        TGY DS    DKDS     G VFTLN+G VVWRS+KQGCIADSTMEAEYVAACEAAKE VWLRKF+ +LEVVPNM L IT+YCDNS AVANS+EPRSHKR
Subjt:  TGYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR

Query:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQ
         KHIE+KYHLI EIV RGDVIVTKIASEHN+ DPFTK LTAKVFE  LE LGL+
Subjt:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQ

A0A5D3CPJ6 Gag/pol protein2.7e-5975.32Show/hide
Query:  TGYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR
        TGY DS    D+DS     G VFTLN G VVWRS+KQGCIADSTMEAEYVAACEAAKE VWLR F+++LEVVPNM+  IT+YCDNS AVANSREPRSHKR
Subjt:  TGYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKR

Query:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQVFPN
         KHIE+KYHLI EIVHRGDVIVT+IAS HNV DPFTK LTAKVFE  LE LGL+  P+
Subjt:  KKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQVFPN

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.2e-1532.9Show/hide
Query:  GYIDSRLS---IDKDSLNLHRGVFTLND-GVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSH
        GY+DS  +   ID+ S   +  +F + D  ++ W + +Q  +A S+ EAEY+A  EA +E +WL+  + ++ +   +   I +Y DN   ++ +  P  H
Subjt:  GYIDSRLS---IDKDSLNLHRGVFTLND-GVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSH

Query:  KRKKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGL
        KR KHI+ KYH   E V    + +  I +E+ + D FTK L A  F    + LGL
Subjt:  KRKKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGL

P0CV72 Secreted RxLR effector protein 1611.1e-0445.9Show/hide
Query:  GYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWL
        GY D+  + D +S     G +F LN G V WRS KQ  +A S+ E EY+A  EA +E VWL
Subjt:  GYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-949.8e-2236.47Show/hide
Query:  QGTT--CLCMGLRIRSHTGYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYC
        +GTT  CLC G       GY D+ ++ D D+     G +FT + G + W+S  Q C+A ST EAEY+AA E  KE++WL++F+  L +         +YC
Subjt:  QGTT--CLCMGLRIRSHTGYIDSRLSIDKDSLNLHRG-VFTLNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYC

Query:  DNSDAVANSREPRSHKRKKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGL
        D+  A+  S+    H R KHI+ +YH I E+V    + V KI++  N  D  TK +    FE   E +G+
Subjt:  DNSDAVANSREPRSHKRKKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.1e-0732.58Show/hide
Query:  LNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKRKKHIEQKYHLIWE
        L   ++ W+S KQ  ++ S+ EAEY A   A  E++WL +F   L++   ++    ++CDN+ A+  +     H+R KHIE   H + E
Subjt:  LNDGVVVWRSVKQGCIADSTMEAEYVAACEAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKRKKHIEQKYHLIWE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCAACCCAAAGGGTTCATTGAACCAGGACAAGAGCAAAAGTTGGGATGGTCAGTAGGTATCAATCCAATCCAGGACTTGAACACTAGACAGCGGTTAAAACAATC
CTTAAGTATCTTCGGAGAACAAGGAACTACATGCTTGTGTATGGGGCTAAGGATCCGATCTCACACAGGATACATAGATTCTCGACTTTCGATCGATAAAGATTCTCTAA
ATCTACATCGTGGCGTGTTCACTCTTAATGATGGAGTTGTAGTATGGCGAAGCGTCAAGCAAGGATGCATCGCTGATTCCACCATGGAGGCCGAATATGTAGCAGCTTGT
GAAGCAGCTAAAGAGGTCGTTTGGCTAAGGAAATTCATGCTAAATTTGGAAGTTGTTCCAAATATGACTTTGACCATCACGATGTATTGCGATAACAGTGATGCAGTGGC
AAATTCGAGGGAACCCCGAAGTCACAAGAGGAAAAAGCACATTGAGCAGAAATATCATCTCATCTGGGAGATCGTGCATAGAGGAGACGTGATTGTCACGAAGATAGCCT
CAGAGCACAACGTTGTTGATCCTTTTACAAAGGCTCTCACGGCTAAAGTATTTGAGAGTCGCCTAGAAGGTCTAGGTTTACAAGTCTTCCCCAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACCAACCCAAAGGGTTCATTGAACCAGGACAAGAGCAAAAGTTGGGATGGTCAGTAGGTATCAATCCAATCCAGGACTTGAACACTAGACAGCGGTTAAAACAATC
CTTAAGTATCTTCGGAGAACAAGGAACTACATGCTTGTGTATGGGGCTAAGGATCCGATCTCACACAGGATACATAGATTCTCGACTTTCGATCGATAAAGATTCTCTAA
ATCTACATCGTGGCGTGTTCACTCTTAATGATGGAGTTGTAGTATGGCGAAGCGTCAAGCAAGGATGCATCGCTGATTCCACCATGGAGGCCGAATATGTAGCAGCTTGT
GAAGCAGCTAAAGAGGTCGTTTGGCTAAGGAAATTCATGCTAAATTTGGAAGTTGTTCCAAATATGACTTTGACCATCACGATGTATTGCGATAACAGTGATGCAGTGGC
AAATTCGAGGGAACCCCGAAGTCACAAGAGGAAAAAGCACATTGAGCAGAAATATCATCTCATCTGGGAGATCGTGCATAGAGGAGACGTGATTGTCACGAAGATAGCCT
CAGAGCACAACGTTGTTGATCCTTTTACAAAGGCTCTCACGGCTAAAGTATTTGAGAGTCGCCTAGAAGGTCTAGGTTTACAAGTCTTCCCCAACTAG
Protein sequenceShow/hide protein sequence
MDQPKGFIEPGQEQKLGWSVGINPIQDLNTRQRLKQSLSIFGEQGTTCLCMGLRIRSHTGYIDSRLSIDKDSLNLHRGVFTLNDGVVVWRSVKQGCIADSTMEAEYVAAC
EAAKEVVWLRKFMLNLEVVPNMTLTITMYCDNSDAVANSREPRSHKRKKHIEQKYHLIWEIVHRGDVIVTKIASEHNVVDPFTKALTAKVFESRLEGLGLQVFPN