; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021357 (gene) of Snake gourd v1 genome

Gene IDTan0021357
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG04:33260659..33261959
RNA-Seq ExpressionTan0021357
SyntenyTan0021357
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]7.1e-5949.29Show/hide
Query:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG
        M EGASV     +MM HFN+AEMNGA IDE+SQVSFI+E+LP+SFL+FRSN VMNKI+YTLTTLLNELQ F+SL++I+  + EANVA   R +HRGS SG
Subjt:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG

Query:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------
        TKS+  S    K KK K G  ++A     K  K+  A KG CF CN +GHWKRNCPK+L E+K   QGKYDLL                           
Subjt:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------

Query:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN
          GI SW+QL  GE+ + +G+G +V A  +  ++L    +++LL+NVY+V    RNL+ + CLLEQ  S++F+ N+ FI +N
Subjt:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]8.4e-6049.65Show/hide
Query:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG
        M EGASV     +MM HFN+AEMNGA IDE+SQVSFI+E+LP+SFL+FRSN VMNKI+YTLTTLLNELQ F+SL++I+  + EANVA   R +HRGS SG
Subjt:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG

Query:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------
        TKS+  S    K KK K G  ++A     K  K+  A KG CF CN +GHWKRNCPK+L E+K   QGKYDLL                           
Subjt:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------

Query:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN
          GI SWQQL  GE+ + +G+G +V A  +  ++L+   +++LL+NVY+V    RNL+ + CLLEQ  S++F+ N+ FI +N
Subjt:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]7.1e-5949.29Show/hide
Query:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG
        M EGASV     +MM HFN+AEMNGA IDE+SQVSFI+E+LP+SFL+FRSN VMNKI+YTLTTLLNELQ F+SL++I+  + EANVA   R +HRGS SG
Subjt:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG

Query:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------
        TKS+  S    K KK K G  ++A     K  K+  A KG CF CN +GHWKRNCPK+L E+K   QGKYDLL                           
Subjt:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------

Query:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN
          GI SW+QL  GE+ + +G+G +V A  +  ++L    +++LL+NVY+V    RNL+ + CLLEQ  S++F+ N+ FI +N
Subjt:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]7.1e-5949.29Show/hide
Query:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG
        M EGASV     +MM HFN+AEMNGA IDE+SQVSFI+E+LP+SFL+FRSN VMNKI+YTLTTLLNELQ F+SL++I+  + EANVA   R +HRGS SG
Subjt:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG

Query:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------
        TKS+  S    K KK K G  ++A     K  K+  A KG CF CN +GHWKRNCPK+L E+K   QGKYDLL                           
Subjt:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------

Query:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN
          GI SW+QL  GE+ + +G+G +V A  +  ++L    +++LL+NVY+V    RNL+ + CLLEQ  S++F+ N+ FI +N
Subjt:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]7.1e-5949.29Show/hide
Query:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG
        M EGASV     +MM HFN+AEMNGA IDE+SQVSFI+E+LP+SFL+FRSN VMNKI+YTLTTLLNELQ F+SL++I+  + EANVA   R +HRGS SG
Subjt:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG

Query:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------
        TKS+  S    K KK K G  ++A     K  K+  A KG CF CN +GHWKRNCPK+L E+K   QGKYDLL                           
Subjt:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------

Query:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN
          GI SW+QL  GE+ + +G+G +V A  +  ++L    +++LL+NVY+V    RNL+ + CLLEQ  S++F+ N+ FI +N
Subjt:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein3.4e-5949.29Show/hide
Query:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG
        M EGASV     +MM HFN+AEMNGA IDE+SQVSFI+E+LP+SFL+FRSN VMNKI+YTLTTLLNELQ F+SL++I+  + EANVA   R +HRGS SG
Subjt:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG

Query:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------
        TKS+  S    K KK K G  ++A     K  K+  A KG CF CN +GHWKRNCPK+L E+K   QGKYDLL                           
Subjt:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------

Query:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN
          GI SW+QL  GE+ + +G+G +V A  +  ++L    +++LL+NVY+V    RNL+ + CLLEQ  S++F+ N+ FI +N
Subjt:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN

A0A5A7TU93 Gag/pol protein4.1e-6049.65Show/hide
Query:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG
        M EGASV     +MM HFN+AEMNGA IDE+SQVSFI+E+LP+SFL+FRSN VMNKI+YTLTTLLNELQ F+SL++I+  + EANVA   R +HRGS SG
Subjt:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG

Query:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------
        TKS+  S    K KK K G  ++A     K  K+  A KG CF CN +GHWKRNCPK+L E+K   QGKYDLL                           
Subjt:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------

Query:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN
          GI SWQQL  GE+ + +G+G +V A  +  ++L+   +++LL+NVY+V    RNL+ + CLLEQ  S++F+ N+ FI +N
Subjt:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN

A0A5A7TWB9 Gag/pol protein3.4e-5949.29Show/hide
Query:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG
        M EGASV     +MM HFN+AEMNGA IDE+SQVSFI+E+LP+SFL+FRSN VMNKI+YTLTTLLNELQ F+SL++I+  + EANVA   R +HRGS SG
Subjt:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG

Query:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------
        TKS+  S    K KK K G  ++A     K  K+  A KG CF CN +GHWKRNCPK+L E+K   QGKYDLL                           
Subjt:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------

Query:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN
          GI SW+QL  GE+ + +G+G +V A  +  ++L    +++LL+NVY+V    RNL+ + CLLEQ  S++F+ N+ FI +N
Subjt:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN

A0A5A7V4M1 Gag/pol protein3.4e-5949.29Show/hide
Query:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG
        M EGASV     +MM HFN+AEMNGA IDE+SQVSFI+E+LP+SFL+FRSN VMNKI+YTLTTLLNELQ F+SL++I+  + EANVA   R +HRGS SG
Subjt:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG

Query:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------
        TKS+  S    K KK K G  ++A     K  K+  A KG CF CN +GHWKRNCPK+L E+K   QGKYDLL                           
Subjt:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------

Query:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN
          GI SW+QL  GE+ + +G+G +V A  +  ++L    +++LL+NVY+V    RNL+ + CLLEQ  S++F+ N+ FI +N
Subjt:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN

A0A5D3CPJ6 Gag/pol protein3.4e-5949.29Show/hide
Query:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG
        M EGASV     +MM HFN+AEMNGA IDE+SQVSFI+E+LP+SFL+FRSN VMNKI+YTLTTLLNELQ F+SL++I+  + EANVA   R +HRGS SG
Subjt:  MKEGASV-----DMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVA--YRSYHRGSNSG

Query:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------
        TKS+  S    K KK K G  ++A     K  K+  A KG CF CN +GHWKRNCPK+L E+K   QGKYDLL                           
Subjt:  TKSVAPSRPKGK-KKIKKGNADRAVALKGKNVKEV-ADKGKCFQCNGDGHWKRNCPKFLVERK--NQGKYDLL---------------------------

Query:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN
          GI SW+QL  GE+ + +G+G +V A  +  ++L    +++LL+NVY+V    RNL+ + CLLEQ  S++F+ N+ FI +N
Subjt:  --GIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLISCLLEQCISVSFHGNRAFISRN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGAAGGGGCGTCTGTGGACATGATGACCCACTTTAATTTGGCAGAGATGAACGGGGCTTCGATCGATGAGTCGAGCCAGGTCAGCTTTATTATGGAGACTCTTCC
GAAGAGTTTCCTTAAGTTTCGTAGCAATCCTGTTATGAACAAAATTAGTTACACTCTAACTACCCTCCTCAATGAGCTACAGAACTTTCAATCCTTGTTGAGGATCAGGA
CACCAGAAGCTGAGGCAAATGTTGCTTACAGATCATATCACAGGGGTTCGAACTCTGGGACCAAATCTGTTGCTCCTTCTCGCCCGAAAGGGAAGAAGAAAATTAAGAAG
GGTAACGCTGACCGAGCTGTCGCCCTAAAGGGCAAGAATGTCAAGGAAGTTGCAGACAAAGGAAAGTGTTTCCAATGCAATGGGGATGGTCACTGGAAGAGAAACTGTCC
CAAGTTCCTTGTCGAGAGGAAGAATCAAGGTAAATATGATTTACTGGGAATTGATTCCTGGCAGCAGCTGCGAAAGGGTGAGGTAAATCTATGGATTGGATCTGGGGAGA
TAGTCTTTGCTGCAACTATCGACAAAGTGAAGCTTTTCTCTGGCAGTAACTACATTTTGTTAGACAATGTGTACATAGTTCTTGGCTTTACTAGAAACCTCGTTCTAATT
TCCTGTTTATTAGAACAATGTATTTCCGTTTCTTTTCATGGTAATAGAGCGTTTATTTCCAGAAATGATGACTATTCGAGATATGGGTATATTAACCTAATGCACAAAAA
GACTGAAACTCTTGAAAAGTTCAAGGAGTACAAGACTGAGATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGAAGGGGCGTCTGTGGACATGATGACCCACTTTAATTTGGCAGAGATGAACGGGGCTTCGATCGATGAGTCGAGCCAGGTCAGCTTTATTATGGAGACTCTTCC
GAAGAGTTTCCTTAAGTTTCGTAGCAATCCTGTTATGAACAAAATTAGTTACACTCTAACTACCCTCCTCAATGAGCTACAGAACTTTCAATCCTTGTTGAGGATCAGGA
CACCAGAAGCTGAGGCAAATGTTGCTTACAGATCATATCACAGGGGTTCGAACTCTGGGACCAAATCTGTTGCTCCTTCTCGCCCGAAAGGGAAGAAGAAAATTAAGAAG
GGTAACGCTGACCGAGCTGTCGCCCTAAAGGGCAAGAATGTCAAGGAAGTTGCAGACAAAGGAAAGTGTTTCCAATGCAATGGGGATGGTCACTGGAAGAGAAACTGTCC
CAAGTTCCTTGTCGAGAGGAAGAATCAAGGTAAATATGATTTACTGGGAATTGATTCCTGGCAGCAGCTGCGAAAGGGTGAGGTAAATCTATGGATTGGATCTGGGGAGA
TAGTCTTTGCTGCAACTATCGACAAAGTGAAGCTTTTCTCTGGCAGTAACTACATTTTGTTAGACAATGTGTACATAGTTCTTGGCTTTACTAGAAACCTCGTTCTAATT
TCCTGTTTATTAGAACAATGTATTTCCGTTTCTTTTCATGGTAATAGAGCGTTTATTTCCAGAAATGATGACTATTCGAGATATGGGTATATTAACCTAATGCACAAAAA
GACTGAAACTCTTGAAAAGTTCAAGGAGTACAAGACTGAGATTTAG
Protein sequenceShow/hide protein sequence
MKEGASVDMMTHFNLAEMNGASIDESSQVSFIMETLPKSFLKFRSNPVMNKISYTLTTLLNELQNFQSLLRIRTPEAEANVAYRSYHRGSNSGTKSVAPSRPKGKKKIKK
GNADRAVALKGKNVKEVADKGKCFQCNGDGHWKRNCPKFLVERKNQGKYDLLGIDSWQQLRKGEVNLWIGSGEIVFAATIDKVKLFSGSNYILLDNVYIVLGFTRNLVLI
SCLLEQCISVSFHGNRAFISRNDDYSRYGYINLMHKKTETLEKFKEYKTEI