CuGenDBv2

Tan0019654 (gene) of Snake gourd v1 genome

Gene ID: Tan0019654
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG10:48442523..48443107
RNA-Seq Expression: Tan0019654
Synteny: Tan0019654
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
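
The genome location field can be read as a sequence name plus a start and end coordinate. The following is a minimal sketch of parsing it and computing the span length. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; the function names `parse_location` and `span_length` are hypothetical.

```python
import re

def parse_location(location):
    """Split a 'SEQ:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(location):
    """Length of the span, assuming 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1

print(span_length("LG10:48442523..48443107"))  # 585
```

Under that assumption the span is 585 bp, which equals 3 x (194 + 1): the 194 residues of the listed protein plus one stop codon. That is consistent with a single-exon gene model.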


Homology
GenBank top hits
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 3.4e-71; % identity: 69.07)
Query:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE
        M+S+ + +LA+DKL G+NY +WKN INT+L+ D+L+FVL EECPQV  +  +R++R+ Y+RW +ANEKA+ YI+ASLSEVLAKKHE M+TA+EIM+SLQE
Subjt:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS
        MFGQ S+Q++HD LK+++NARM EG SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE+LP+SFLQF SN VMNKI+YTLTTLLNELQ F+S
Subjt:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 3.4e-71; % identity: 69.07)
Query:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE
        M+S+ + +LA+DKL G+NY +WKN INT+L+ D+L+FVL EECPQV  +  +R++R+ Y+RW +ANEKA+ YI+ASLSEVLAKKHE M+TA+EIM+SLQE
Subjt:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS
        MFGQ S+Q++HD LK+++NARM EG SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE+LP+SFLQF SN VMNKI+YTLTTLLNELQ F+S
Subjt:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 3.4e-71; % identity: 69.07)
Query:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE
        M+S+ + +LA+DKL G+NY +WKN INT+L+ D+L+FVL EECPQV  +  +R++R+ Y+RW +ANEKA+ YI+ASLSEVLAKKHE M+TA+EIM+SLQE
Subjt:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS
        MFGQ S+Q++HD LK+++NARM EG SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE+LP+SFLQF SN VMNKI+YTLTTLLNELQ F+S
Subjt:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 3.4e-71; % identity: 69.07)
Query:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE
        M+S+ + +LA+DKL G+NY +WKN INT+L+ D+L+FVL EECPQV  +  +R++R+ Y+RW +ANEKA+ YI+ASLSEVLAKKHE M+TA+EIM+SLQE
Subjt:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS
        MFGQ S+Q++HD LK+++NARM EG SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE+LP+SFLQF SN VMNKI+YTLTTLLNELQ F+S
Subjt:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 3.4e-71; % identity: 69.07)
Query:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE
        M+S+ + +LA+DKL G+NY +WKN INT+L+ D+L+FVL EECPQV  +  +R++R+ Y+RW +ANEKA+ YI+ASLSEVLAKKHE M+TA+EIM+SLQE
Subjt:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS
        MFGQ S+Q++HD LK+++NARM EG SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE+LP+SFLQF SN VMNKI+YTLTTLLNELQ F+S
Subjt:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS
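
The % identity figures can be related to the alignment midline (the unlabelled line between Query and Subjt). The sketch below assumes the usual BLAST-style convention that a letter in the midline marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch. The function name `percent_identity` is hypothetical, and this is not necessarily how the database computed its values.

```python
def percent_identity(midline, aligned_length):
    """Percent of aligned positions whose midline character is a letter.

    Letters are taken to mean identical residues; '+' and spaces are not
    counted. aligned_length is passed separately because whitespace in a
    copied midline may not be preserved exactly.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)

# Toy example: 3 identical positions out of 5 aligned positions.
print(percent_identity("M+S I", 5))  # 60.0
```

Counting the letters in the two midline rows shown for these hits gives 62 + 72 = 134 identical positions over 194 aligned residues, and 134 / 194 rounds to 69.07, matching the reported value.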

TrEMBL top hits
A0A5A7SMH8 Gag/pol protein (e value: 1.6e-71; % identity: 69.07)
Query:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE
        M+S+ + +LA+DKL G+NY +WKN INT+L+ D+L+FVL EECPQV  +  +R++R+ Y+RW +ANEKA+ YI+ASLSEVLAKKHE M+TA+EIM+SLQE
Subjt:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS
        MFGQ S+Q++HD LK+++NARM EG SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE+LP+SFLQF SN VMNKI+YTLTTLLNELQ F+S
Subjt:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS

A0A5A7TU93 Gag/pol protein (e value: 1.6e-71; % identity: 69.07)
Query:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE
        M+S+ + +LA+DKL G+NY +WKN INT+L+ D+L+FVL EECPQV  +  +R++R+ Y+RW +ANEKA+ YI+ASLSEVLAKKHE M+TA+EIM+SLQE
Subjt:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS
        MFGQ S+Q++HD LK+++NARM EG SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE+LP+SFLQF SN VMNKI+YTLTTLLNELQ F+S
Subjt:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS

A0A5A7TWB9 Gag/pol protein (e value: 1.6e-71; % identity: 69.07)
Query:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE
        M+S+ + +LA+DKL G+NY +WKN INT+L+ D+L+FVL EECPQV  +  +R++R+ Y+RW +ANEKA+ YI+ASLSEVLAKKHE M+TA+EIM+SLQE
Subjt:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS
        MFGQ S+Q++HD LK+++NARM EG SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE+LP+SFLQF SN VMNKI+YTLTTLLNELQ F+S
Subjt:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS

A0A5D3CPJ6 Gag/pol protein (e value: 1.6e-71; % identity: 69.07)
Query:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE
        M+S+ + +LA+DKL G+NY +WKN INT+L+ D+L+FVL EECPQV  +  +R++R+ Y+RW +ANEKA+ YI+ASLSEVLAKKHE M+TA+EIM+SLQE
Subjt:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS
        MFGQ S+Q++HD LK+++NARM EG SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE+LP+SFLQF SN VMNKI+YTLTTLLNELQ F+S
Subjt:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS

A0A5D3CSZ6 Gag/pol protein (e value: 1.6e-71; % identity: 69.07)
Query:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE
        M+S+ + +LA+DKL G+NY +WKN INT+L+ D+L+FVL EECPQV  +  +R++R+ Y+RW +ANEKA+ YI+ASLSEVLAKKHE M+TA+EIM+SLQE
Subjt:  MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQE

Query:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS
        MFGQ S+Q++HD LK+++NARM EG SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE+LP+SFLQF SN VMNKI+YTLTTLLNELQ F+S
Subjt:  MFGQQSFQVRHDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS

SwissProt top hits
P04146 Copia protein (e value: 7.5e-05; % identity: 25.49)
Query:  GDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQEMFGQQSFQVRHDLLK
        G+ Y  WK  I  +L   ++  V++   P            +  D W +A   AK  II  LS+         +TA++I+E+L  ++ ++S   +  L K
Subjt:  GDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQEMFGQQSFQVRHDLLK

Query:  HVFNARMKEGVSVRE--HVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSF
         + + ++   +S+    H+ D +    LA   GA I+E  ++S +L TLP  +
Subjt:  HVFNARMKEGVSVRE--HVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSF

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGTCTAGCTCAATAATAGCTTTACTCGCTTCCGACAAACTAGTGGGAGATAACTACCAAACATGGAAAAACAATATAAACACAATTCTAGTAACTGACAACCTTAAGTT
CGTGCTCAATGAGGAGTGTCCTCAAGTGTCGGGCTCGACCACGTCACGAAGTATTCGTGATGCGTACGATCGATGGATCAGGGCCAATGAAAAGGCCAAGATCTACATCA
TTGCCAGCTTGTCTGAAGTATTGGCAAAGAAGCATGAGTTGATGGTCACCGCCAAGGAGATCATGGAGTCATTGCAGGAAATGTTTGGACAACAGTCCTTTCAGGTCCGA
CATGATTTGCTCAAACACGTCTTCAACGCCCGGATGAAAGAAGGGGTGTCTGTCCGTGAACACGTTCTAGACATGATGACCCACTTTAATCTGGCAGAGATGAACGGGGC
TCCGATTGATGAGTCGAGCCAGGTCAGCTTTATCTTGGAGACTCTTCCGAAAAGTTTCCTTCAGTTTTGTAGCAATGTTGTTATGAACAAAATTAGCTACACTCTAACCA
CCCTTCTAAACGAGCTACAGAACTTCCAGTCTTGA
mRNA sequence
ATGTCTAGCTCAATAATAGCTTTACTCGCTTCCGACAAACTAGTGGGAGATAACTACCAAACATGGAAAAACAATATAAACACAATTCTAGTAACTGACAACCTTAAGTT
CGTGCTCAATGAGGAGTGTCCTCAAGTGTCGGGCTCGACCACGTCACGAAGTATTCGTGATGCGTACGATCGATGGATCAGGGCCAATGAAAAGGCCAAGATCTACATCA
TTGCCAGCTTGTCTGAAGTATTGGCAAAGAAGCATGAGTTGATGGTCACCGCCAAGGAGATCATGGAGTCATTGCAGGAAATGTTTGGACAACAGTCCTTTCAGGTCCGA
CATGATTTGCTCAAACACGTCTTCAACGCCCGGATGAAAGAAGGGGTGTCTGTCCGTGAACACGTTCTAGACATGATGACCCACTTTAATCTGGCAGAGATGAACGGGGC
TCCGATTGATGAGTCGAGCCAGGTCAGCTTTATCTTGGAGACTCTTCCGAAAAGTTTCCTTCAGTTTTGTAGCAATGTTGTTATGAACAAAATTAGCTACACTCTAACCA
CCCTTCTAAACGAGCTACAGAACTTCCAGTCTTGA
Protein sequence
MSSSIIALLASDKLVGDNYQTWKNNINTILVTDNLKFVLNEECPQVSGSTTSRSIRDAYDRWIRANEKAKIYIIASLSEVLAKKHELMVTAKEIMESLQEMFGQQSFQVR
HDLLKHVFNARMKEGVSVREHVLDMMTHFNLAEMNGAPIDESSQVSFILETLPKSFLQFCSNVVMNKISYTLTTLLNELQNFQS
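
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code, which the record does not state explicitly; the helper name `translate` is hypothetical. It is applied only to the first 30 nucleotides and the last 9 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering
# (third codon position varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTCTAGCTCAATAATAGCTTTACTCGCT"))  # MSSSIIALLA
print(translate("CAGTCTTGA"))                       # QS
```

The two outputs agree with the start (MSSSIIALLA) and the end (QS, followed by a TGA stop) of the protein sequence listed above.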