; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001749 (gene) of Snake gourd v1 genome

Gene IDTan0001749
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG02:75411987..75412565
RNA-Seq ExpressionTan0001749
SyntenyTan0001749
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-6868.75Show/hide
Query:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+  G N+ + K+ +NT+L++DDLRFVL+EECPQ+P+ NA ++V+E Y+RW KAN+KA+ YIL S+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV EMN AVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]5.4e-6969.27Show/hide
Query:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+  G N+   K+ +NT+L++DDLRFVL+EECPQ+P+ NA Q+V+E Y+RW KAN+KA+ YIL S+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV EMN AVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF

KAA0063887.1 gag/pol protein [Cucumis melo var. makuwa]3.1e-6971.88Show/hide
Query:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE
        MSSSIIALLK D+ TGEN+ T KSKLN ILV+ DLRFVL+EECP  P++NA QSVK+AYD W KANDKA +Y+L S+S++L+KKHE MV+AR+IM SL+E
Subjt:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF
        MFGQPS QI+ E++KYVYN+RMKEG SVREHVL ++V+FNV EMNKA+ DE+SQVS+IL+SL KSFLQF SN  MNKIEYN+TTLL ELQTF
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-6868.75Show/hide
Query:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+  G N+ + K+ +NT+L++DDLRFVL+EECPQ+P+ NA ++V+E Y+RW KAN+KA+ YIL S+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV EMN AVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF

TYK15919.1 gag/pol protein [Cucumis melo var. makuwa]3.1e-6971.88Show/hide
Query:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE
        MSSSIIALLK D+ TGEN+ T KSKLN ILV+ DLRFVL+EECP  P++NA QSVK+AYD W KANDKA +Y+L S+S++L+KKHE MV+AR+IM SL+E
Subjt:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF
        MFGQPS QI+ E++KYVYN+RMKEG SVREHVL ++V+FNV EMNKA+ DE+SQVS+IL+SL KSFLQF SN  MNKIEYN+TTLL ELQTF
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein5.8e-6968.75Show/hide
Query:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+  G N+ + K+ +NT+L++DDLRFVL+EECPQ+P+ NA ++V+E Y+RW KAN+KA+ YIL S+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV EMN AVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF

A0A5A7V6N0 Gag/pol protein2.6e-6969.27Show/hide
Query:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+  G N+   K+ +NT+L++DDLRFVL+EECPQ+P+ NA Q+V+E Y+RW KAN+KA+ YIL S+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV EMN AVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF

A0A5A7VA67 Gag/pol protein1.5e-6971.88Show/hide
Query:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE
        MSSSIIALLK D+ TGEN+ T KSKLN ILV+ DLRFVL+EECP  P++NA QSVK+AYD W KANDKA +Y+L S+S++L+KKHE MV+AR+IM SL+E
Subjt:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF
        MFGQPS QI+ E++KYVYN+RMKEG SVREHVL ++V+FNV EMNKA+ DE+SQVS+IL+SL KSFLQF SN  MNKIEYN+TTLL ELQTF
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF

A0A5D3CPJ6 Gag/pol protein5.8e-6968.75Show/hide
Query:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+  G N+ + K+ +NT+L++DDLRFVL+EECPQ+P+ NA ++V+E Y+RW KAN+KA+ YIL S+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV EMN AVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF

A0A5D3D0D9 Gag/pol protein1.5e-6971.88Show/hide
Query:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE
        MSSSIIALLK D+ TGEN+ T KSKLN ILV+ DLRFVL+EECP  P++NA QSVK+AYD W KANDKA +Y+L S+S++L+KKHE MV+AR+IM SL+E
Subjt:  MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF
        MFGQPS QI+ E++KYVYN+RMKEG SVREHVL ++V+FNV EMNKA+ DE+SQVS+IL+SL KSFLQF SN  MNKIEYN+TTLL ELQTF
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCAATAATAGCCTTACTTAAAAGCGACCGTTTCACTGGTGAGAATTTTACTACGTTGAAGTCCAAGTTGAATACGATTCTCGTTGTTGACGACCTACGGTT
TGTACTAATTGAGGAATGTCCTCAGATCCCTTCTCGTAACGCTCCTCAATCTGTTAAGGAGGCGTACGACCGCTGGATCAAGGCCAATGATAAGGCCAAGGTCTACATTT
TGGTTAGTGTTTCTGAAGTTCTAGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCACTGCAGGAAATGTTTGGACAACCGTCTGGACAGATTCGG
CACGAATCCCTCAAATACGTTTATAACTCCCGTATGAAGGAGGGGTCATCGGTGAGAGAACACGTTCTTGATCTGATGGTCCACTTCAACGTCGTTGAGATGAACAAAGC
GGTCATTGACGAGCAAAGTCAGGTATCGTTCATCCTGGAATCTCTTTTGAAGAGTTTCCTGCAATTTCGCAGCAATGCGGTGATGAACAAGATAGAGTATAACCTGACTA
CTCTCCTTAATGAACTACAGACTTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCAATAATAGCCTTACTTAAAAGCGACCGTTTCACTGGTGAGAATTTTACTACGTTGAAGTCCAAGTTGAATACGATTCTCGTTGTTGACGACCTACGGTT
TGTACTAATTGAGGAATGTCCTCAGATCCCTTCTCGTAACGCTCCTCAATCTGTTAAGGAGGCGTACGACCGCTGGATCAAGGCCAATGATAAGGCCAAGGTCTACATTT
TGGTTAGTGTTTCTGAAGTTCTAGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCACTGCAGGAAATGTTTGGACAACCGTCTGGACAGATTCGG
CACGAATCCCTCAAATACGTTTATAACTCCCGTATGAAGGAGGGGTCATCGGTGAGAGAACACGTTCTTGATCTGATGGTCCACTTCAACGTCGTTGAGATGAACAAAGC
GGTCATTGACGAGCAAAGTCAGGTATCGTTCATCCTGGAATCTCTTTTGAAGAGTTTCCTGCAATTTCGCAGCAATGCGGTGATGAACAAGATAGAGTATAACCTGACTA
CTCTCCTTAATGAACTACAGACTTTCTAG
Protein sequenceShow/hide protein sequence
MSSSIIALLKSDRFTGENFTTLKSKLNTILVVDDLRFVLIEECPQIPSRNAPQSVKEAYDRWIKANDKAKVYILVSVSEVLAKKHEGMVSAREIMSSLQEMFGQPSGQIR
HESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNKAVIDEQSQVSFILESLLKSFLQFRSNAVMNKIEYNLTTLLNELQTF