; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017701 (gene) of Snake gourd v1 genome

Gene IDTan0017701
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG06:54037549..54037782
RNA-Seq ExpressionTan0017701
SyntenyTan0017701
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-2167.53Show/hide
Query:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDESS
        +S++L KKHE M+TA+EIMD +Q MFGQ S Q +H+ALKYI+N+RM EG SVR+HVL+MMVHFN+AE NGA IDE+S
Subjt:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDESS

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-2167.53Show/hide
Query:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDESS
        +S++L KKHE M+TA+EIMD +Q MFGQ S Q +H+ALKYI+N+RM EG SVR+HVL+MMVHFN+AE NGA IDE+S
Subjt:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDESS

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-2167.53Show/hide
Query:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDESS
        +S++L KKHE M+TA+EIMD +Q MFGQ S Q +H+ALKYI+N+RM EG SVR+HVL+MMVHFN+AE NGA IDE+S
Subjt:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDESS

XP_022152352.1 uncharacterized protein LOC111020095 [Momordica charantia]1.5e-2273.33Show/hide
Query:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDE
        +SD+L KKHE  ITAKEIMD +Q MFGQ S+QARH ALK+I+NSRM+EG+SVR+HVL++MVHFN+AESNGA IDE
Subjt:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDE

XP_022158197.1 uncharacterized protein LOC111024734 [Momordica charantia]2.2e-2167.53Show/hide
Query:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDESS
        +SD+L KKHE  +T KEIMD +Q MFGQ S QARH ALK+++NSRM+EG+SVR+HVL++MVHFN+AESNG  IDE S
Subjt:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDESS

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.8e-2167.53Show/hide
Query:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDESS
        +S++L KKHE M+TA+EIMD +Q MFGQ S Q +H+ALKYI+N+RM EG SVR+HVL+MMVHFN+AE NGA IDE+S
Subjt:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDESS

A0A5A7V4M1 Gag/pol protein1.8e-2167.53Show/hide
Query:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDESS
        +S++L KKHE M+TA+EIMD +Q MFGQ S Q +H+ALKYI+N+RM EG SVR+HVL+MMVHFN+AE NGA IDE+S
Subjt:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDESS

A0A5D3CPJ6 Gag/pol protein1.8e-2167.53Show/hide
Query:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDESS
        +S++L KKHE M+TA+EIMD +Q MFGQ S Q +H+ALKYI+N+RM EG SVR+HVL+MMVHFN+AE NGA IDE+S
Subjt:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDESS

A0A6J1DFZ2 uncharacterized protein LOC1110200957.3e-2373.33Show/hide
Query:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDE
        +SD+L KKHE  ITAKEIMD +Q MFGQ S+QARH ALK+I+NSRM+EG+SVR+HVL++MVHFN+AESNGA IDE
Subjt:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDE

A0A6J1DWL0 uncharacterized protein LOC1110247341.1e-2167.53Show/hide
Query:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDESS
        +SD+L KKHE  +T KEIMD +Q MFGQ S QARH ALK+++NSRM+EG+SVR+HVL++MVHFN+AESNG  IDE S
Subjt:  MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDESS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGACTTATTAGTCAAGAAGCATGAGGGCATGATAACCGCCAAGGAAATCATGGATTTTATGCAGGGTATGTTTGGACAACAGTCCACACAAGCTAGGCACAATGC
CCTAAAGTACATATTTAACTCAAGGATGCAAGAGGGTACATCTGTTCGGGATCATGTCCTTGATATGATGGTGCACTTCAACATCGCAGAGTCGAATGGTGCTTCCATCG
ATGAATCGAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTGACTTATTAGTCAAGAAGCATGAGGGCATGATAACCGCCAAGGAAATCATGGATTTTATGCAGGGTATGTTTGGACAACAGTCCACACAAGCTAGGCACAATGC
CCTAAAGTACATATTTAACTCAAGGATGCAAGAGGGTACATCTGTTCGGGATCATGTCCTTGATATGATGGTGCACTTCAACATCGCAGAGTCGAATGGTGCTTCCATCG
ATGAATCGAGCTAG
Protein sequenceShow/hide protein sequence
MSDLLVKKHEGMITAKEIMDFMQGMFGQQSTQARHNALKYIFNSRMQEGTSVRDHVLDMMVHFNIAESNGASIDESS