CuGenDBv2

Tan0004322 (gene) of Snake gourd v1 genome

Gene ID: Tan0004322
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG01:98198685..98199170
RNA-Seq Expression: Tan0004322
Synteny: Tan0004322
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
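The genome location above can be turned into a span length with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this) and using the hypothetical helper names `parse_location` and `span_length`, which are not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'SEQID:START..END' into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG01:98198685..98199170")
length = span_length(start, end)
# Whole codons in the span, plus any leftover bases.
codons, remainder = divmod(length, 3)
print(seqid, length, codons, remainder)
```

Under that assumption the span is 486 bp, which divides evenly into 162 codons. If the entire span is coding, that corresponds to 161 amino acids plus a stop codon.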


Homology
GenBank top hits (e value, % identity, alignment)
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa], e value 8.8e-49, identity 68.06%
Query:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL
        KA+ Y+LAS+S++LAKKHE M+TA+EI DSLQ MFGQ S Q +H+ALKYI+N+RM EG+SVR+HVL+MMVHFN+AE NG  ID++SQV+FILE+LP+SFL
Subjt:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL

Query:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS
        QFRSN VMNK+ + LT+LLNELQ F+ LMKI+G KGEANV + +
Subjt:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa], e value 8.8e-49, identity 68.06%
Query:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL
        KA+ Y+LAS+S++LAKKHE M+TA+EI DSLQ MFGQ S Q +H+ALKYI+N+RM EG+SVR+HVL+MMVHFN+AE NG  ID++SQV+FILE+LP+SFL
Subjt:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL

Query:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS
        QFRSN VMNK+ + LT+LLNELQ F+ LMKI+G KGEANV + +
Subjt:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa], e value 8.8e-49, identity 68.06%
Query:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL
        KA+ Y+LAS+S++LAKKHE M+TA+EI DSLQ MFGQ S Q +H+ALKYI+N+RM EG+SVR+HVL+MMVHFN+AE NG  ID++SQV+FILE+LP+SFL
Subjt:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL

Query:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS
        QFRSN VMNK+ + LT+LLNELQ F+ LMKI+G KGEANV + +
Subjt:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa], e value 8.8e-49, identity 68.06%
Query:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL
        KA+ Y+LAS+S++LAKKHE M+TA+EI DSLQ MFGQ S Q +H+ALKYI+N+RM EG+SVR+HVL+MMVHFN+AE NG  ID++SQV+FILE+LP+SFL
Subjt:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL

Query:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS
        QFRSN VMNK+ + LT+LLNELQ F+ LMKI+G KGEANV + +
Subjt:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa], e value 8.8e-49, identity 68.06%
Query:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL
        KA+ Y+LAS+S++LAKKHE M+TA+EI DSLQ MFGQ S Q +H+ALKYI+N+RM EG+SVR+HVL+MMVHFN+AE NG  ID++SQV+FILE+LP+SFL
Subjt:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL

Query:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS
        QFRSN VMNK+ + LT+LLNELQ F+ LMKI+G KGEANV + +
Subjt:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein, e value 4.3e-49, identity 68.06%
Query:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL
        KA+ Y+LAS+S++LAKKHE M+TA+EI DSLQ MFGQ S Q +H+ALKYI+N+RM EG+SVR+HVL+MMVHFN+AE NG  ID++SQV+FILE+LP+SFL
Subjt:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL

Query:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS
        QFRSN VMNK+ + LT+LLNELQ F+ LMKI+G KGEANV + +
Subjt:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS

A0A5A7TU93 Gag/pol protein, e value 4.3e-49, identity 68.06%
Query:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL
        KA+ Y+LAS+S++LAKKHE M+TA+EI DSLQ MFGQ S Q +H+ALKYI+N+RM EG+SVR+HVL+MMVHFN+AE NG  ID++SQV+FILE+LP+SFL
Subjt:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL

Query:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS
        QFRSN VMNK+ + LT+LLNELQ F+ LMKI+G KGEANV + +
Subjt:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS

A0A5A7TWB9 Gag/pol protein, e value 4.3e-49, identity 68.06%
Query:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL
        KA+ Y+LAS+S++LAKKHE M+TA+EI DSLQ MFGQ S Q +H+ALKYI+N+RM EG+SVR+HVL+MMVHFN+AE NG  ID++SQV+FILE+LP+SFL
Subjt:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL

Query:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS
        QFRSN VMNK+ + LT+LLNELQ F+ LMKI+G KGEANV + +
Subjt:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS

A0A5A7V4M1 Gag/pol protein, e value 4.3e-49, identity 68.06%
Query:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL
        KA+ Y+LAS+S++LAKKHE M+TA+EI DSLQ MFGQ S Q +H+ALKYI+N+RM EG+SVR+HVL+MMVHFN+AE NG  ID++SQV+FILE+LP+SFL
Subjt:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL

Query:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS
        QFRSN VMNK+ + LT+LLNELQ F+ LMKI+G KGEANV + +
Subjt:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS

A0A5D3CPJ6 Gag/pol protein, e value 4.3e-49, identity 68.06%
Query:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL
        KA+ Y+LAS+S++LAKKHE M+TA+EI DSLQ MFGQ S Q +H+ALKYI+N+RM EG+SVR+HVL+MMVHFN+AE NG  ID++SQV+FILE+LP+SFL
Subjt:  KAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFL

Query:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS
        QFRSN VMNK+ + LT+LLNELQ F+ LMKI+G KGEANV + +
Subjt:  QFRSNVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKS

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence:
ATGGATCGAGGCCAATGTAAGGCCAAAGTCTATATGTTGGCAAGTATCTCTGACATACTAGCCAAGAAGCACGAGGGCATGATGACCGCTAAGGAAATCACGGATTCGTT
GCAGGGCATGTTTGGACAACAGTCTACACAGGCCCGACACAATGCCCTCAAGTATATATTCAACTCGAGGATGCCAGAGGGTTCATCTGTTCGGGATCATGTCTTGGATA
TGATGGTACACTTCAATATCGCGGAGTCGAATGGTACTTCCATCGATAAATCGAGCCAAGTCAACTTCATTCTGGAAACTCTTCCAGATAGTTTCTTACAGTTTAGAAGT
AATGTTGTTATGAACAAGGTTACTTTTAATCTCACTTCCCTTCTGAATGAACTCCAGGCCTTTCAGCCTTTGATGAAAATTCAGGGACCGAAAGGTGAGGCAAATGTTAC
CAGTAAGAGTTATAACAATCAGTGGCTTCATCTTCTTCTAACCTGA
mRNA sequence:
ATGGATCGAGGCCAATGTAAGGCCAAAGTCTATATGTTGGCAAGTATCTCTGACATACTAGCCAAGAAGCACGAGGGCATGATGACCGCTAAGGAAATCACGGATTCGTT
GCAGGGCATGTTTGGACAACAGTCTACACAGGCCCGACACAATGCCCTCAAGTATATATTCAACTCGAGGATGCCAGAGGGTTCATCTGTTCGGGATCATGTCTTGGATA
TGATGGTACACTTCAATATCGCGGAGTCGAATGGTACTTCCATCGATAAATCGAGCCAAGTCAACTTCATTCTGGAAACTCTTCCAGATAGTTTCTTACAGTTTAGAAGT
AATGTTGTTATGAACAAGGTTACTTTTAATCTCACTTCCCTTCTGAATGAACTCCAGGCCTTTCAGCCTTTGATGAAAATTCAGGGACCGAAAGGTGAGGCAAATGTTAC
CAGTAAGAGTTATAACAATCAGTGGCTTCATCTTCTTCTAACCTGA
Protein sequence:
MDRGQCKAKVYMLASISDILAKKHEGMMTAKEITDSLQGMFGQQSTQARHNALKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGTSIDKSSQVNFILETLPDSFLQFRS
NVVMNKVTFNLTSLLNELQAFQPLMKIQGPKGEANVTSKSYNNQWLHLLLT