CuGenDBv2

Tan0001866 (gene) of Snake gourd v1 genome

Gene ID: Tan0001866
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG02:45821170..45822629
RNA-Seq Expression: Tan0001866
Synteny: Tan0001866
Gene Ontology terms:
    GO:0006508 - proteolysis (biological process)
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008234 - cysteine-type peptidase activity (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR036875 - Zinc finger, CCHC-type superfamily
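
The genome location in this record is written as scaffold, colon, start, two dots, end. A minimal Python sketch for splitting such a string and computing the span follows. It assumes the coordinates are 1-based and inclusive, which the record does not state, and `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a string such as 'LG02:45821170..45822629' into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive (not stated in the record).
    return end - start + 1


scaffold, start, end = parse_location("LG02:45821170..45822629")
print(scaffold, start, end, span_length(start, end))
```

Under that assumption the span covers 1,460 bases.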


Homology
GenBank top hits
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]; e-value 8.3e-58; identity 64.32%
Query:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN
        M+TA+EIMDSLQ+MFGQ S+Q++HD+LK+++NARM E +SVREHVL+MM HFN+AEMN A IDE+SQVSFILE+LP SFLQFRSNAVMNKI+YTLTTLLN
Subjt:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN

Query:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK
        +LQ F+S+M+I+  + EANVA   R +HRGSTS TK +  S    K K K+G    KA+  A +  KK K  A KG CFHCN + HWKRNCPK+LAE+K
Subjt:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]; e-value 8.3e-58; identity 64.32%
Query:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN
        M+TA+EIMDSLQ+MFGQ S+Q++HD+LK+++NARM E +SVREHVL+MM HFN+AEMN A IDE+SQVSFILE+LP SFLQFRSNAVMNKI+YTLTTLLN
Subjt:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN

Query:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK
        +LQ F+S+M+I+  + EANVA   R +HRGSTS TK +  S    K K K+G    KA+  A +  KK K  A KG CFHCN + HWKRNCPK+LAE+K
Subjt:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]; e-value 8.3e-58; identity 64.32%
Query:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN
        M+TA+EIMDSLQ+MFGQ S+Q++HD+LK+++NARM E +SVREHVL+MM HFN+AEMN A IDE+SQVSFILE+LP SFLQFRSNAVMNKI+YTLTTLLN
Subjt:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN

Query:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK
        +LQ F+S+M+I+  + EANVA   R +HRGSTS TK +  S    K K K+G    KA+  A +  KK K  A KG CFHCN + HWKRNCPK+LAE+K
Subjt:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]; e-value 8.3e-58; identity 64.32%
Query:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN
        M+TA+EIMDSLQ+MFGQ S+Q++HD+LK+++NARM E +SVREHVL+MM HFN+AEMN A IDE+SQVSFILE+LP SFLQFRSNAVMNKI+YTLTTLLN
Subjt:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN

Query:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK
        +LQ F+S+M+I+  + EANVA   R +HRGSTS TK +  S    K K K+G    KA+  A +  KK K  A KG CFHCN + HWKRNCPK+LAE+K
Subjt:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK

TYK28896.1 gag/pol protein [Cucumis melo var. makuwa]; e-value 4.9e-58; identity 64.32%
Query:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN
        M+TA+EIMDSLQ+MFGQ S+Q++HD+LK+++NARM E +SVREHVL+MM HFN+AEMN A IDE+SQVSFIL++LP SFLQFRSNAVMNKI+YTLTTLLN
Subjt:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN

Query:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK
        +LQNF+S+M+I+  + EANVA   R +HRGST  TK V  S    K K K+G    KA+  A +  KK K  A KG CFHCN + HWKRNCPK+LAE+K
Subjt:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK

TrEMBL top hits
A0A5A7SMH8 Gag/pol protein; e-value 4.0e-58; identity 64.32%
Query:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN
        M+TA+EIMDSLQ+MFGQ S+Q++HD+LK+++NARM E +SVREHVL+MM HFN+AEMN A IDE+SQVSFILE+LP SFLQFRSNAVMNKI+YTLTTLLN
Subjt:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN

Query:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK
        +LQ F+S+M+I+  + EANVA   R +HRGSTS TK +  S    K K K+G    KA+  A +  KK K  A KG CFHCN + HWKRNCPK+LAE+K
Subjt:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK

A0A5A7TU93 Gag/pol protein; e-value 4.0e-58; identity 64.32%
Query:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN
        M+TA+EIMDSLQ+MFGQ S+Q++HD+LK+++NARM E +SVREHVL+MM HFN+AEMN A IDE+SQVSFILE+LP SFLQFRSNAVMNKI+YTLTTLLN
Subjt:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN

Query:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK
        +LQ F+S+M+I+  + EANVA   R +HRGSTS TK +  S    K K K+G    KA+  A +  KK K  A KG CFHCN + HWKRNCPK+LAE+K
Subjt:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK

A0A5A7V4M1 Gag/pol protein; e-value 4.0e-58; identity 64.32%
Query:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN
        M+TA+EIMDSLQ+MFGQ S+Q++HD+LK+++NARM E +SVREHVL+MM HFN+AEMN A IDE+SQVSFILE+LP SFLQFRSNAVMNKI+YTLTTLLN
Subjt:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN

Query:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK
        +LQ F+S+M+I+  + EANVA   R +HRGSTS TK +  S    K K K+G    KA+  A +  KK K  A KG CFHCN + HWKRNCPK+LAE+K
Subjt:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK

A0A5D3CPJ6 Gag/pol protein; e-value 4.0e-58; identity 64.32%
Query:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN
        M+TA+EIMDSLQ+MFGQ S+Q++HD+LK+++NARM E +SVREHVL+MM HFN+AEMN A IDE+SQVSFILE+LP SFLQFRSNAVMNKI+YTLTTLLN
Subjt:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN

Query:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK
        +LQ F+S+M+I+  + EANVA   R +HRGSTS TK +  S    K K K+G    KA+  A +  KK K  A KG CFHCN + HWKRNCPK+LAE+K
Subjt:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK

A0A5D3DYU3 Gag/pol protein; e-value 2.4e-58; identity 64.32%
Query:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN
        M+TA+EIMDSLQ+MFGQ S+Q++HD+LK+++NARM E +SVREHVL+MM HFN+AEMN A IDE+SQVSFIL++LP SFLQFRSNAVMNKI+YTLTTLLN
Subjt:  MITAKEIMDSLQDMFGQQSFQVRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLN

Query:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK
        +LQNF+S+M+I+  + EANVA   R +HRGST  TK V  S    K K K+G    KA+  A +  KK K  A KG CFHCN + HWKRNCPK+LAE+K
Subjt:  KLQNFQSMMRIRAPEVEANVA--YRSYHRGSTSRTKHVAPSRSKGKKKMKRG----KADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERK

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGGAGTGTCCCCCTTCGGATGGTATTTGTATGAGTCAATATCAAAGTGGATGGAGAGAGTACTTATAGTGAATGGGAGCGAGACCTGTGAAAACGTGACCCACGGTCT
CTTCTTTCGATTCACACCGAGTGTCCTCAGTTGTCGAGCTCGACTGCATCACGAAGTGTTCGTGATGCATACGATCGATGGATCAGGGCCAATGAAAAGGCCAAGGTTTA
CATCATTGCCAACTTATCTGAAGTATTGGCAAAGAAGCATGAGTTGTATGATCACCGCTAAAGAGATCATGGATTCGTTGCAGGACATGTTTGGACAACAGTCCTTTCAG
GTAAGGCACGATTCACTCAAACACGTCTTCAACGCACGGATGAAAGAAGTGTCGTCTGTCCGTGAACATGTTCTAGACATGATGACCCACTTTAATCTGGCGGAGATGAA
CTGGGCTTCGATCGACGAGTCAAGCCAGGTCAGCTTTATTTTGGAGACTCTTCCGAATAGTTTCCTTCAATTTCGTAGCAACGCTGTTATGAACAAGATTAGCTACACTC
TGACTACCCTCCTCAATAAGCTACAGAATTTCCAGTCCATGATGAGGATCAGAGCACCGGAAGTTGAGGCAAATGTTGCCTACAGGTCTTATCACAGGGGTTCGACCTCA
AGGACTAAACATGTTGCTCCTTCTCGCTCGAAAGGGAAGAAAAAGATGAAGAGGGGTAAAGCTGACCGAGATGCCCCCCAAAAGGGCAAGAAGGTCAAGGAAGTTGCAGA
GAAAGGAAAGTGTTTCCACTGTAATGGGGACGACCACTGGAAGAGAAACTGTCCCAAGTTCCTTGCCGAGAGAAAAAATCAAGATAAATGA
mRNA sequence
ATGGGAGTGTCCCCCTTCGGATGGTATTTGTATGAGTCAATATCAAAGTGGATGGAGAGAGTACTTATAGTGAATGGGAGCGAGACCTGTGAAAACGTGACCCACGGTCT
CTTCTTTCGATTCACACCGAGTGTCCTCAGTTGTCGAGCTCGACTGCATCACGAAGTGTTCGTGATGCATACGATCGATGGATCAGGGCCAATGAAAAGGCCAAGGTTTA
CATCATTGCCAACTTATCTGAAGTATTGGCAAAGAAGCATGAGTTGTATGATCACCGCTAAAGAGATCATGGATTCGTTGCAGGACATGTTTGGACAACAGTCCTTTCAG
GTAAGGCACGATTCACTCAAACACGTCTTCAACGCACGGATGAAAGAAGTGTCGTCTGTCCGTGAACATGTTCTAGACATGATGACCCACTTTAATCTGGCGGAGATGAA
CTGGGCTTCGATCGACGAGTCAAGCCAGGTCAGCTTTATTTTGGAGACTCTTCCGAATAGTTTCCTTCAATTTCGTAGCAACGCTGTTATGAACAAGATTAGCTACACTC
TGACTACCCTCCTCAATAAGCTACAGAATTTCCAGTCCATGATGAGGATCAGAGCACCGGAAGTTGAGGCAAATGTTGCCTACAGGTCTTATCACAGGGGTTCGACCTCA
AGGACTAAACATGTTGCTCCTTCTCGCTCGAAAGGGAAGAAAAAGATGAAGAGGGGTAAAGCTGACCGAGATGCCCCCCAAAAGGGCAAGAAGGTCAAGGAAGTTGCAGA
GAAAGGAAAGTGTTTCCACTGTAATGGGGACGACCACTGGAAGAGAAACTGTCCCAAGTTCCTTGCCGAGAGAAAAAATCAAGATAAATGA
Protein sequence
MGVSPFGWYLYESISKWMERVLIVNGSETCENVTHGLFFRFTPSVLSCRARLHHEVFVMHTIDGSGPMKRPRFTSLPTYLKYWQRSMSCMITAKEIMDSLQDMFGQQSFQ
VRHDSLKHVFNARMKEVSSVREHVLDMMTHFNLAEMNWASIDESSQVSFILETLPNSFLQFRSNAVMNKISYTLTTLLNKLQNFQSMMRIRAPEVEANVAYRSYHRGSTS
RTKHVAPSRSKGKKKMKRGKADRDAPQKGKKVKEVAEKGKCFHCNGDDHWKRNCPKFLAERKNQDK
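
The protein sequence can be cross-checked against the CDS by translating the CDS codon by codon. The following is a minimal Python sketch, assuming the standard genetic code and a CDS that is in frame from its first base. `translate` is a hypothetical helper written for illustration, not a CuGenDB function. It reads three bases at a time, stops at the first stop codon, and writes `X` for any codon it does not recognize.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS (whitespace and line breaks ignored) up to the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS shown in this record.
print(translate("ATGGGAGTGTCCCCCTTCGGATGGTATTTG"))
```

For the first 30 bases of the CDS this yields `MGVSPFGWYL`, which matches the start of the protein sequence listed above. Passing the full CDS, with the line breaks left in, allows the whole protein to be compared the same way.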