; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020106 (gene) of Snake gourd v1 genome

Gene IDTan0020106
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG06:59175096..59175586
RNA-Seq ExpressionTan0020106
SyntenyTan0020106
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]3.5e-3263.64Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANV-VTSKKFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQAP----TRKAKT
        ++SLPKSF+PF+TNA +NKIE++LTTLLNELQ F++L  SKGKE EANV VT +KF+RGSSS  K GPS        K + K KGKG+AP     +K   
Subjt:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANV-VTSKKFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQAP----TRKAKT

Query:  TGKCFHCGVDGHWKRNCPKYLAEKKAEKEKQG
         GKCFHC  DGHWKRNCPKYLAEKKAEK  QG
Subjt:  TGKCFHCGVDGHWKRNCPKYLAEKKAEKEKQG

KAA0026069.1 gag/pol protein [Cucumis melo var. makuwa]8.6e-3166.17Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQ----APTRKAKT
        +ESLP+SFL FR+NAVMNKI Y+LTT+LNELQTFESLMK KG++ EANV TS +KF RGS+SGTKS PS S NK  +KKK     K        T+KAKT
Subjt:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQ----APTRKAKT

Query:  T-GKCFHCGVDGHWKRNCPKYLAEKKAEKEKQG
          G CFHC  +GHWKRNCPKYLAEKK  K KQG
Subjt:  T-GKCFHCGVDGHWKRNCPKYLAEKKAEKEKQG

KAA0040701.1 gag/pol protein [Cucumis melo var. makuwa]2.3e-3164.06Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANVVTSK-KFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQAPTRKAKTTGKC
        +ESLPKSF+PF+TNA +NKIE++LTTLLNELQ F++L K KGKE EANV T+K KF RG SS +KSGPS   N+ I+KK K+   K Q   +K    GKC
Subjt:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANVVTSK-KFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQAPTRKAKTTGKC

Query:  FHCGVDGHWKRNCPKYLAEKKAEKEKQG
        +HCG +GHW RNCPKYLA+KKAEKE QG
Subjt:  FHCGVDGHWKRNCPKYLAEKKAEKEKQG

KAA0050451.1 gag/pol protein [Cucumis melo var. makuwa]7.8e-3264.62Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANVVTSK-KFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQAPTRKAKTT--G
        +ESLPKSF+PF+ NA +NKIE++LTTLLNELQ F++L KSKGKE EANV T+K KF RGSSS +K+GPS    K   K +KK KGK     ++ KTT  G
Subjt:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANVVTSK-KFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQAPTRKAKTT--G

Query:  KCFHCGVDGHWKRNCPKYLAEKKAEKEKQG
        KC+HCG +GHW RNCPKYLA+KKAEKE QG
Subjt:  KCFHCGVDGHWKRNCPKYLAEKKAEKEKQG

KAA0053407.1 gag/pol protein [Cucumis melo var. makuwa]1.7e-3164.12Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANV-VTSKKFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQAPTRKAKTT--G
        +ESLPKSF+PF+TNA +NKIE++LTTLLNELQ F++L K KGKE EANV +T  KF RGSS  +KSGPS    K  +K +KK KGK     +  KTT  G
Subjt:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANV-VTSKKFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQAPTRKAKTT--G

Query:  KCFHCGVDGHWKRNCPKYLAEKKAEKEKQGN
        KC+HCG +GHW RNCPKYLA+KKAEKE QGN
Subjt:  KCFHCGVDGHWKRNCPKYLAEKKAEKEKQGN

TrEMBL top hitse value%identityAlignment
A0A5A7SLD1 Gag/pol protein4.2e-3166.17Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQ----APTRKAKT
        +ESLP+SFL FR+NAVMNKI Y+LTT+LNELQTFESLMK KG++ EANV TS +KF RGS+SGTKS PS S NK  +KKK     K        T+KAKT
Subjt:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANVVTS-KKFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQ----APTRKAKT

Query:  T-GKCFHCGVDGHWKRNCPKYLAEKKAEKEKQG
          G CFHC  +GHWKRNCPKYLAEKK  K KQG
Subjt:  T-GKCFHCGVDGHWKRNCPKYLAEKKAEKEKQG

A0A5A7TGB4 Gag/pol protein1.1e-3164.06Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANVVTSK-KFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQAPTRKAKTTGKC
        +ESLPKSF+PF+TNA +NKIE++LTTLLNELQ F++L K KGKE EANV T+K KF RG SS +KSGPS   N+ I+KK K+   K Q   +K    GKC
Subjt:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANVVTSK-KFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQAPTRKAKTTGKC

Query:  FHCGVDGHWKRNCPKYLAEKKAEKEKQG
        +HCG +GHW RNCPKYLA+KKAEKE QG
Subjt:  FHCGVDGHWKRNCPKYLAEKKAEKEKQG

A0A5A7UA90 Gag/pol protein3.8e-3264.62Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANVVTSK-KFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQAPTRKAKTT--G
        +ESLPKSF+PF+ NA +NKIE++LTTLLNELQ F++L KSKGKE EANV T+K KF RGSSS +K+GPS    K   K +KK KGK     ++ KTT  G
Subjt:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANVVTSK-KFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQAPTRKAKTT--G

Query:  KCFHCGVDGHWKRNCPKYLAEKKAEKEKQG
        KC+HCG +GHW RNCPKYLA+KKAEKE QG
Subjt:  KCFHCGVDGHWKRNCPKYLAEKKAEKEKQG

A0A5A7UFY5 Gag/pol protein8.4e-3264.12Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANV-VTSKKFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQAPTRKAKTT--G
        +ESLPKSF+PF+TNA +NKIE++LTTLLNELQ F++L K KGKE EANV +T  KF RGSS  +KSGPS    K  +K +KK KGK     +  KTT  G
Subjt:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANV-VTSKKFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQAPTRKAKTT--G

Query:  KCFHCGVDGHWKRNCPKYLAEKKAEKEKQGN
        KC+HCG +GHW RNCPKYLA+KKAEKE QGN
Subjt:  KCFHCGVDGHWKRNCPKYLAEKKAEKEKQGN

E2GK51 Gag/pol protein (Fragment)1.7e-3263.64Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANV-VTSKKFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQAP----TRKAKT
        ++SLPKSF+PF+TNA +NKIE++LTTLLNELQ F++L  SKGKE EANV VT +KF+RGSSS  K GPS        K + K KGKG+AP     +K   
Subjt:  MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANV-VTSKKFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQAP----TRKAKT

Query:  TGKCFHCGVDGHWKRNCPKYLAEKKAEKEKQG
         GKCFHC  DGHWKRNCPKYLAEKKAEK  QG
Subjt:  TGKCFHCGVDGHWKRNCPKYLAEKKAEKEKQG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATCTCTTCCGAAGAGTTTCCTGCCATTCCGCACAAATGCGGTGATGAATAAAATAGAGTATAGCCTGACTACTCTCCTCAACGAGCTACAGACTTTTGAGTCCCT
CATGAAATCAAAAGGAAAAGAGAAGGAGGCAAATGTTGTCACTTCGAAGAAGTTCCTAAGAGGATCGTCCTCAGGGACCAAATCTGGTCCTTCTTTTTCTAAGAATAAGG
GTATTCAGAAAAAGAAGAAGAAGGACAAAGGGAAGGGACAGGCTCCCACACGCAAGGCCAAGACCACAGGAAAATGTTTCCACTGTGGTGTTGACGGGCACTGGAAGAGG
AACTGCCCGAAGTACCTTGCAGAAAAGAAAGCTGAGAAAGAAAAACAAGGAAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAATCTCTTCCGAAGAGTTTCCTGCCATTCCGCACAAATGCGGTGATGAATAAAATAGAGTATAGCCTGACTACTCTCCTCAACGAGCTACAGACTTTTGAGTCCCT
CATGAAATCAAAAGGAAAAGAGAAGGAGGCAAATGTTGTCACTTCGAAGAAGTTCCTAAGAGGATCGTCCTCAGGGACCAAATCTGGTCCTTCTTTTTCTAAGAATAAGG
GTATTCAGAAAAAGAAGAAGAAGGACAAAGGGAAGGGACAGGCTCCCACACGCAAGGCCAAGACCACAGGAAAATGTTTCCACTGTGGTGTTGACGGGCACTGGAAGAGG
AACTGCCCGAAGTACCTTGCAGAAAAGAAAGCTGAGAAAGAAAAACAAGGAAACTAG
Protein sequenceShow/hide protein sequence
MESLPKSFLPFRTNAVMNKIEYSLTTLLNELQTFESLMKSKGKEKEANVVTSKKFLRGSSSGTKSGPSFSKNKGIQKKKKKDKGKGQAPTRKAKTTGKCFHCGVDGHWKR
NCPKYLAEKKAEKEKQGN