; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013200 (gene) of Snake gourd v1 genome

Gene IDTan0013200
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG01:60627866..60628441
RNA-Seq ExpressionTan0013200
SyntenyTan0013200
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-5066.86Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K
        +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS SGTK  PS S NK  +KKK     K    A K       
Subjt:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K

Query:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL
        A G CFHC  +GHWKRNCPKYLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Subjt:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-5066.86Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K
        +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS SGTK  PS S NK  +KKK     K    A K       
Subjt:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K

Query:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL
        A G CFHC  +GHWKRNCPKYLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Subjt:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-5066.86Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K
        +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS SGTK  PS S NK  +KKK     K    A K       
Subjt:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K

Query:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL
        A G CFHC  +GHWKRNCPKYLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Subjt:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-5066.86Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K
        +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS SGTK  PS S NK  +KKK     K    A K       
Subjt:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K

Query:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL
        A G CFHC  +GHWKRNCPKYLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Subjt:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-5066.86Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K
        +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS SGTK  PS S NK  +KKK     K    A K       
Subjt:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K

Query:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL
        A G CFHC  +GHWKRNCPKYLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Subjt:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein9.2e-5166.86Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K
        +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS SGTK  PS S NK  +KKK     K    A K       
Subjt:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K

Query:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL
        A G CFHC  +GHWKRNCPKYLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Subjt:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL

A0A5A7TWB9 Gag/pol protein9.2e-5166.86Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K
        +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS SGTK  PS S NK  +KKK     K    A K       
Subjt:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K

Query:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL
        A G CFHC  +GHWKRNCPKYLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Subjt:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL

A0A5A7V4M1 Gag/pol protein9.2e-5166.86Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K
        +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS SGTK  PS S NK  +KKK     K    A K       
Subjt:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K

Query:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL
        A G CFHC  +GHWKRNCPKYLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Subjt:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL

A0A5D3CPJ6 Gag/pol protein9.2e-5166.86Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K
        +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS SGTK  PS S NK  +KKK     K    A K       
Subjt:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K

Query:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL
        A G CFHC  +GHWKRNCPKYLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Subjt:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL

A0A5D3CSZ6 Gag/pol protein9.2e-5166.86Show/hide
Query:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K
        +ESLP+SFL FR+NAVMNKI Y LTTLLNEL TFESLMK KG++ EANV TS +KF RGS SGTK  PS S NK  +KKK     K    A K       
Subjt:  MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTS-KKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKA-----K

Query:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL
        A G CFHC  +GHWKRNCPKYLAEKK  K KQGKYDLLV+ETCLVE+DDSAWI+DSGATNHVCSSFQ +   ++L
Subjt:  ATGKCFHCGADGHWKRNCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATCTCTTCCGAAGAGTTTCCTGCCATTCCGCACAAATGCGGTGATGAATAAAATAGAGTATAACCTGACTACTCTCCTCAACGAGCTTCATACTTTTGAGTCCCT
GATGAAATCAAAAGGAAAAGAGAAGGAGGCAAATGTTGTCACTTCGAAGAAGTTCCTAAGAGGATCGCCCTCTGGGACTAAATTCAGTCCTTCCTTTTCTAAGAATAAGG
GTATTCAGAAAAAGAAGAAGAAGGACAAAGGGAAGGGACAGGCTCCCGCATGCAAGGCCAAAGCCACAGGAAAATGTTTCCACTGTGGTGCAGACGGGCACTGGAAGAGG
AACTGCCCGAAGTACCTTGCAGAAAAGAAAGCTGAGAAAGAAAAACAAGGAAAATATGATTTACTCGTTATTGAAACATGTTTAGTGGAACATGATGATTCCGCCTGGAT
ATTAGATTCAGGAGCCACTAACCATGTTTGTTCTTCTTTTCAGAAACTAGTTCTGGGCAAGAAATTGTCGATGGAGAGATATCTCTCAGGGTTGGAACGGGAGAGGTTGT
CTCAGCCAAAGCAGTGGGAGAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAATCTCTTCCGAAGAGTTTCCTGCCATTCCGCACAAATGCGGTGATGAATAAAATAGAGTATAACCTGACTACTCTCCTCAACGAGCTTCATACTTTTGAGTCCCT
GATGAAATCAAAAGGAAAAGAGAAGGAGGCAAATGTTGTCACTTCGAAGAAGTTCCTAAGAGGATCGCCCTCTGGGACTAAATTCAGTCCTTCCTTTTCTAAGAATAAGG
GTATTCAGAAAAAGAAGAAGAAGGACAAAGGGAAGGGACAGGCTCCCGCATGCAAGGCCAAAGCCACAGGAAAATGTTTCCACTGTGGTGCAGACGGGCACTGGAAGAGG
AACTGCCCGAAGTACCTTGCAGAAAAGAAAGCTGAGAAAGAAAAACAAGGAAAATATGATTTACTCGTTATTGAAACATGTTTAGTGGAACATGATGATTCCGCCTGGAT
ATTAGATTCAGGAGCCACTAACCATGTTTGTTCTTCTTTTCAGAAACTAGTTCTGGGCAAGAAATTGTCGATGGAGAGATATCTCTCAGGGTTGGAACGGGAGAGGTTGT
CTCAGCCAAAGCAGTGGGAGAAGTGA
Protein sequenceShow/hide protein sequence
MESLPKSFLPFRTNAVMNKIEYNLTTLLNELHTFESLMKSKGKEKEANVVTSKKFLRGSPSGTKFSPSFSKNKGIQKKKKKDKGKGQAPACKAKATGKCFHCGADGHWKR
NCPKYLAEKKAEKEKQGKYDLLVIETCLVEHDDSAWILDSGATNHVCSSFQKLVLGKKLSMERYLSGLERERLSQPKQWEK