CuGenDBv2

Tan0015522 (gene) of Snake gourd v1 genome

Gene ID: Tan0015522
Organism: Trichosanthes anguina (Snake gourd v1)
Description: DUF4219 domain-containing protein
Genome location: LG08:8711877..8712303
RNA-Seq Expression: Tan0015522
Synteny: Tan0015522
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domains: IPR025314 - Domain of unknown function DUF4219
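
The genome location field can be split into its parts programmatically. The following is a minimal sketch, assuming the `SEQID:START..END` layout shown above and assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("LG08:8711877..8712303")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 427 bp on LG08, which appears to agree with the length of the mRNA sequence listed in this record.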


Homology
GenBank top hits (e value, % identity, alignment)
AAG60117.1 copia-type polyprotein, putative [Arabidopsis thaliana] | e value: 1.5e-32 | % identity: 77.89
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA
        MASNN VPFQVPVL KSNYDNWS++MKA+LG+ D+WEIVEKG  EPEN+GSLSQ Q+D LRDSRKRDKKAL LIYQGLD+D FEK+ EA +AKEA
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA

PKU71906.1 hypothetical protein MA16_Dca021744 [Dendrobium catenatum] | e value: 5.6e-35 | % identity: 77.89
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA
        MASN M+PFQVP LN SNYDNWSIKMKALLG QD+WE++EKG+ EP+++ SLSQ Q+DSLRDSRKRDKKALYLI+QGLDDD+FEKI +AK+AKEA
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA

XP_020688719.2 uncharacterized protein LOC110104092 [Dendrobium catenatum] | e value: 2.5e-35 | % identity: 76.84
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA
        MASN M+PFQVP LN SNYDNWSIKMKALLG+QD+WE++EKG+ EP+++ SLSQ Q+DSLRDSRKRDKKALYLI+QGLDDD+FEK+S+ K+AKEA
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA

XP_028216659.1 uncharacterized protein LOC114398694 [Glycine soja] | e value: 8.9e-33 | % identity: 77.89
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA
        M S++MVPFQVP+LN +NYDNWSI MKALLG QD+ EIVEKGH+E EN  SLSQ QRDSLRDSRKRDKKALYLIYQGL+DDAFEK+ +AK+ KEA
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA

XP_028556025.1 uncharacterized protein LOC110109975 [Dendrobium catenatum] | e value: 3.9e-36 | % identity: 78.95
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA
        MASN M+PFQVP LN SNYDNWSIKMKALLG+QD+WE++EKG++EP+++ SLSQ Q+DSLRDSRKRDKKALYLI+QGLDDD+FEKIS+AK+AKEA
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA

TrEMBL top hits (e value, % identity, alignment)
A0A2I0W8B3 DUF4219 domain-containing protein | e value: 2.7e-35 | % identity: 77.89
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA
        MASN M+PFQVP LN SNYDNWSIKMKALLG QD+WE++EKG+ EP+++ SLSQ Q+DSLRDSRKRDKKALYLI+QGLDDD+FEKI +AK+AKEA
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA

Q9C536 Copia-type polyprotein, putative | e value: 7.4e-33 | % identity: 77.89
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA
        MASNN VPFQVPVL KSNYDNWS++MKA+LG+ D+WEIVEKG  EPEN+GSLSQ Q+D LRDSRKRDKKAL LIYQGLD+D FEK+ EA +AKEA
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA

Q9C739 Copia-type polyprotein, putative | e value: 7.4e-33 | % identity: 77.89
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA
        MASNN VPFQVPVL KSNYDNWS++MKA+LG+ D+WEIVEKG  EPEN+GSLSQ Q+D LRDSRKRDKKAL LIYQGLD+D FEK+ EA +AKEA
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA

Q9M2D1 Copia-type polyprotein | e value: 7.4e-33 | % identity: 77.89
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA
        MASNN VPFQVPVL KSNYDNWS++MKA+LG+ D+WEIVEKG  EPEN+GSLSQ Q+D LRDSRKRDKKAL LIYQGLD+D FEK+ EA +AKEA
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA

Q9SXB2 T28P6.8 protein | e value: 7.4e-33 | % identity: 77.89
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA
        MASNN VPFQVPVL KSNYDNWS++MKA+LG+ D+WEIVEKG  EPEN+GSLSQ Q+D LRDSRKRDKKAL LIYQGLD+D FEK+ EA +AKEA
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G48720.1 unknown protein | e value: 9.2e-36 | % identity: 76.6
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKE
        MASNN VPFQVPVL KSNYDNWS++MKA+LG+ D+WEIVEKG  EPEN+GSLSQ Q+D LRDSRKRDKKAL LIYQGLD+D FEK+ EA +AK+
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKE
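
The % identity values in the hit tables can be reproduced from the middle line of each alignment. The sketch below assumes the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch) and assumes the denominator is the alignment length; the record itself does not document either point. The midline is copied from the AAG60117.1 hit, and `percent_identity` is a hypothetical helper name.

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter."""
    # Letters mark identical residues; '+' and ' ' are not counted.
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(midline)


# Midline of the AAG60117.1 alignment, whitespace preserved.
midline = "MASNN VPFQVPVL KSNYDNWS++MKA+LG+ D+WEIVEKG  EPEN+GSLSQ Q+D LRDSRKRDKKAL LIYQGLD+D FEK+ EA +AKEA"

print(len(midline), round(percent_identity(midline), 2))
```

For this hit the midline has 95 columns, 74 of which are letters, giving the 77.89 reported in the table. The alignments here contain no gaps; an alignment with gaps would need the gap columns handled explicitly.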


Sequences
CDS sequence
ATGGCAAGTAACAACATGGTCCCGTTCCAAGTCCCAGTGCTCAACAAGAGCAACTACGATAATTGGAGCATCAAGATGAAGGCTCTCCTTGGATCACAAGATATGTGGGA
GATAGTAGAGAAAGGGCACTCTGAGCCGGAGAATGATGGTAGTCTTTCTCAAATACAAAGGGATAGTTTGAGAGACTCAAGAAAGAGAGACAAGAAGGCTCTCTATCTAA
TCTATCAAGGATTAGATGACGATGCTTTTGAGAAGATTTCAGAAGCAAAGACGGCGAAAGAAGCATGA
mRNA sequence
ATGGCAAGTAACAACATGGTCCCGTTCCAAGTCCCAGTGCTCAACAAGAGCAACTACGATAATTGGAGCATCAAGATGAAGGCTCTCCTTGGATCACAAGATATGTGGGA
GATAGTAGAGAAAGGGCACTCTGAGCCGGAGAATGATGGTAGTCTTTCTCAAATACAAAGGGATAGTTTGAGAGACTCAAGAAAGAGAGACAAGAAGGCTCTCTATCTAA
TCTATCAAGGATTAGATGACGATGCTTTTGAGAAGATTTCAGAAGCAAAGACGGCGAAAGAAGCATGAGAGAAGCTTCAAATATCGTACAAGGGAGCGGATCCAGTAAAA
AAGGTACGTCTTCAAACTCTAAGAGCTGAGTTTGAAACTTTGCATATGAAAGAAGGGGAAGTCATCTCAGATTACTTCTCTAGAGTTTTAACAGTCA
Protein sequence
MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA
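
As a consistency check, the CDS can be translated and compared with the protein sequence. This is a minimal sketch that assumes the standard nuclear genetic code applies; the codon table is built from the conventional TCAG ordering, the two sequences are copied from this record, and `translate` is a hypothetical helper name.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ('*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# CDS lines as listed in the record, joined into one string.
CDS = (
    "ATGGCAAGTAACAACATGGTCCCGTTCCAAGTCCCAGTGCTCAACAAGAGCAACTACGATAATTGGAGCATCAAGATGAAGGCTCTCCTTGGATCACAAGATATGTGGGA"
    "GATAGTAGAGAAAGGGCACTCTGAGCCGGAGAATGATGGTAGTCTTTCTCAAATACAAAGGGATAGTTTGAGAGACTCAAGAAAGAGAGACAAGAAGGCTCTCTATCTAA"
    "TCTATCAAGGATTAGATGACGATGCTTTTGAGAAGATTTCAGAAGCAAAGACGGCGAAAGAAGCATGA"
)

PROTEIN = "MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDMWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEA"


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate(CDS) == PROTEIN)
```

The 288-nucleotide CDS encodes 95 residues followed by a TGA stop codon, consistent with the 95-residue protein sequence listed above.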