; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015843 (gene) of Snake gourd v1 genome

Gene IDTan0015843
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDUF4219 domain-containing protein
Genome locationLG06:16443576..16443890
RNA-Seq ExpressionTan0015843
SyntenyTan0015843
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domainsIPR025314 - Domain of unknown function DUF4219


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAG60117.1 copia-type polyprotein, putative [Arabidopsis thaliana]1.3e-3778.85Show/hide
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ
        MASNN VPFQVPVL KSNYDNWS++MKA+LG+ DVWEIVEKG  EPEN+GSLSQ Q+D LRDSRKRDKKAL LIYQGLD+D FEK+ EA +AKEAWEKL+
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ

Query:  ISYK
         SYK
Subjt:  ISYK

CAB75469.1 copia-type reverse transcriptase-like protein [Arabidopsis thaliana]1.3e-3778.85Show/hide
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ
        MASNN VPFQVPVL KSNYDNWS++MKA+LG+ DVWEIVEKG  EPEN+GSLSQ Q+D LRDSRKRDKKAL LIYQGLD+D FEK+ EA +AKEAWEKL+
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ

Query:  ISYK
         SYK
Subjt:  ISYK

PKU71906.1 hypothetical protein MA16_Dca021744 [Dendrobium catenatum]1.7e-4079.81Show/hide
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ
        MASN M+PFQVP LN SNYDNWSIKMKALLG QDVWE++EKG+ EP+++ SLSQ Q+DSLRDSRKRDKKALYLI+QGLDDD+FEKI +AK+AKEAWEKLQ
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ

Query:  ISYK
         SYK
Subjt:  ISYK

XP_020688719.2 uncharacterized protein LOC110104092 [Dendrobium catenatum]7.5e-4178.85Show/hide
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ
        MASN M+PFQVP LN SNYDNWSIKMKALLG+QDVWE++EKG+ EP+++ SLSQ Q+DSLRDSRKRDKKALYLI+QGLDDD+FEK+S+ K+AKEAWEKLQ
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ

Query:  ISYK
         SYK
Subjt:  ISYK

XP_028556025.1 uncharacterized protein LOC110109975 [Dendrobium catenatum]1.2e-4180.77Show/hide
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ
        MASN M+PFQVP LN SNYDNWSIKMKALLG+QDVWE++EKG++EP+++ SLSQ Q+DSLRDSRKRDKKALYLI+QGLDDD+FEKIS+AK+AKEAWEKLQ
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ

Query:  ISYK
         SYK
Subjt:  ISYK

TrEMBL top hitse value%identityAlignment
A0A2I0W8B3 DUF4219 domain-containing protein8.0e-4179.81Show/hide
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ
        MASN M+PFQVP LN SNYDNWSIKMKALLG QDVWE++EKG+ EP+++ SLSQ Q+DSLRDSRKRDKKALYLI+QGLDDD+FEKI +AK+AKEAWEKLQ
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ

Query:  ISYK
         SYK
Subjt:  ISYK

Q9C536 Copia-type polyprotein, putative6.4e-3878.85Show/hide
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ
        MASNN VPFQVPVL KSNYDNWS++MKA+LG+ DVWEIVEKG  EPEN+GSLSQ Q+D LRDSRKRDKKAL LIYQGLD+D FEK+ EA +AKEAWEKL+
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ

Query:  ISYK
         SYK
Subjt:  ISYK

Q9C739 Copia-type polyprotein, putative6.4e-3878.85Show/hide
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ
        MASNN VPFQVPVL KSNYDNWS++MKA+LG+ DVWEIVEKG  EPEN+GSLSQ Q+D LRDSRKRDKKAL LIYQGLD+D FEK+ EA +AKEAWEKL+
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ

Query:  ISYK
         SYK
Subjt:  ISYK

Q9M2D1 Copia-type polyprotein6.4e-3878.85Show/hide
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ
        MASNN VPFQVPVL KSNYDNWS++MKA+LG+ DVWEIVEKG  EPEN+GSLSQ Q+D LRDSRKRDKKAL LIYQGLD+D FEK+ EA +AKEAWEKL+
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ

Query:  ISYK
         SYK
Subjt:  ISYK

Q9SXB2 T28P6.8 protein6.4e-3878.85Show/hide
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ
        MASNN VPFQVPVL KSNYDNWS++MKA+LG+ DVWEIVEKG  EPEN+GSLSQ Q+D LRDSRKRDKKAL LIYQGLD+D FEK+ EA +AKEAWEKL+
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQ

Query:  ISYK
         SYK
Subjt:  ISYK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G48720.1 unknown protein2.6e-3677.66Show/hide
Query:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKE
        MASNN VPFQVPVL KSNYDNWS++MKA+LG+ DVWEIVEKG  EPEN+GSLSQ Q+D LRDSRKRDKKAL LIYQGLD+D FEK+ EA +AK+
Subjt:  MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGTAACAACATGGTCCCGTTCCAAGTCCCAGTGCTCAACAAGAGCAACTACGATAATTGGAGCATCAAGATGAAGGCTCTCCTTGGATCACAAGATGTGTGGGA
GATAGTAGAGAAAGGGCACTCTGAGCCGGAGAATGATGGTAGTCTTTCTCAAATACAAAGGGATAGTTTGAGAGACTCAAGAAAGAGAGACAAGAAGGCTCTCTATCTAA
TCTATCAAGGATTAGATGACGATGCTTTTGAGAAGATTTCAGAAGCAAAGACGGCGAAAGAAGCATGGGAGAAGCTTCAAATATCGTACAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAGTAACAACATGGTCCCGTTCCAAGTCCCAGTGCTCAACAAGAGCAACTACGATAATTGGAGCATCAAGATGAAGGCTCTCCTTGGATCACAAGATGTGTGGGA
GATAGTAGAGAAAGGGCACTCTGAGCCGGAGAATGATGGTAGTCTTTCTCAAATACAAAGGGATAGTTTGAGAGACTCAAGAAAGAGAGACAAGAAGGCTCTCTATCTAA
TCTATCAAGGATTAGATGACGATGCTTTTGAGAAGATTTCAGAAGCAAAGACGGCGAAAGAAGCATGGGAGAAGCTTCAAATATCGTACAAGTGA
Protein sequenceShow/hide protein sequence
MASNNMVPFQVPVLNKSNYDNWSIKMKALLGSQDVWEIVEKGHSEPENDGSLSQIQRDSLRDSRKRDKKALYLIYQGLDDDAFEKISEAKTAKEAWEKLQISYK