CuGenDBv2

Tan0002053 (gene) of Snake gourd v1 genome

Gene ID: Tan0002053
Organism: Trichosanthes anguina (Snake gourd v1)
Description: DUF4228 domain-containing protein
Genome location: LG05:76310836..76312114
RNA-Seq Expression: Tan0002053
Synteny: Tan0002053
Gene Ontology terms:
  GO:0009451 - RNA modification (biological process)
  GO:0016310 - phosphorylation (biological process)
  GO:0043231 - intracellular membrane-bounded organelle (cellular component)
  GO:0003723 - RNA binding (molecular function)
  GO:0016301 - kinase activity (molecular function)
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant
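
The genome location in this record follows a SEQID:START..END pattern. As a minimal sketch, the string can be split into its parts and the span length computed. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page does not state explicitly.

```python
import re


def parse_location(location):
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG05:76310836..76312114")
print(seqid, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` would need to change.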


Homology
GenBank top hits (e value, % identity, alignment)
KAG6583958.1 hypothetical protein SDJN03_19890, partial [Cucurbita argyrosperma subsp. sororia] (e value 1.6e-72, 77.39% identity)
Query:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSS-----ASADSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKA
        M+N+IRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSS     A +D +SALPKIVIVPPEADLQRGKIYFLMPLPPNPDK 
Subjt:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSS-----ASADSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKA

Query:  RSRSSARRKKRDMITNNNNTTNR--------TATAAVDS---------NAISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPSDI
        RSRSS+RRKKR  +TNNNN  N          A+A  DS         N+IS++NLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHL+SICESPSD+
Subjt:  RSRSSARRKKRDMITNNNNTTNR--------TATAAVDS---------NAISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPSDI

KAG7019577.1 hypothetical protein SDJN02_18538, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value 3.7e-72, 76.24% identity)
Query:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSS-----ASADSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKA
        M+N+IRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSS     A +D +SALPKIVIVPPEADLQRGKIYFLMPLPPNPDK 
Subjt:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSS-----ASADSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKA

Query:  RSRSSARRKKRDMITNNNNTTNR------------TATAAVDS--------NAISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPS
        RSRSS+RRKKR  +TNNNN  N              A+A  DS        N+IS++NLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHL+SICESPS
Subjt:  RSRSSARRKKRDMITNNNNTTNR------------TATAAVDS--------NAISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPS

Query:  DI
        D+
Subjt:  DI

KGN65978.2 hypothetical protein Csa_019723 [Cucumis sativus] (e value 2.9e-69, 78.38% identity)
Query:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSASA----DSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKAR
        MKN+IRCCISCILPCGALDVIRIVHSNGYVEEI+GSIKASDVMKAHPKHVLKKPSS S+    D++SALPKIVIVPPEADLQRGKIYFLMPLPP+PDK R
Subjt:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSASA----DSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKAR

Query:  SRSSARRKKRDMITNNNNTTNRTATAA----VDSNAISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPSDI
             RRKKR+   N++ TT   +TA+      +N+IS+TNLLVSD YLSEILS+KASTHRERRRGRVGVWRPHLQSICESPSDI
Subjt:  SRSSARRKKRDMITNNNNTTNRTATAA----VDSNAISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPSDI

XP_023001645.1 rhoGEF domain-containing protein gxcI-like [Cucurbita maxima] (e value 6.9e-71, 76.12% identity)
Query:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSS-----ASADSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKA
        M+N+IRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSS     A AD +SALPKIVIVPPEADLQRGKIYFLMPLPPNPDK 
Subjt:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSS-----ASADSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKA

Query:  RSRSSARRKKRDMITNNNNTTNR-----------TATAAVDSN--------AISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPSD
        RSRSS+RRKKR  +TNNNN  N            +ATA  D N        +IS++NL VSDQYLSEILSEKASTHRERRRGRVGVWRPHL+SICESPS 
Subjt:  RSRSSARRKKRDMITNNNNTTNR-----------TATAAVDSN--------AISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPSD

Query:  I
        +
Subjt:  I

XP_023519997.1 uncharacterized protein LOC111783307, partial [Cucurbita pepo subsp. pepo] (e value 1.5e-70, 78.06% identity)
Query:  IRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSASADSS--------SALPKIVIVPPEADLQRGKIYFLMPLPPNPDKAR
        IRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSS S+ SS        SALPKIVIVPPEADLQRGKIYFLMPLPPNPDK R
Subjt:  IRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSASADSS--------SALPKIVIVPPEADLQRGKIYFLMPLPPNPDKAR

Query:  SRSSARRKKRDMITNNNNTTN----RTATAAV-----------DSNAISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPSDI
        SRSS+RRKKR  +TNNNN  N    RT  AA            ++N+IS++NLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHL+SICESPSD+
Subjt:  SRSSARRKKRDMITNNNNTTN----RTATAAV-----------DSNAISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPSDI

TrEMBL top hits (e value, % identity, alignment)
A0A0A0M094 Uncharacterized protein (e value 1.4e-69, 78.38% identity)
Query:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSASA----DSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKAR
        MKN+IRCCISCILPCGALDVIRIVHSNGYVEEI+GSIKASDVMKAHPKHVLKKPSS S+    D++SALPKIVIVPPEADLQRGKIYFLMPLPP+PDK R
Subjt:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSASA----DSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKAR

Query:  SRSSARRKKRDMITNNNNTTNRTATAA----VDSNAISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPSDI
             RRKKR+   N++ TT   +TA+      +N+IS+TNLLVSD YLSEILS+KASTHRERRRGRVGVWRPHLQSICESPSDI
Subjt:  SRSSARRKKRDMITNNNNTTNRTATAA----VDSNAISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPSDI

A0A1S3B8G2 uncharacterized protein LOC103486913 (e value 5.9e-68, 76.06% identity)
Query:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSASA-----DSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKA
        MKN+IRCCISCILPCGALDVIRIVHSNGYVEEI+GSIKASDVMKAHPKHVLKKPSS S+     D++S+LPKIVIVPPEADLQRGKIYFLMPLPP+PDK 
Subjt:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSASA-----DSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKA

Query:  RSRSSARRKKRDMITNNNNTTNRTATAA------VDSNAISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPSDI
        R     RRKKR+   N++ T    +TA+       ++N IS+TNLLVSD YLSEILS+KASTHRERRRGRVGVWRPHLQSICESPSDI
Subjt:  RSRSSARRKKRDMITNNNNTTNRTATAA------VDSNAISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPSDI

A0A5A7UMG1 DUF4228 domain-containing protein (e value 5.9e-68, 76.06% identity)
Query:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSASA-----DSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKA
        MKN+IRCCISCILPCGALDVIRIVHSNGYVEEI+GSIKASDVMKAHPKHVLKKPSS S+     D++S+LPKIVIVPPEADLQRGKIYFLMPLPP+PDK 
Subjt:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSASA-----DSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKA

Query:  RSRSSARRKKRDMITNNNNTTNRTATAA------VDSNAISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPSDI
        R     RRKKR+   N++ T    +TA+       ++N IS+TNLLVSD YLSEILS+KASTHRERRRGRVGVWRPHLQSICESPSDI
Subjt:  RSRSSARRKKRDMITNNNNTTNRTATAA------VDSNAISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPSDI

A0A6J1KLR7 rhoGEF domain-containing protein gxcI-like (e value 3.3e-71, 76.12% identity)
Query:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSS-----ASADSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKA
        M+N+IRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSS     A AD +SALPKIVIVPPEADLQRGKIYFLMPLPPNPDK 
Subjt:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSS-----ASADSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKA

Query:  RSRSSARRKKRDMITNNNNTTNR-----------TATAAVDSN--------AISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPSD
        RSRSS+RRKKR  +TNNNN  N            +ATA  D N        +IS++NL VSDQYLSEILSEKASTHRERRRGRVGVWRPHL+SICESPS 
Subjt:  RSRSSARRKKRDMITNNNNTTNR-----------TATAAVDSN--------AISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPSD

Query:  I
        +
Subjt:  I

F6HM99 Uncharacterized protein (e value 2.6e-63, 72.57% identity)
Query:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSASADSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKARSRSS
        MKNTIRCCISCILPCGALDV+RIVHSNG+VEEISG+I AS++MKAHPKHVLKKPSS+S +    +PKIV+VPP+A+LQRGKIYFLMP+PP P+K RSRSS
Subjt:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSASADSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKARSRSS

Query:  ARRKKRDMITNNNNTTNRTATAAVDSNAISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPS
         R+K+RD  +NN NT +  +  A  +N IS+TNLL+SD+YLSEILSEK ST R+RRRGRVGVWRPHL+SI E+PS
Subjt:  ARRKKRDMITNNNNTTNRTATAAVDSNAISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G06980.1 unknown protein (e value 1.4e-29, 43.26% identity)
Query:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSASADSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKARSRSS
        M N++RCC++C+LPCGALD+IRIVH NGYVEEI+ SI A ++++A+P HVL KP      S   + KI+I+ PE++L+RG IYFL+P    P+K R R  
Subjt:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSASADSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKARSRSS

Query:  ARRKKRDMITNNNNTTNRTATAAVDSNAISITNLLVSDQYLSEILSEKAS--THRERRR----GRVGVWRPHLQSICE
          R+K+       N  N +A AA          + + ++YL E++S  ++   HR RRR      V  WRP L SI E
Subjt:  ARRKKRDMITNNNNTTNRTATAAVDSNAISITNLLVSDQYLSEILSEKAS--THRERRR----GRVGVWRPHLQSICE

AT1G29195.1 unknown protein (e value 1.3e-46, 55.56% identity)
Query:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSASADSSS----ALPKIVIVPPEADLQRGKIYFLMPLPPNPD---
        MK TIRCCI+CILPCGALDVIRIVHSNG+VEEISG+I AS++MKAHPKHVLKKPSS ++D       +  KIVIVPPEA+LQRGKIYFLMP   +     
Subjt:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSASADSSS----ALPKIVIVPPEADLQRGKIYFLMPLPPNPD---

Query:  -----KARSRSSARRKKRDMITNNNNTTNRTATAAVDSNAISITN-----LLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICE
             + +S ++A  KKR      +   +       +SN +   N     L+ SD+YL+EILSEK +T ++RR+GRVGVWRPHL+SI E
Subjt:  -----KARSRSSARRKKRDMITNNNNTTNRTATAAVDSNAISITN-----LLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICE

AT2G30230.1 unknown protein (e value 3.1e-29, 39.11% identity)
Query:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSASADSSSALPKIVIVPPEADLQRGKIYFLMP--LPPNPDKARSR
        M N++RCC++C+LPCGALD+IRIVH NG+V+EI+  + A ++++A+P HVL KP      S   + KI+I+ PE++L+RG IYFL+P    P   K + R
Subjt:  MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSASADSSSALPKIVIVPPEADLQRGKIYFLMP--LPPNPDKARSR

Query:  SSARRKKRDMITNNN-NTTNRTATAAVDSNAISITNLLVSDQYLSEILSEKASTHRERRR----GRVGVWRPHLQSICE
           R +K+ + + N+ N+ + +    +D + +++    + D  LSE +S     +R RR+      V  WRPHL SI E
Subjt:  SSARRKKRDMITNNN-NTTNRTATAAVDSNAISITNLLVSDQYLSEILSEKASTHRERRR----GRVGVWRPHLQSICE
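
The unlabeled middle row of each alignment block summarizes how the two sequences compare column by column. The sketch below counts identities and positives from that row, assuming the conventional BLAST midline encoding (a letter marks an identical residue, "+" a conservative substitution, a blank a mismatch or gap). The function name `midline_counts` is hypothetical, and the percentages listed for each hit were computed by the database over the full alignment, so they will not necessarily be reproduced from a single excerpt.

```python
def midline_counts(query_row, midline):
    """Return (identities, positives, aligned_length) for one alignment row.

    ASSUMES the usual BLAST midline convention: letter = identical residue,
    '+' = conservative substitution, blank = mismatch or gap.
    """
    length = len(query_row)
    # Trailing blanks are often stripped from the midline, so pad it back
    # to the width of the query row before counting.
    padded = midline.ljust(length)[:length]
    identities = sum(ch.isalpha() for ch in padded)
    positives = identities + padded.count("+")
    return identities, positives, length


# First ten columns of the first GenBank hit shown above.
identities, positives, length = midline_counts("MKNTIRCCIS", "M+N+IRCCIS")
print(f"{identities}/{length} identical ({100.0 * identities / length:.1f}%)")
print(f"{positives}/{length} positive")
```

Summing the counts over every row of an alignment, rather than one row, gives the figures for the whole hit under the same assumption.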


Sequences
CDS sequence
ATGAAAAACACCATAAGATGCTGCATATCTTGCATTTTACCGTGTGGAGCTCTGGATGTGATTCGCATAGTTCACTCCAATGGCTACGTCGAAGAAATCAGCGGTTCCAT
TAAAGCCTCCGACGTCATGAAAGCCCATCCTAAACACGTCCTTAAAAAGCCCTCCTCCGCCTCCGCCGACTCCTCCTCCGCCCTCCCCAAGATCGTCATCGTCCCTCCGG
AGGCCGACCTCCAGCGCGGTAAGATTTATTTCCTCATGCCGCTCCCTCCCAACCCCGACAAGGCTCGCTCCCGATCCTCCGCCAGAAGAAAGAAAAGAGATATGATTACT
AATAATAACAACACCACCAATCGAACCGCCACCGCCGCCGTCGACAGCAACGCCATTTCCATCACCAACCTCCTCGTTTCCGATCAGTACCTCTCCGAAATACTCTCCGA
GAAGGCCTCCACCCACCGCGAACGGCGGCGCGGCCGTGTCGGCGTCTGGAGACCTCACTTACAGAGCATTTGTGAGTCGCCCAGTGATATCTAG
mRNA sequence
GTTGTGTCCAAAATATAAAACATTAATTAATAAAGAAGGGAAGGAAAAAGAAAATGAATCTGTCTCTCTCTCGATTTCTCAATCTCTTCCTAATTCTCTCTTTTTCCGTT
TTATCAAATTTTCTCTTTCCTTCCACTCTCTCTCCTCCGAGAGGTCGACGAAACTGAAGAAACCCAAAAAAAAAAACAGAGGCGATGAAAAACACCATAAGATGCTGCAT
ATCTTGCATTTTACCGTGTGGAGCTCTGGATGTGATTCGCATAGTTCACTCCAATGGCTACGTCGAAGAAATCAGCGGTTCCATTAAAGCCTCCGACGTCATGAAAGCCC
ATCCTAAACACGTCCTTAAAAAGCCCTCCTCCGCCTCCGCCGACTCCTCCTCCGCCCTCCCCAAGATCGTCATCGTCCCTCCGGAGGCCGACCTCCAGCGCGGTAAGATT
TATTTCCTCATGCCGCTCCCTCCCAACCCCGACAAGGCTCGCTCCCGATCCTCCGCCAGAAGAAAGAAAAGAGATATGATTACTAATAATAACAACACCACCAATCGAAC
CGCCACCGCCGCCGTCGACAGCAACGCCATTTCCATCACCAACCTCCTCGTTTCCGATCAGTACCTCTCCGAAATACTCTCCGAGAAGGCCTCCACCCACCGCGAACGGC
GGCGCGGCCGTGTCGGCGTCTGGAGACCTCACTTACAGAGCATTTGTGAGTCGCCCAGTGATATCTAGAAACCGGCTACTCAGGTTGTGGGTTTTCTTTTTTCTTTTTTA
TAAACATAAAAGCATATATATCATCATAATTAAATTCCATTTCTTCTATATCATGATGATTCATTTCTTATTAAAAAAAAATCCCTAGAAAAAAAACAGTTCATGTAATA
TCCCTTTTCCAATTTTACCAGATTGGATTATTGAAGAGAAGGAGAGAGGGGGATCTGATTCTGGAATAATAAGAACGTGGTAATTAAATTAATTACTGTGGTAAATGAAT
TTCAGTTTGTGGAAATTTATGGTTTTGGGGAGAATTTTGAGTGAAAATCAGTTACTGTTGAGAGAGAGAGAGAGAGGAAGAAAGAGAGCCCACTTTCCAGTTTCCACTGT
GTTAAGAAATATTCATGTGTGTGTGCATGAACATAATTTTGAGTGATTGTGATATGATGATGATTGTACAGCTCCAATAATCTACCTTTCTTTCTCTCCCCAATTTTTCA
TGGCTCATGTGCCATCATTTTTACCCCCTTTTTTTACAGTGAATAATACTTTATATTCTCATATCTCAC
Protein sequence
MKNTIRCCISCILPCGALDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSASADSSSALPKIVIVPPEADLQRGKIYFLMPLPPNPDKARSRSSARRKKRDMIT
NNNNTTNRTATAAVDSNAISITNLLVSDQYLSEILSEKASTHRERRRGRVGVWRPHLQSICESPSDI
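
The protein sequence corresponds to the CDS read codon by codon. A minimal sketch of that translation follows, assuming the standard nuclear genetic code (the page does not name a translation table). The names `CODON_TABLE` and `translate` are hypothetical, and the two inputs are short excerpts from the start and end of the CDS above rather than the full sequence.

```python
from itertools import product

# Standard genetic code, ASSUMED here; codons are enumerated with bases in
# the order T, C, A, G (first base varies slowest), matching the string below.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGAAAAACACCATAAGATGCTGC"))  # first 24 nt of the CDS
print(translate("AGTGATATCTAG"))              # last 12 nt of the CDS
```

Passing the full CDS, with its line breaks removed, to `translate` should give a sequence that can be compared against the protein listed above.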