; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Bhi08G001802 (gene) of Wax gourd (B227) v1 genome

Gene IDBhi08G001802
OrganismBenincasa hispida cv. B227 (Wax gourd (B227) v1)
DescriptionDUF4228 domain-containing protein
Genome locationchr8:62539736..62541124
RNA-Seq ExpressionBhi08G001802
SyntenyBhi08G001802
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583958.1 hypothetical protein SDJN03_19890, partial [Cucurbita argyrosperma subsp. sororia]1.7e-7277.5Show/hide
Query:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSPSSAA---HDAATAPKIVIVPPEADLQRGKIYFLMPLPPTPPNP
        M+NSIRCCISCILPCG LDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSPSS+A    DA+  PKIVIVPPEADLQRGKIYFLMPL   PPNP
Subjt:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSPSSAA---HDAATAPKIVIVPPEADLQRGKIYFLMPLPPTPPNP

Query:  DKPR-----RRKKRDSNNN--INHHSHRTTAAVSAAVP--------NNNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI
        DKPR     RRKKR +NNN   NHHS RT  A ++A          NNNN+ISM+NLLVSD YLSEILS+KASTHRERRRGRVGVWRPHL+SICESPSD+
Subjt:  DKPR-----RRKKRDSNNN--INHHSHRTTAAVSAAVP--------NNNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI

KGN65978.2 hypothetical protein Csa_019723 [Cucumis sativus]5.6e-7685.41Show/hide
Query:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSP-SSAAHDAATA-PKIVIVPPEADLQRGKIYFLMPLPPTPPNPD
        MKNSIRCCISCILPCG LDVIRIVHSNGYVEEI+GSIKASDVMKAHPKHVLKKPSSP SSAAHDAA+A PKIVIVPPEADLQRGKIYFLMPL   PP+PD
Subjt:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSP-SSAAHDAATA-PKIVIVPPEADLQRGKIYFLMPLPPTPPNPD

Query:  KPRRRKKRDSNNNINHHSHRTTAAVSAAVPN-NNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI
        KPRRRKKR+ +N  NHH   T A+ ++AVP+   N+ISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI
Subjt:  KPRRRKKRDSNNNINHHSHRTTAAVSAAVPN-NNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI

XP_004150211.1 uncharacterized protein LOC101205379 [Cucumis sativus]5.6e-7685.41Show/hide
Query:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSP-SSAAHDAATA-PKIVIVPPEADLQRGKIYFLMPLPPTPPNPD
        MKNSIRCCISCILPCG LDVIRIVHSNGYVEEI+GSIKASDVMKAHPKHVLKKPSSP SSAAHDAA+A PKIVIVPPEADLQRGKIYFLMPL   PP+PD
Subjt:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSP-SSAAHDAATA-PKIVIVPPEADLQRGKIYFLMPLPPTPPNPD

Query:  KPRRRKKRDSNNNINHHSHRTTAAVSAAVPN-NNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI
        KPRRRKKR+ +N  NHH   T A+ ++AVP+   N+ISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI
Subjt:  KPRRRKKRDSNNNINHHSHRTTAAVSAAVPN-NNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI

XP_008443296.1 PREDICTED: uncharacterized protein LOC103486913 [Cucumis melo]1.0e-7785.64Show/hide
Query:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSP--SSAAHDAATA-PKIVIVPPEADLQRGKIYFLMPLPPTPPNP
        MKNSIRCCISCILPCG LDVIRIVHSNGYVEEI+GSIKASDVMKAHPKHVLKKPSSP  SSAAHDAA++ PKIVIVPPEADLQRGKIYFLMPL   PP+P
Subjt:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSP--SSAAHDAATA-PKIVIVPPEADLQRGKIYFLMPLPPTPPNP

Query:  DKPRRRKKRDSNNNINHHSHRTTAAVSAAVPN---NNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI
        DKPRRRKKR+ +N  NHH   TTA+ ++AVP+   NNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI
Subjt:  DKPRRRKKRDSNNNINHHSHRTTAAVSAAVPN---NNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI

XP_038894960.1 uncharacterized protein LOC120083324 [Benincasa hispida]5.8e-97100Show/hide
Query:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSPSSAAHDAATAPKIVIVPPEADLQRGKIYFLMPLPPTPPNPDKP
        MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSPSSAAHDAATAPKIVIVPPEADLQRGKIYFLMPLPPTPPNPDKP
Subjt:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSPSSAAHDAATAPKIVIVPPEADLQRGKIYFLMPLPPTPPNPDKP

Query:  RRRKKRDSNNNINHHSHRTTAAVSAAVPNNNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI
        RRRKKRDSNNNINHHSHRTTAAVSAAVPNNNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI
Subjt:  RRRKKRDSNNNINHHSHRTTAAVSAAVPNNNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI

TrEMBL top hitse value%identityAlignment
A0A0A0M094 Uncharacterized protein2.7e-7685.41Show/hide
Query:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSP-SSAAHDAATA-PKIVIVPPEADLQRGKIYFLMPLPPTPPNPD
        MKNSIRCCISCILPCG LDVIRIVHSNGYVEEI+GSIKASDVMKAHPKHVLKKPSSP SSAAHDAA+A PKIVIVPPEADLQRGKIYFLMPL   PP+PD
Subjt:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSP-SSAAHDAATA-PKIVIVPPEADLQRGKIYFLMPLPPTPPNPD

Query:  KPRRRKKRDSNNNINHHSHRTTAAVSAAVPN-NNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI
        KPRRRKKR+ +N  NHH   T A+ ++AVP+   N+ISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI
Subjt:  KPRRRKKRDSNNNINHHSHRTTAAVSAAVPN-NNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI

A0A1S3B8G2 uncharacterized protein LOC1034869135.0e-7885.64Show/hide
Query:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSP--SSAAHDAATA-PKIVIVPPEADLQRGKIYFLMPLPPTPPNP
        MKNSIRCCISCILPCG LDVIRIVHSNGYVEEI+GSIKASDVMKAHPKHVLKKPSSP  SSAAHDAA++ PKIVIVPPEADLQRGKIYFLMPL   PP+P
Subjt:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSP--SSAAHDAATA-PKIVIVPPEADLQRGKIYFLMPLPPTPPNP

Query:  DKPRRRKKRDSNNNINHHSHRTTAAVSAAVPN---NNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI
        DKPRRRKKR+ +N  NHH   TTA+ ++AVP+   NNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI
Subjt:  DKPRRRKKRDSNNNINHHSHRTTAAVSAAVPN---NNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI

A0A5A7UMG1 DUF4228 domain-containing protein5.0e-7885.64Show/hide
Query:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSP--SSAAHDAATA-PKIVIVPPEADLQRGKIYFLMPLPPTPPNP
        MKNSIRCCISCILPCG LDVIRIVHSNGYVEEI+GSIKASDVMKAHPKHVLKKPSSP  SSAAHDAA++ PKIVIVPPEADLQRGKIYFLMPL   PP+P
Subjt:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSP--SSAAHDAATA-PKIVIVPPEADLQRGKIYFLMPLPPTPPNP

Query:  DKPRRRKKRDSNNNINHHSHRTTAAVSAAVPN---NNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI
        DKPRRRKKR+ +N  NHH   TTA+ ++AVP+   NNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI
Subjt:  DKPRRRKKRDSNNNINHHSHRTTAAVSAAVPN---NNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI

A0A6J1KLR7 rhoGEF domain-containing protein gxcI-like5.0e-7075.25Show/hide
Query:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSPSSA---AHDAATAPKIVIVPPEADLQRGKIYFLMPLPPTPPNP
        M+NSIRCCISCILPCG LDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSPSS+   A DA+  PKIVIVPPEADLQRGKIYFLMPL   PPNP
Subjt:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSPSSA---AHDAATAPKIVIVPPEADLQRGKIYFLMPLPPTPPNP

Query:  DKPR-----RRKKRDSNNN--INHHSHRTTAAVSAAVP----------NNNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPS
        DKPR     RRKKR +NNN   NHHS RT  A  A+            NNN++ISM+NL VSD YLSEILS+KASTHRERRRGRVGVWRPHL+SICESPS
Subjt:  DKPR-----RRKKRDSNNN--INHHSHRTTAAVSAAVP----------NNNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPS

Query:  DI
         +
Subjt:  DI

A0A6P5WYR0 ELMO domain-containing protein F-like6.7e-5967.55Show/hide
Query:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSPSSAAHDAATAPKIVIVPPEADLQRGKIYFLMPLPPTP------
        MKN+IRCCISCILPCG LDVIRI+HSNG VEEISGSIKAS++MKAHPKHVLKKPSSPS    D    PKIVIVPP+A+LQRGKIYFLMP+P TP      
Subjt:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSPSSAAHDAATAPKIVIVPPEADLQRGKIYFLMPLPPTP------

Query:  -PNPDKPRRRKKRDSNNNINHHSHRTTAAVSAAVPNNNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSD
          +  K +RR   DS+NN N +SH  +    +    NNN ISMTNLL+SD YLSEILS+K ST R+RRRGRVGVWRPHL+SI E+P+D
Subjt:  -PNPDKPRRRKKRDSNNNINHHSHRTTAAVSAAVPNNNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G06980.1 unknown protein1.1e-2640.44Show/hide
Query:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSPSSAAHDAATAPKIVIVPPEADLQRGKIYFLMPLPPTPPNPDKP
        M NS+RCC++C+LPCG LD+IRIVH NGYVEEI+ SI A ++++A+P HVL KP S            KI+I+ PE++L+RG IYFL+   P    P+K 
Subjt:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSPSSAAHDAATAPKIVIVPPEADLQRGKIYFLMPLPPTPPNPDKP

Query:  RRRKKRDSNNNINHHSHRTTAAVSAAVPNNNNNISMTNLLVSDHYLSEILSDKAS--THRERRR----GRVGVWRPHLQSICE
        RRRK  D+     +  + +  A    +  +   + +      + YL E++S  ++   HR RRR      V  WRP L SI E
Subjt:  RRRKKRDSNNNINHHSHRTTAAVSAAVPNNNNNISMTNLLVSDHYLSEILSDKAS--THRERRR----GRVGVWRPHLQSICE

AT1G29195.1 unknown protein3.8e-4656.38Show/hide
Query:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSPSS--AAHDAATAPKIVIVPPEADLQRGKIYFLMPLPPTPPNPD
        MK +IRCCI+CILPCG LDVIRIVHSNG+VEEISG+I AS++MKAHPKHVLKKPSSP+S     D  +A KIVIVPPEA+LQRGKIYFLMP   +     
Subjt:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSPSS--AAHDAATAPKIVIVPPEADLQRGKIYFLMPLPPTPPNPD

Query:  KPR-RRKKRDSNNNINHHS-----HRTTAAVSAAVPNN---NNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICE
          + RR+K ++N  +   S     HR          +N   + N     L+ SD YL+EILS+K +T ++RR+GRVGVWRPHL+SI E
Subjt:  KPR-RRKKRDSNNNINHHS-----HRTTAAVSAAVPNN---NNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICE

AT2G30230.1 unknown protein4.3e-2637.02Show/hide
Query:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSPSSAAHDAATAPKIVIVPPEADLQRGKIYFLMPLPPTPPNPDKP
        M NS+RCC++C+LPCG LD+IRIVH NG+V+EI+  + A ++++A+P HVL KP S            KI+I+ PE++L+RG IYFL+P    P      
Subjt:  MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSPSSAAHDAATAPKIVIVPPEADLQRGKIYFLMPLPPTPPNPDKP

Query:  RRRKKRDSNNNINHHSHRTTAAVSAAVPNNNNNISMTNLLVSDHYLSEILSDKASTHRERRR----GRVGVWRPHLQSICE
        +R++ R     +   +   +  VS     + + +++    + D  LSE +S     +R RR+      V  WRPHL SI E
Subjt:  RRRKKRDSNNNINHHSHRTTAAVSAAVPNNNNNISMTNLLVSDHYLSEILSDKASTHRERRR----GRVGVWRPHLQSICE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAATAGCATAAGATGCTGCATATCTTGCATTCTTCCATGTGGAGTTCTTGATGTAATTCGCATAGTACACTCCAATGGCTACGTCGAAGAAATCAGTGGCTCCAT
CAAAGCTTCGGACGTCATGAAAGCCCATCCAAAACACGTCCTCAAGAAGCCCTCCTCCCCTTCCTCCGCCGCCCACGATGCCGCCACCGCCCCGAAGATCGTCATCGTCC
CACCAGAAGCCGACCTCCAACGCGGTAAGATTTACTTCCTCATGCCACTCCCTCCTACTCCTCCCAACCCCGACAAGCCCCGCCGAAGAAAGAAGAGAGATTCAAATAAC
AACATTAATCATCATTCTCACCGAACAACCGCCGCCGTCTCTGCCGCCGTACCCAACAACAACAACAACATTTCCATGACCAACCTCCTCGTTTCCGACCATTACCTCTC
CGAAATACTCTCCGACAAAGCCTCCACCCACCGCGAACGGCGGCGCGGCCGTGTCGGCGTTTGGAGACCTCACTTACAAAGCATTTGTGAATCACCCAGTGATATCTAA
mRNA sequenceShow/hide mRNA sequence
GTTGTGTCCAAAATATAAAACATTAAATTAAATAAAGAAGAATGAAAAAAGGAAAAAGAAGAGAAAAAAAATGAAAGAAATTTCTCTCTATCGACCTCTCTCCCATCTCT
CATCTTCCTCTCCCATATCCCATTTTATTAATCTCTCTCCCATATTTTCTCTTTCCTTCGACTCTCTCTCCTCCAAGAGGTTGACGAAACAAAAGAAACCCAAAAATTAA
TATTTTTTTTTTTTTTTAAAAAAAACCACACAGGCAATGAAAAATAGCATAAGATGCTGCATATCTTGCATTCTTCCATGTGGAGTTCTTGATGTAATTCGCATAGTACA
CTCCAATGGCTACGTCGAAGAAATCAGTGGCTCCATCAAAGCTTCGGACGTCATGAAAGCCCATCCAAAACACGTCCTCAAGAAGCCCTCCTCCCCTTCCTCCGCCGCCC
ACGATGCCGCCACCGCCCCGAAGATCGTCATCGTCCCACCAGAAGCCGACCTCCAACGCGGTAAGATTTACTTCCTCATGCCACTCCCTCCTACTCCTCCCAACCCCGAC
AAGCCCCGCCGAAGAAAGAAGAGAGATTCAAATAACAACATTAATCATCATTCTCACCGAACAACCGCCGCCGTCTCTGCCGCCGTACCCAACAACAACAACAACATTTC
CATGACCAACCTCCTCGTTTCCGACCATTACCTCTCCGAAATACTCTCCGACAAAGCCTCCACCCACCGCGAACGGCGGCGCGGCCGTGTCGGCGTTTGGAGACCTCACT
TACAAAGCATTTGTGAATCACCCAGTGATATCTAAAAAACCGGCTATTCAGGTTGTGGGTTTTTTCCTTTTTCTTTTTCTTTTTTTTAAACATAAATATATTTCATCATA
ATTAAATTCCATTTCTTCAATATATATATCATGATCATTCACAAGTAAATAATAATAATAATAAAAAAGTCTTTTTTTCTAGAGGGATCTGAAATTAATTAAGAATTAAT
GTGGTAATTTTAATTAGTTACTGATTTATGCTTAATTAATTTTCAGTTTTTTGTGGAGATTTGTGGTTGTGGTGAAATTTTGAGTGCTTGTAGAGAGAGAGAGAGAGAGA
GAGAGAGAGCCCACTTTCCACTGTTTGAGAAATAATATAATGTGTGCATGAACATAATTTTGAGTGATTGTGATATGATGATATGATTGTACAGCTGCAATAATCTACTT
CCTTTCCCCATTTTTTATGCCCCATTATGCCTTCATTTTTTCTTTCCTATTTTACACTTTTAACTTATTTCATATTTATTCTTAACTTTTCGTTTTTTTAGAATTTTTTC
ATGATTTAAACTTTTTTCAGGTCTATTTAAAATGACAATTGAAGAATTTTTAAATGCGGTAAAAATAAC
Protein sequenceShow/hide protein sequence
MKNSIRCCISCILPCGVLDVIRIVHSNGYVEEISGSIKASDVMKAHPKHVLKKPSSPSSAAHDAATAPKIVIVPPEADLQRGKIYFLMPLPPTPPNPDKPRRRKKRDSNN
NINHHSHRTTAAVSAAVPNNNNNISMTNLLVSDHYLSEILSDKASTHRERRRGRVGVWRPHLQSICESPSDI