; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g24930 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g24930
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDUF659 domain-containing protein
Genome locationchr9:18644817..18650605
RNA-Seq ExpressionMoc09g24930
SyntenyMoc09g24930
Gene Ontology termsNA
InterPro domainsIPR007021 - Domain of unknown function DUF659
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFY84497.1 hypothetical protein Acr_03g0012710 [Actinidia rufa]3.3e-6545.94Show/hide
Query:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE--VTSYSRY
        MA+  GG MFLKAVDCSGE+KD++F ANLMK VINEVG QNV+QI+TDN PNCKAA QLIE+QFP+I+W PCVV TLNLALKNICAA+++E   T+YS  
Subjt:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE--VTSYSRY

Query:  TF----------------------CLHDD-------------------MLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDI---------
        ++                       ++++                   MLKRFKLIK GLQAMVINDKWSCYREDDVG+A+ VKD V  D+         
Subjt:  TF----------------------CLHDD-------------------MLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDI---------

Query:  -----------------------------------------------------WYYSEEWLQEDKNRVPPHQDLEVTRERMKCLKRYFSTNEEHSKVHLE
                                                             WYYS++WL E  NRVPPH+D EV+RER+KCLKRYF  + E +KV+ E
Subjt:  -----------------------------------------------------WYYSEEWLQEDKNRVPPHQDLEVTRERMKCLKRYFSTNEEHSKVHLE

Query:  FANFSTMAAEFVDYDSIRER
        FA FS     F D DSI++R
Subjt:  FANFSTMAAEFVDYDSIRER

RWR74797.1 DUF659 domain-containing protein/Dimer_Tnp_hAT domain-containing protein [Cinnamomum micranthum f. kanehirae]7.4e-6542.07Show/hide
Query:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE---------
        MA+ EGG MFLKAVDCSGE KD++FIANLMK VIN+VGH+NV+Q++TDN PNCK A Q+IESQFPNI+W PCVVHTLNLAL NICAA+N+E         
Subjt:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE---------

Query:  ----------------------------------VTSYSRYTFCLHDDMLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDIW--------
                                          + S +   F     MLKRFKLIK GLQAMVI+DKWSCYRE DVG A+ VK+ + DDIW        
Subjt:  ----------------------------------VTSYSRYTFCLHDDMLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDIW--------

Query:  ---------------------------------------------------------------------------------YYSEEWLQEDKNRVPPHQD
                                                                                         YYS+EWLQED +RVPP++D
Subjt:  ---------------------------------------------------------------------------------YYSEEWLQEDKNRVPPHQD

Query:  LEVTRERMKCLKRYFSTNEEHSKVHLEFANFSTMAAEFVDYDSIRER
        +EV+RER KCL +YF T+EE + V++EFANFS    EF +YDS+ +R
Subjt:  LEVTRERMKCLKRYFSTNEEHSKVHLEFANFSTMAAEFVDYDSIRER

XP_022156304.1 uncharacterized protein LOC111023231 isoform X1 [Momordica charantia]3.9e-7448.13Show/hide
Query:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE---------
        MAI +G  +FLK VDCSGEVKD++FI NL+K VINEVGHQN+IQI+TDN PNC+AA Q+IESQF NIVW PCVV TLNLALKNIC+++NIE         
Subjt:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE---------

Query:  ----------------------------------VTSYSRYTFCLHDDMLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDIW--------
                                          + S +   F     MLKRFKLIKSGLQAM I+DKWSCYREDDVGKAKH+KDLV +DIW        
Subjt:  ----------------------------------VTSYSRYTFCLHDDMLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDIW--------

Query:  ---------------------------------------------------------------------------------YYSEEWLQEDKNRVPPHQD
                                                                                         YYSE+WLQEDKNRVPPHQD
Subjt:  ---------------------------------------------------------------------------------YYSEEWLQEDKNRVPPHQD

Query:  LEVTRERMKCLKRYFSTNEEHSKVHLEFANFSTMAAEFVDYDSIRER
        LEVTRERMK +KRYFS+NEE +KV LEFANFSTMAAEF DYDSIRER
Subjt:  LEVTRERMKCLKRYFSTNEEHSKVHLEFANFSTMAAEFVDYDSIRER

XP_022156306.1 uncharacterized protein LOC111023231 isoform X2 [Momordica charantia]3.9e-7448.13Show/hide
Query:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE---------
        MAI +G  +FLK VDCSGEVKD++FI NL+K VINEVGHQN+IQI+TDN PNC+AA Q+IESQF NIVW PCVV TLNLALKNIC+++NIE         
Subjt:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE---------

Query:  ----------------------------------VTSYSRYTFCLHDDMLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDIW--------
                                          + S +   F     MLKRFKLIKSGLQAM I+DKWSCYREDDVGKAKH+KDLV +DIW        
Subjt:  ----------------------------------VTSYSRYTFCLHDDMLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDIW--------

Query:  ---------------------------------------------------------------------------------YYSEEWLQEDKNRVPPHQD
                                                                                         YYSE+WLQEDKNRVPPHQD
Subjt:  ---------------------------------------------------------------------------------YYSEEWLQEDKNRVPPHQD

Query:  LEVTRERMKCLKRYFSTNEEHSKVHLEFANFSTMAAEFVDYDSIRER
        LEVTRERMK +KRYFS+NEE +KV LEFANFSTMAAEF DYDSIRER
Subjt:  LEVTRERMKCLKRYFSTNEEHSKVHLEFANFSTMAAEFVDYDSIRER

XP_031743157.1 uncharacterized protein LOC116404561 [Cucumis sativus]3.3e-6541.71Show/hide
Query:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE----VTSYS
        MAI  G  MFLK+VDCSGE+KD++FIAN MK VINEVGH+NV+Q++TDN PNCK A QLIE+QFP I+W PCVVHTLNLALKNICA +N+E    V    
Subjt:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE----VTSYS

Query:  RYTFCLHDD---------------------------------------MLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDIW--------
         + F +  D                                       MLKRFKLIK GLQAMVI+DKW  YREDDV KA HVK+LV D IW        
Subjt:  RYTFCLHDD---------------------------------------MLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDIW--------

Query:  ---------------------------------------------------------------------------------YYSEEWLQEDKNRVPPHQD
                                                                                         YYSEEWL ED NRV PHQD
Subjt:  ---------------------------------------------------------------------------------YYSEEWLQEDKNRVPPHQD

Query:  LEVTRERMKCLKRYFSTNEEHSKVHLEFANFSTMAAEFVDYDSIRERTTFLTEDGLFDMIR--ASSDTKAPQRQEKLPKKVQAVQAKSKFACLSDWNT
        +E+TRERMKC+KRYFS++E+ +KV++E A FST   +F DYDSI ER T        D I   A+  T AP  Q K+  KV   Q  S   C  +W+T
Subjt:  LEVTRERMKCLKRYFSTNEEHSKVHLEFANFSTMAAEFVDYDSIRERTTFLTEDGLFDMIR--ASSDTKAPQRQEKLPKKVQAVQAKSKFACLSDWNT

TrEMBL top hitse value%identityAlignment
A0A443N8D6 DUF659 domain-containing protein/Dimer_Tnp_hAT domain-containing protein3.6e-6542.07Show/hide
Query:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE---------
        MA+ EGG MFLKAVDCSGE KD++FIANLMK VIN+VGH+NV+Q++TDN PNCK A Q+IESQFPNI+W PCVVHTLNLAL NICAA+N+E         
Subjt:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE---------

Query:  ----------------------------------VTSYSRYTFCLHDDMLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDIW--------
                                          + S +   F     MLKRFKLIK GLQAMVI+DKWSCYRE DVG A+ VK+ + DDIW        
Subjt:  ----------------------------------VTSYSRYTFCLHDDMLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDIW--------

Query:  ---------------------------------------------------------------------------------YYSEEWLQEDKNRVPPHQD
                                                                                         YYS+EWLQED +RVPP++D
Subjt:  ---------------------------------------------------------------------------------YYSEEWLQEDKNRVPPHQD

Query:  LEVTRERMKCLKRYFSTNEEHSKVHLEFANFSTMAAEFVDYDSIRER
        +EV+RER KCL +YF T+EE + V++EFANFS    EF +YDS+ +R
Subjt:  LEVTRERMKCLKRYFSTNEEHSKVHLEFANFSTMAAEFVDYDSIRER

A0A5B7AFB0 Uncharacterized protein2.7e-6038.13Show/hide
Query:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE---------
        MA+ E G MFLK VDCSGE KD++FIANLM+ VINEVGH+NVIQI+TDN PNCK A Q+IESQF NI W PCVVHTLNLALKNICAA+N+E         
Subjt:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE---------

Query:  ----------------------------------VTSYSRYTFCLHDDMLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDIW--------
                                          + S +   F     M +RFKLIK GLQAMVI+DKWS Y+EDDVG+ + VK+ V +DIW        
Subjt:  ----------------------------------VTSYSRYTFCLHDDMLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDIW--------

Query:  ---------------------------------------------------------------------------------YYSEEWLQEDKNRVPPHQD
                                                                                         YYS EWL E+ NRVPP+++
Subjt:  ---------------------------------------------------------------------------------YYSEEWLQEDKNRVPPHQD

Query:  LEVTRERMKCLKRYFSTNEEHSKVHLEFANFSTMAAEFVDYDSIRERTTFLTEDGLFDMIRASSDTKAPQRQEKLPKKVQAVQAKSKFACLSDWNT
         E+++ER+KCLKRYFS +E+ +KV +E+A FST + +F   DSI +R  ++ +   + +I  SS   AP  Q    K +  VQ  S   C  +W+T
Subjt:  LEVTRERMKCLKRYFSTNEEHSKVHLEFANFSTMAAEFVDYDSIRERTTFLTEDGLFDMIRASSDTKAPQRQEKLPKKVQAVQAKSKFACLSDWNT

A0A6J1DT13 uncharacterized protein LOC111023231 isoform X11.9e-7448.13Show/hide
Query:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE---------
        MAI +G  +FLK VDCSGEVKD++FI NL+K VINEVGHQN+IQI+TDN PNC+AA Q+IESQF NIVW PCVV TLNLALKNIC+++NIE         
Subjt:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE---------

Query:  ----------------------------------VTSYSRYTFCLHDDMLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDIW--------
                                          + S +   F     MLKRFKLIKSGLQAM I+DKWSCYREDDVGKAKH+KDLV +DIW        
Subjt:  ----------------------------------VTSYSRYTFCLHDDMLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDIW--------

Query:  ---------------------------------------------------------------------------------YYSEEWLQEDKNRVPPHQD
                                                                                         YYSE+WLQEDKNRVPPHQD
Subjt:  ---------------------------------------------------------------------------------YYSEEWLQEDKNRVPPHQD

Query:  LEVTRERMKCLKRYFSTNEEHSKVHLEFANFSTMAAEFVDYDSIRER
        LEVTRERMK +KRYFS+NEE +KV LEFANFSTMAAEF DYDSIRER
Subjt:  LEVTRERMKCLKRYFSTNEEHSKVHLEFANFSTMAAEFVDYDSIRER

A0A6J1DUJ6 uncharacterized protein LOC111023231 isoform X21.9e-7448.13Show/hide
Query:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE---------
        MAI +G  +FLK VDCSGEVKD++FI NL+K VINEVGHQN+IQI+TDN PNC+AA Q+IESQF NIVW PCVV TLNLALKNIC+++NIE         
Subjt:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE---------

Query:  ----------------------------------VTSYSRYTFCLHDDMLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDIW--------
                                          + S +   F     MLKRFKLIKSGLQAM I+DKWSCYREDDVGKAKH+KDLV +DIW        
Subjt:  ----------------------------------VTSYSRYTFCLHDDMLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDIW--------

Query:  ---------------------------------------------------------------------------------YYSEEWLQEDKNRVPPHQD
                                                                                         YYSE+WLQEDKNRVPPHQD
Subjt:  ---------------------------------------------------------------------------------YYSEEWLQEDKNRVPPHQD

Query:  LEVTRERMKCLKRYFSTNEEHSKVHLEFANFSTMAAEFVDYDSIRER
        LEVTRERMK +KRYFS+NEE +KV LEFANFSTMAAEF DYDSIRER
Subjt:  LEVTRERMKCLKRYFSTNEEHSKVHLEFANFSTMAAEFVDYDSIRER

A0A7J0EFU0 DUF659 domain-containing protein1.6e-6545.94Show/hide
Query:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE--VTSYSRY
        MA+  GG MFLKAVDCSGE+KD++F ANLMK VINEVG QNV+QI+TDN PNCKAA QLIE+QFP+I+W PCVV TLNLALKNICAA+++E   T+YS  
Subjt:  MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIE--VTSYSRY

Query:  TF----------------------CLHDD-------------------MLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDI---------
        ++                       ++++                   MLKRFKLIK GLQAMVINDKWSCYREDDVG+A+ VKD V  D+         
Subjt:  TF----------------------CLHDD-------------------MLKRFKLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDI---------

Query:  -----------------------------------------------------WYYSEEWLQEDKNRVPPHQDLEVTRERMKCLKRYFSTNEEHSKVHLE
                                                             WYYS++WL E  NRVPPH+D EV+RER+KCLKRYF  + E +KV+ E
Subjt:  -----------------------------------------------------WYYSEEWLQEDKNRVPPHQDLEVTRERMKCLKRYFSTNEEHSKVHLE

Query:  FANFSTMAAEFVDYDSIRER
        FA FS     F D DSI++R
Subjt:  FANFSTMAAEFVDYDSIRER

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G17450.1 hAT dimerisation domain-containing protein2.3e-0828.57Show/hide
Query:  GSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKN
        G  F  ++D +  V+D   +   +  +++++G +NV+Q++T NT   ++A +L+E +  N+ W PC +H   L L++
Subjt:  GSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKN

AT3G22220.1 hAT transposon superfamily2.0e-0731.08Show/hide
Query:  MFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALK
        +FLK+VD S  +     +  L+K V+ E+G  NV+Q++T    +  AA + +   +P++ W+PC  H ++  L+
Subjt:  MFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALK

AT3G22220.2 hAT transposon superfamily2.0e-0731.08Show/hide
Query:  MFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALK
        +FLK+VD S  +     +  L+K V+ E+G  NV+Q++T    +  AA + +   +P++ W+PC  H ++  L+
Subjt:  MFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALK

AT4G08267.1 hAT transposon superfamily protein8.9e-0850Show/hide
Query:  IVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICA----ARNIEVTSYSRY
        +VT+N  N   +  LI ++F  I W PCVVHTLNLALKN CA     RN EV   + Y
Subjt:  IVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICA----ARNIEVTSYSRY

AT5G31412.1 hAT transposon superfamily protein2.1e-0937.5Show/hide
Query:  EGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNI
        +GG  FL + D S       +I   +   I +VG +NV+Q+VTDN  N   A ++++ + PNI W  CV HT++L L+ I
Subjt:  EGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATTATTGAAGGAGGATCTATGTTTCTTAAAGCTGTAGATTGCTCTGGTGAGGTTAAAGACAGACATTTCATAGCGAATTTGATGAAAGGAGTCATTAATGAGGT
GGGACATCAGAATGTAATCCAAATAGTCACTGATAATACTCCTAACTGTAAAGCTGCAAGACAACTTATTGAGTCGCAATTCCCAAACATAGTCTGGATGCCATGTGTAG
TTCACACTCTCAATCTTGCTTTGAAGAATATTTGTGCTGCAAGAAATATTGAAGTTACTTCCTATAGCAGATACACGTTTTGCCTCCACGATGATATGCTTAAAAGATTT
AAGCTCATTAAGAGCGGTTTGCAAGCAATGGTTATTAACGATAAATGGTCTTGCTACAGAGAAGATGACGTGGGGAAAGCAAAACATGTGAAGGATTTGGTATTTGATGA
TATTTGGTATTATAGTGAAGAGTGGCTTCAGGAAGACAAGAATCGAGTCCCACCACATCAAGATTTGGAAGTAACTCGAGAAAGAATGAAATGTCTTAAAAGATATTTTA
GTACAAATGAGGAACACTCAAAAGTACACTTGGAGTTCGCCAATTTCTCTACAATGGCTGCAGAATTTGTTGACTATGATTCTATACGTGAAAGAACTACTTTTCTCACC
GAGGATGGTTTATTTGACATGATTCGTGCATCATCCGACACTAAAGCTCCTCAAAGACAAGAAAAACTCCCAAAGAAAGTACAAGCAGTACAAGCAAAGAGTAAATTTGC
CTGCTTATCTGATTGGAATACTGATGACACTAGGACTATTTCTGTTTGGATTATTGTTGTCAGATCTGCATGGGTGAGAGCAGCTCAACAGCGCTGGCTCAATAAGTCTC
CCATTTCAAAGGTAAGACCAGGTAGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAATTATTGAAGGAGGATCTATGTTTCTTAAAGCTGTAGATTGCTCTGGTGAGGTTAAAGACAGACATTTCATAGCGAATTTGATGAAAGGAGTCATTAATGAGGT
GGGACATCAGAATGTAATCCAAATAGTCACTGATAATACTCCTAACTGTAAAGCTGCAAGACAACTTATTGAGTCGCAATTCCCAAACATAGTCTGGATGCCATGTGTAG
TTCACACTCTCAATCTTGCTTTGAAGAATATTTGTGCTGCAAGAAATATTGAAGTTACTTCCTATAGCAGATACACGTTTTGCCTCCACGATGATATGCTTAAAAGATTT
AAGCTCATTAAGAGCGGTTTGCAAGCAATGGTTATTAACGATAAATGGTCTTGCTACAGAGAAGATGACGTGGGGAAAGCAAAACATGTGAAGGATTTGGTATTTGATGA
TATTTGGTATTATAGTGAAGAGTGGCTTCAGGAAGACAAGAATCGAGTCCCACCACATCAAGATTTGGAAGTAACTCGAGAAAGAATGAAATGTCTTAAAAGATATTTTA
GTACAAATGAGGAACACTCAAAAGTACACTTGGAGTTCGCCAATTTCTCTACAATGGCTGCAGAATTTGTTGACTATGATTCTATACGTGAAAGAACTACTTTTCTCACC
GAGGATGGTTTATTTGACATGATTCGTGCATCATCCGACACTAAAGCTCCTCAAAGACAAGAAAAACTCCCAAAGAAAGTACAAGCAGTACAAGCAAAGAGTAAATTTGC
CTGCTTATCTGATTGGAATACTGATGACACTAGGACTATTTCTGTTTGGATTATTGTTGTCAGATCTGCATGGGTGAGAGCAGCTCAACAGCGCTGGCTCAATAAGTCTC
CCATTTCAAAGGTAAGACCAGGTAGATAG
Protein sequenceShow/hide protein sequence
MAIIEGGSMFLKAVDCSGEVKDRHFIANLMKGVINEVGHQNVIQIVTDNTPNCKAARQLIESQFPNIVWMPCVVHTLNLALKNICAARNIEVTSYSRYTFCLHDDMLKRF
KLIKSGLQAMVINDKWSCYREDDVGKAKHVKDLVFDDIWYYSEEWLQEDKNRVPPHQDLEVTRERMKCLKRYFSTNEEHSKVHLEFANFSTMAAEFVDYDSIRERTTFLT
EDGLFDMIRASSDTKAPQRQEKLPKKVQAVQAKSKFACLSDWNTDDTRTISVWIIVVRSAWVRAAQQRWLNKSPISKVRPGR