CuGenDBv2

Moc08g33460 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g33460
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: DUF659 domain-containing protein
Genome location: chr8:24297974..24308565
RNA-Seq Expression: Moc08g33460
Synteny: Moc08g33460
Gene Ontology terms: NA
InterPro domains: IPR012337 - Ribonuclease H-like superfamily
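The genome location is given in `chr:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention that this page does not state), the genomic span of the gene could be computed as follows. The helper names `parse_location` and `span_length` are illustrative and are not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr8:24297974..24308565")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the span includes any introns and untranslated regions, so it is expected to be longer than the CDS listed further down the page.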


Homology
GenBank top hits (e value, % identity, alignment)
KAG5540905.1 hypothetical protein RHGRI_020969 [Rhododendron griersonianum] (e value: 1.4e-61; identity: 36.23%)
Query:  CRKVTRKDLAEMQKLEDEAEIYKEKNAPKKVPLPPPSHSQSQSFGAMSSYSFSMEHK--KRKGSISAFEKSFNMATRDQLHSEIARMFYSSGNEDVFNEC
        C+ VT++ LA M+KL++EA+   ++NAPKKVPLPP + S   S   +  Y    +     R  +IS  EK+F+ A   +             N+  ++EC
Subjt:  CRKVTRKDLAEMQKLEDEAEIYKEKNAPKKVPLPPPSHSQSQSFGAMSSYSFSMEHK--KRKGSISAFEKSFNMATRDQLHSEIARMFYSSGNEDVFNEC

Query:  SWISEVSGDVMVVKNFIMNHSMRLAMFNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIWWDKIDYIL
        SWI+ ++ DV  +KN+IMNHSMRLA+FN+F+ LKLLS+A TRF+S ++MLK FKL+K+SLQ  +IS +W  YREDD GKAK VK+ VLDDIWWD IDYIL
Subjt:  SWISEVSGDVMVVKNFIMNHSMRLAMFNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIWWDKIDYIL

Query:  SFTGPIYDMIRACDTDRPCLHLVYD--------------------------------------------------------------IVGGPS-------
        SFT P+YDM+R CDTD+PCLHLVYD                                                              +  GPS       
Subjt:  SFTGPIYDMIRACDTDRPCLHLVYD--------------------------------------------------------------IVGGPS-------

Query:  -----------------------------------------------------------GAYAPTLQALAMKLLVQPSSSSCS-----------------
                                                                   GA AP LQ++A+KLLVQPSSSSCS                 
Subjt:  -----------------------------------------------------------GAYAPTLQALAMKLLVQPSSSSCS-----------------

Query:  ---------IFIFYQEELQNMKKEKQNCGSSL------DSFDSFEDVGMLEIASLSLDEPELEAVVFTDDGT--QIGEDGAKN
                 +FI     L + +  +   G +       D+FD+F DVG LEIA LSLDEPELEAVVFTDDG   +IG++  ++
Subjt:  ---------IFIFYQEELQNMKKEKQNCGSSL------DSFDSFEDVGMLEIASLSLDEPELEAVVFTDDGT--QIGEDGAKN

XP_022156304.1 uncharacterized protein LOC111023231 isoform X1 [Momordica charantia] (e value: 2.5e-74; identity: 44.55%)
Query:  FTLQVCRKVTRKDLAEMQKLEDEAEIYKEKNAPKKVPLPPPSHSQSQSFGAMSSYSFSM---EHKKRKGSISAFEKSFNMATRDQLHSEIARMFYSS---
        + + +C+KVT KD+AEMQ+LEDEA+I KEKNAPKKV LPPPSH+Q+QS G+MS YSFS    + KKRK S S  EKSFNM T DQLHSEIA+MFYSS   
Subjt:  FTLQVCRKVTRKDLAEMQKLEDEAEIYKEKNAPKKVPLPPPSHSQSQSFGAMSSYSFSM---EHKKRKGSISAFEKSFNMATRDQLHSEIARMFYSS---

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------GNEDVFNECSWISEVSGDVMVVKNFIMNHSM
                                                                             GNEDVFNEC WIS+ SGDVM+VK+FIMNH M
Subjt:  ---------------------------------------------------------------------GNEDVFNECSWISEVSGDVMVVKNFIMNHSM

Query:  RLAMFNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIWWDKIDYILSFTGPIYDMIRACDTDRPCLHL
        RLAMF EF+SLKLLSIAETRF+ TI MLK FKLIKS LQA  ISDKW+CYREDDVGKAKH+KDLVL+DIWWDKIDYILSFT PIYDMIRACDTD+PCLHL
Subjt:  RLAMFNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIWWDKIDYILSFTGPIYDMIRACDTDRPCLHL

Query:  VYDI
        +YD+
Subjt:  VYDI

XP_022156306.1 uncharacterized protein LOC111023231 isoform X2 [Momordica charantia] (e value: 2.5e-74; identity: 44.55%)
Query:  FTLQVCRKVTRKDLAEMQKLEDEAEIYKEKNAPKKVPLPPPSHSQSQSFGAMSSYSFSM---EHKKRKGSISAFEKSFNMATRDQLHSEIARMFYSS---
        + + +C+KVT KD+AEMQ+LEDEA+I KEKNAPKKV LPPPSH+Q+QS G+MS YSFS    + KKRK S S  EKSFNM T DQLHSEIA+MFYSS   
Subjt:  FTLQVCRKVTRKDLAEMQKLEDEAEIYKEKNAPKKVPLPPPSHSQSQSFGAMSSYSFSM---EHKKRKGSISAFEKSFNMATRDQLHSEIARMFYSS---

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------GNEDVFNECSWISEVSGDVMVVKNFIMNHSM
                                                                             GNEDVFNEC WIS+ SGDVM+VK+FIMNH M
Subjt:  ---------------------------------------------------------------------GNEDVFNECSWISEVSGDVMVVKNFIMNHSM

Query:  RLAMFNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIWWDKIDYILSFTGPIYDMIRACDTDRPCLHL
        RLAMF EF+SLKLLSIAETRF+ TI MLK FKLIKS LQA  ISDKW+CYREDDVGKAKH+KDLVL+DIWWDKIDYILSFT PIYDMIRACDTD+PCLHL
Subjt:  RLAMFNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIWWDKIDYILSFTGPIYDMIRACDTDRPCLHL

Query:  VYDI
        +YD+
Subjt:  VYDI

XP_028073086.1 uncharacterized protein LOC114275269 [Camellia sinensis] (e value: 4.3e-58; identity: 39.38%)
Query:  NEDVFNECSWISEVSGDVMVVKNFIMNHSMRLAMFNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIW
        N   F EC+WISEV GDVM++KNFI NHSMRLAM+NEF+SLKLLS+AETRF+S+I+MLK  KLIK  LQ  VISDKW+CYR+DD+GKAK VKD +LDD W
Subjt:  NEDVFNECSWISEVSGDVMVVKNFIMNHSMRLAMFNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIW

Query:  WDKIDYILSFTGPIYDMIRACDTDRPCLHLVYDI------------------------------------------------------------------
        WD++DYILSFT PIYDMIR CDTDRP LHLVYD+                                                                  
Subjt:  WDKIDYILSFTGPIYDMIRACDTDRPCLHLVYDI------------------------------------------------------------------

Query:  -----------------------------------------VGGP---------------------SGAYAPTLQALAMKLLVQPSSSSCS---------
                                                   GP                      GA APTLQ LA++LLVQPSSSSC+         
Subjt:  -----------------------------------------VGGP---------------------SGAYAPTLQALAMKLLVQPSSSSCS---------

Query:  -----------------IFIFYQEELQNMKKEKQNCGSSL------DSFDSFEDVGMLEIASLSLDEPELEAVVFTDDGTQIGEDG
                         +F+     L + +  + + G +       D+FDSF+DVGMLE+A+LSLDEPE+E V+FTDDG    E G
Subjt:  -----------------IFIFYQEELQNMKKEKQNCGSSL------DSFDSFEDVGMLEIASLSLDEPELEAVVFTDDGTQIGEDG

XP_038891577.1 uncharacterized protein LOC120080967 [Benincasa hispida] (e value: 5.4e-61; identity: 40.88%)
Query:  FTLQVCRKVTRKDLAEMQKLEDEAEIYKEKNAPKKVPLPPPSHSQ--SQSFGA-----MSSYSFS---MEHKKRKGSISAFEKSFNMATRDQLHSEIARM
        F + +C+K+T KDLAEMQKLEDEA+    +NAPK+VPLPP  H Q  SQSFG      + SYS S   ME KKRKG++SA EKSFN+A RDQ+ SEIARM
Subjt:  FTLQVCRKVTRKDLAEMQKLEDEAEIYKEKNAPKKVPLPPPSHSQ--SQSFGA-----MSSYSFS---MEHKKRKGSISAFEKSFNMATRDQLHSEIARM

Query:  FYSSG-----------------------------------------------------------------------------------------------
        F SSG                                                                                               
Subjt:  FYSSG-----------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------NEDVFNECSWISEVSGDVMVVKN
                                                                                     N+ VF E SWISE+S DVM VK+
Subjt:  -----------------------------------------------------------------------------NEDVFNECSWISEVSGDVMVVKN

Query:  FIMNHSMRLAMFNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIWWDKIDYILSFTGPIYDMIRACDT
        FIMNHSMRLAMFNEF+SLKLL++AETRFSSTII+L+ FKLIK  LQ  VIS+KW CYREDD+ KA+ VK LVL+DIWWDKIDYIL FT  IYDMIR CDT
Subjt:  FIMNHSMRLAMFNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIWWDKIDYILSFTGPIYDMIRACDT

Query:  DRPCLHLVYDI
        D+PCLHLVYDI
Subjt:  DRPCLHLVYDI

TrEMBL top hits (e value, % identity, alignment)
A0A443N8D6 DUF659 domain-containing protein/Dimer_Tnp_hAT domain-containing protein (e value: 1.3e-55; identity: 39.42%)
Query:  NEDVFNECSWISEVSGDVMVVKNFIMNHSMRLAMFNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIW
        N+  + ECSWI ++ GDVM +K+FIMNHSMRLAMFNEF++LKLLS+A+TRF+S+I+MLK FKLIK  LQA VISDKW+CYRE DVG A+ VK+ +LDDIW
Subjt:  NEDVFNECSWISEVSGDVMVVKNFIMNHSMRLAMFNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIW

Query:  WDKIDYILSFTGPIYDMIRACDTDRPCLHLVYD--------------------------------------------------------------IVGGP
        WD IDYILSFT PIYDM+R CDTD+PCLHLVYD                                                              +   P
Subjt:  WDKIDYILSFTGPIYDMIRACDTDRPCLHLVYD--------------------------------------------------------------IVGGP

Query:  S------------------------------------------------------------------GAYAPTLQALAMKLLVQPSSSSCS---------
        S                                                                  GA AP LQ+LA KLL+QPSSSSC          
Subjt:  S------------------------------------------------------------------GAYAPTLQALAMKLLVQPSSSSCS---------

Query:  -----------------IFIFYQEEL------QNMKKEKQNCGSSLDSFDSFEDVGMLEIASLSLDEPELEAVVFTDD
                         +FI     L      Q M+ E +    + D++DSFEDVG+LE+A+LSLDEPELE VVFTDD
Subjt:  -----------------IFIFYQEEL------QNMKKEKQNCGSSLDSFDSFEDVGMLEIASLSLDEPELEAVVFTDD

A0A5B7AFB0 Uncharacterized protein (e value: 1.0e-57; identity: 30.05%)
Query:  CRKVTRKDLAEMQKLEDEAEIYKEKNAPKKVPLPPPSHSQSQSFGAMSSYSFSMEHKKRK----GSISAFEKSFNMATRDQLHSEIARMFYSSG------
        C KVT KD+ EMQKLEDE ++  + NA KKVPLP   HS     G+ S      + KKRK    GS +  EK+FNM   +QLH+EIARMFYSSG      
Subjt:  CRKVTRKDLAEMQKLEDEAEIYKEKNAPKKVPLPPPSHSQSQSFGAMSSYSFSMEHKKRK----GSISAFEKSFNMATRDQLHSEIARMFYSSG------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------NEDVFNECSWISEVSGDVMVVKNFIMNHSMRLAM
                                                                          N+  +NECSWIS+++GDVM +K+FIMNHS+RL M
Subjt:  ------------------------------------------------------------------NEDVFNECSWISEVSGDVMVVKNFIMNHSMRLAM

Query:  FNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIWWDKIDYILSFTGPIYDMIRACDTDRPCLHLVYDI
        FNEF++LKLLS+A+TRF+S I+M + FKLIK  LQA VISDKW+ Y+EDDVG+ + VK+ VL+DIWWD IDYILSFT PIY+M++ACDTD+PCLHLVYD+
Subjt:  FNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIWWDKIDYILSFTGPIYDMIRACDTDRPCLHLVYDI

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------VGGPS------GAYAPTLQALAMKLLVQPSSSSCS--------------------------IFIFYQEEL------QN
                              +  P       G+ AP LQ+LA+KLLVQPSSSSC                           +F+     L      Q 
Subjt:  ----------------------VGGPS------GAYAPTLQALAMKLLVQPSSSSCS--------------------------IFIFYQEEL------QN

Query:  MKKEKQNCGSSLDSFDSFEDVGMLEIASLSLDEPELEAVVFTDDGTQIGEDGAKNVGEN
        M+ E +    + D+FDSFEDVG+LE+A+LSLDEPELEAVVF DDG +  E+    VG N
Subjt:  MKKEKQNCGSSLDSFDSFEDVGMLEIASLSLDEPELEAVVFTDDGTQIGEDGAKNVGEN

A0A6J1DT13 uncharacterized protein LOC111023231 isoform X1 (e value: 1.2e-74; identity: 44.55%)
Query:  FTLQVCRKVTRKDLAEMQKLEDEAEIYKEKNAPKKVPLPPPSHSQSQSFGAMSSYSFSM---EHKKRKGSISAFEKSFNMATRDQLHSEIARMFYSS---
        + + +C+KVT KD+AEMQ+LEDEA+I KEKNAPKKV LPPPSH+Q+QS G+MS YSFS    + KKRK S S  EKSFNM T DQLHSEIA+MFYSS   
Subjt:  FTLQVCRKVTRKDLAEMQKLEDEAEIYKEKNAPKKVPLPPPSHSQSQSFGAMSSYSFSM---EHKKRKGSISAFEKSFNMATRDQLHSEIARMFYSS---

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------GNEDVFNECSWISEVSGDVMVVKNFIMNHSM
                                                                             GNEDVFNEC WIS+ SGDVM+VK+FIMNH M
Subjt:  ---------------------------------------------------------------------GNEDVFNECSWISEVSGDVMVVKNFIMNHSM

Query:  RLAMFNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIWWDKIDYILSFTGPIYDMIRACDTDRPCLHL
        RLAMF EF+SLKLLSIAETRF+ TI MLK FKLIKS LQA  ISDKW+CYREDDVGKAKH+KDLVL+DIWWDKIDYILSFT PIYDMIRACDTD+PCLHL
Subjt:  RLAMFNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIWWDKIDYILSFTGPIYDMIRACDTDRPCLHL

Query:  VYDI
        +YD+
Subjt:  VYDI

A0A6J1DUJ6 uncharacterized protein LOC111023231 isoform X2 (e value: 1.2e-74; identity: 44.55%)
Query:  FTLQVCRKVTRKDLAEMQKLEDEAEIYKEKNAPKKVPLPPPSHSQSQSFGAMSSYSFSM---EHKKRKGSISAFEKSFNMATRDQLHSEIARMFYSS---
        + + +C+KVT KD+AEMQ+LEDEA+I KEKNAPKKV LPPPSH+Q+QS G+MS YSFS    + KKRK S S  EKSFNM T DQLHSEIA+MFYSS   
Subjt:  FTLQVCRKVTRKDLAEMQKLEDEAEIYKEKNAPKKVPLPPPSHSQSQSFGAMSSYSFSM---EHKKRKGSISAFEKSFNMATRDQLHSEIARMFYSS---

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------GNEDVFNECSWISEVSGDVMVVKNFIMNHSM
                                                                             GNEDVFNEC WIS+ SGDVM+VK+FIMNH M
Subjt:  ---------------------------------------------------------------------GNEDVFNECSWISEVSGDVMVVKNFIMNHSM

Query:  RLAMFNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIWWDKIDYILSFTGPIYDMIRACDTDRPCLHL
        RLAMF EF+SLKLLSIAETRF+ TI MLK FKLIKS LQA  ISDKW+CYREDDVGKAKH+KDLVL+DIWWDKIDYILSFT PIYDMIRACDTD+PCLHL
Subjt:  RLAMFNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIWWDKIDYILSFTGPIYDMIRACDTDRPCLHL

Query:  VYDI
        +YD+
Subjt:  VYDI

A0A7J0FQI8 BED-type domain-containing protein (e value: 6.3e-55; identity: 42.01%)
Query:  NEDVFNECSWISEVSGDVMVVKNFIMNHSMRLAMFNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIW
        NE  ++ECSWIS+++GD  ++KNFI NHSMRLAM+NEF+SLKLLS+AETRF+STI+MLK FKLIK  LQA VI+DKW+CYREDDVG+A+ VKD VL D+W
Subjt:  NEDVFNECSWISEVSGDVMVVKNFIMNHSMRLAMFNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIW

Query:  WDKIDYILSFTGPIYDMIRACDTDRPCLHLVYDIVG--------------GPSGAYAPTLQALAMKLLVQPSSSSCSIF---------IFYQEEL-----
        WD IDYIL FTGPIYDMIRACDTD PCLHLVYD+                G       T   +  ++LV   + + +            +Y ++      
Subjt:  WDKIDYILSFTGPIYDMIRACDTDRPCLHLVYDIVG--------------GPSGAYAPTLQALAMKLLVQPSSSSCSIF---------IFYQEEL-----

Query:  --------QNMKKEKQNC----------------------------------------GSSL----------------------DSFDSFEDVGMLEIAS
                + + +E+  C                                         S+L                      D+FDS +DVG+LEIA+
Subjt:  --------QNMKKEKQNC----------------------------------------GSSL----------------------DSFDSFEDVGMLEIAS

Query:  LSLDEPELEAVVFTDDGTQ
        LSLDEPE+EAV+F D G +
Subjt:  LSLDEPELEAVVFTDDGTQ

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G08267.1 hAT transposon superfamily protein (e value: 7.7e-05; identity: 41.67%)
Query:  LHSEIARMFYSSGNEDVFNECSWISEVSGDVMVVKNFIMNHSMRLAMFNEFLSLKLLSIA
        L +  A    +  NE V+  C WI  +S +V  +KN IMN+ +RL MF E   LKLL+I+
Subjt:  LHSEIARMFYSSGNEDVFNECSWISEVSGDVMVVKNFIMNHSMRLAMFNEFLSLKLLSIA


Sequences
CDS sequence
ATGAGCGAGTTTGCAATAATTGTTTTTCACAGTAGTGAATGGGATGATAGTCACTGCTACATGAATTACAAAACTATTTGTATTTTGGTTGATGAAGATATGATCTTTCA
CAATTTTAGGGATTTGATATTGAATGAAGTGAAGTTGGATCCATCAATATGTTTCGTCCAGTTTCCAGTTTTATTAAATTTTGGTAGTAATGGAATTCAAACTGTTGTCG
AAATTAATGAAGATAAAGATGTTGCTTGGTTTTTAACTTTGGTTAAGGATGACAGCACAAGATATCTTTTAGTTGCTCATGTAATTACTATGTCTTTGGAGGAGTCTAGT
GTTATTAATTCTGGAAGTGAGAATTTAGGGCTGGTTGTTGCTTCTTCTACTATAATTGAAAGAGATTTTCAAGTTTATAATGATGTTGATATTACTAGTATGTCTTATGC
ATTTCATCTAAAGGAGAATGATCTATTTGCAAATAAAATGGCTACCAATGGTATGGTTGAGAAATTTAGATTTAGTTCTTCTGACCGTTCAACTCCGAAAGATATCATTC
ATCACATGCGCACAAATTATGGAGTTGGTGTTAGTTATAATAAAATTTGGAGGGCAAAAACAACTGTTAATAAATTGTTGAAAGGAGATGCTGATGATTCTTATGCATTG
ATTCCGAAGTTTTTTGTGAAGTTGAAAGAAATGAATCCAGATTTCTTCACGCTACAAGTGTGTCGGAAGGTTACCCGAAAAGATCTTGCTGAAATGCAAAAGTTGGAAGA
TGAAGCAGAAATTTATAAGGAAAAGAATGCTCCTAAAAAAGTTCCTTTACCACCTCCATCCCATAGTCAATCTCAATCATTTGGAGCTATGAGTAGCTATTCCTTTTCAA
TGGAACATAAGAAAAGAAAAGGTAGTATTAGTGCCTTTGAGAAGTCATTTAACATGGCAACCCGTGATCAATTACACTCCGAAATTGCTAGGATGTTTTATTCCTCAGGG
AATGAAGATGTGTTTAATGAGTGTAGTTGGATTTCTGAAGTTTCCGGTGATGTAATGGTTGTGAAGAATTTTATCATGAACCATTCCATGAGGCTTGCTATGTTCAACGA
ATTTTTGTCTCTAAAGTTGCTTTCCATTGCAGAAACACGTTTTTCATCCACAATCATTATGCTTAAAAGTTTTAAGCTTATTAAGAGCAGTTTGCAAGCTACAGTGATTA
GCGATAAATGGGCATGCTATAGAGAAGACGATGTGGGGAAAGCAAAGCATGTGAAAGATTTGGTACTTGATGATATTTGGTGGGATAAGATCGATTATATTCTTTCTTTT
ACTGGACCCATATATGATATGATCAGAGCGTGTGATACAGACAGACCTTGTCTTCATTTGGTATATGATATAGTTGGTGGGCCATCTGGTGCCTATGCACCAACTTTGCA
GGCATTAGCTATGAAGCTACTTGTGCAACCTTCATCTTCCTCGTGTTCAATCTTCATCTTCTATCAAGAAGAACTCCAGAATATGAAAAAGGAGAAACAAAATTGTGGGT
CATCGTTGGATTCTTTTGATTCGTTTGAAGATGTAGGCATGCTTGAGATAGCTAGTCTATCTTTGGATGAGCCAGAATTGGAAGCCGTAGTTTTTACTGATGATGGGACT
CAAATTGGTGAAGATGGTGCTAAAAATGTTGGGGAAAATGTTTAG
mRNA sequence
ATGAGCGAGTTTGCAATAATTGTTTTTCACAGTAGTGAATGGGATGATAGTCACTGCTACATGAATTACAAAACTATTTGTATTTTGGTTGATGAAGATATGATCTTTCA
CAATTTTAGGGATTTGATATTGAATGAAGTGAAGTTGGATCCATCAATATGTTTCGTCCAGTTTCCAGTTTTATTAAATTTTGGTAGTAATGGAATTCAAACTGTTGTCG
AAATTAATGAAGATAAAGATGTTGCTTGGTTTTTAACTTTGGTTAAGGATGACAGCACAAGATATCTTTTAGTTGCTCATGTAATTACTATGTCTTTGGAGGAGTCTAGT
GTTATTAATTCTGGAAGTGAGAATTTAGGGCTGGTTGTTGCTTCTTCTACTATAATTGAAAGAGATTTTCAAGTTTATAATGATGTTGATATTACTAGTATGTCTTATGC
ATTTCATCTAAAGGAGAATGATCTATTTGCAAATAAAATGGCTACCAATGGTATGGTTGAGAAATTTAGATTTAGTTCTTCTGACCGTTCAACTCCGAAAGATATCATTC
ATCACATGCGCACAAATTATGGAGTTGGTGTTAGTTATAATAAAATTTGGAGGGCAAAAACAACTGTTAATAAATTGTTGAAAGGAGATGCTGATGATTCTTATGCATTG
ATTCCGAAGTTTTTTGTGAAGTTGAAAGAAATGAATCCAGATTTCTTCACGCTACAAGTGTGTCGGAAGGTTACCCGAAAAGATCTTGCTGAAATGCAAAAGTTGGAAGA
TGAAGCAGAAATTTATAAGGAAAAGAATGCTCCTAAAAAAGTTCCTTTACCACCTCCATCCCATAGTCAATCTCAATCATTTGGAGCTATGAGTAGCTATTCCTTTTCAA
TGGAACATAAGAAAAGAAAAGGTAGTATTAGTGCCTTTGAGAAGTCATTTAACATGGCAACCCGTGATCAATTACACTCCGAAATTGCTAGGATGTTTTATTCCTCAGGG
AATGAAGATGTGTTTAATGAGTGTAGTTGGATTTCTGAAGTTTCCGGTGATGTAATGGTTGTGAAGAATTTTATCATGAACCATTCCATGAGGCTTGCTATGTTCAACGA
ATTTTTGTCTCTAAAGTTGCTTTCCATTGCAGAAACACGTTTTTCATCCACAATCATTATGCTTAAAAGTTTTAAGCTTATTAAGAGCAGTTTGCAAGCTACAGTGATTA
GCGATAAATGGGCATGCTATAGAGAAGACGATGTGGGGAAAGCAAAGCATGTGAAAGATTTGGTACTTGATGATATTTGGTGGGATAAGATCGATTATATTCTTTCTTTT
ACTGGACCCATATATGATATGATCAGAGCGTGTGATACAGACAGACCTTGTCTTCATTTGGTATATGATATAGTTGGTGGGCCATCTGGTGCCTATGCACCAACTTTGCA
GGCATTAGCTATGAAGCTACTTGTGCAACCTTCATCTTCCTCGTGTTCAATCTTCATCTTCTATCAAGAAGAACTCCAGAATATGAAAAAGGAGAAACAAAATTGTGGGT
CATCGTTGGATTCTTTTGATTCGTTTGAAGATGTAGGCATGCTTGAGATAGCTAGTCTATCTTTGGATGAGCCAGAATTGGAAGCCGTAGTTTTTACTGATGATGGGACT
CAAATTGGTGAAGATGGTGCTAAAAATGTTGGGGAAAATGTTTAG
Protein sequence
MSEFAIIVFHSSEWDDSHCYMNYKTICILVDEDMIFHNFRDLILNEVKLDPSICFVQFPVLLNFGSNGIQTVVEINEDKDVAWFLTLVKDDSTRYLLVAHVITMSLEESS
VINSGSENLGLVVASSTIIERDFQVYNDVDITSMSYAFHLKENDLFANKMATNGMVEKFRFSSSDRSTPKDIIHHMRTNYGVGVSYNKIWRAKTTVNKLLKGDADDSYAL
IPKFFVKLKEMNPDFFTLQVCRKVTRKDLAEMQKLEDEAEIYKEKNAPKKVPLPPPSHSQSQSFGAMSSYSFSMEHKKRKGSISAFEKSFNMATRDQLHSEIARMFYSSG
NEDVFNECSWISEVSGDVMVVKNFIMNHSMRLAMFNEFLSLKLLSIAETRFSSTIIMLKSFKLIKSSLQATVISDKWACYREDDVGKAKHVKDLVLDDIWWDKIDYILSF
TGPIYDMIRACDTDRPCLHLVYDIVGGPSGAYAPTLQALAMKLLVQPSSSSCSIFIFYQEELQNMKKEKQNCGSSLDSFDSFEDVGMLEIASLSLDEPELEAVVFTDDGT
QIGEDGAKNVGENV
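
The protein sequence corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard nuclear genetic code applies (the page does not name the translation table), the relationship can be checked on short fragments copied from the start and end of the CDS above. The function name `translate` is illustrative only.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS string into one-letter amino acids ('*' marks a stop)."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First five codons and last five codons of the CDS listed above.
print(translate("ATGAGCGAGTTTGCA"))
print(translate("GGGGAAAATGTTTAG"))
```

The two fragments translate to the first residues (MSEFA) and the last residues plus the stop codon (GENV*) of the protein sequence. Passing the full CDS, with its line breaks, to the same function would allow the whole record to be compared.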