CuGenDBv2

Moc04g11080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g11080
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase domain-containing protein
Genome location: chr4:8294132..8304961
RNA-Seq Expression: Moc04g11080
Synteny: Moc04g11080
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
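
The genome location in this record is written as chromosome:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the location string can be split and the genomic span measured as follows. The helper names parse_location and span_length are hypothetical and are not part of CuGenDB.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a string such as 'chr4:8294132..8304961' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    # Assumption: 1-based inclusive coordinates, so both endpoints are counted.
    return end - start + 1


chrom, start, end = parse_location("chr4:8294132..8304961")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 10830 bp, which is longer than the 1743-nt CDS listed in the Sequences section.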


Homology
GenBank top hits (e value, % identity, alignment)
TXG57245.1 hypothetical protein EZV62_018558 [Acer yangbiense] (e value: 1.3e-25; identity: 28.89%)
Query:  GVWVMAHRYPSTLPTGFLEIL------LSRPDQLIWHFDKTGIFSVKSGYIATRHLTAQAHPSSSSMEFLMRALKRCG----------------------
        G W  A  + S LP     IL       +R D L WHFD+ G +SV+SGY     L  +A  +S  +    R L +C                       
Subjt:  GVWVMAHRYPSTLPTGFLEIL------LSRPDQLIWHFDKTGIFSVKSGYIATRHLTAQAHPSSSSMEFLMRALKRCG----------------------

Query:  -----VDIVDACSLSGCPGEDSFHVFWFCEESLSRAAFEELSVFLWAIWNFRNGVRNFGARPPGSLG-------LWASDYVRSYRVANSLRLSVAASQGD
             VD++D C   G   E   HV W C          +L+V +W +       RN      G++G        W  ++   +RVAN        S   
Subjt:  -----VDIVDACSLSGCPGEDSFHVFWFCEESLSRAAFEELSVFLWAIWNFRNGVRNFGARPPGSLG-------LWASDYVRSYRVANSLRLSVAASQGD

Query:  QPHVRWSPPPDGFFKLNTDAGFLAVPSRGGLGIIICNARGEVFLAAATSLPRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGLDKMVEDLS
        QP   W PP  G FK+N D  F       G+G+II N +G    A ++ +  C S++  EA A   G+ LAID  +  + LESD+    R L   +   +
Subjt:  QPHVRWSPPPDGFFKLNTDAGFLAVPSRGGLGIIICNARGEVFLAAATSLPRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGLDKMVEDLS

Query:  EVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAHKLATLALSRSLDCVWLEDCPPAVNRIV
        E+G+I+     L + +  +      R  N+ AH LA LA S     VWL+  PP + R+V
Subjt:  EVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAHKLATLALSRSLDCVWLEDCPPAVNRIV

TXG60924.1 hypothetical protein EZV62_012287 [Acer yangbiense] (e value: 2.9e-25; identity: 30.46%)
Query:  SRPDQLIWHFDKTGIFSVKSGYIATRHLTAQAHPS----SSSMEFLMR-----------------------ALKRCGVDIVDACSLSGCPGEDSFHVFWF
        +R D L WHFD+ G +SV+SGY     L  +A  S    SS   FL +                        L R  V+++D C L G   E   HV W 
Subjt:  SRPDQLIWHFDKTGIFSVKSGYIATRHLTAQAHPS----SSSMEFLMR-----------------------ALKRCGVDIVDACSLSGCPGEDSFHVFWF

Query:  CEESLSRAAFEELSVFLWAIWNFRNGVRNFGARPPGSLGLWASDYVRSYRVANSLRLSVAASQGDQPHVRWSPPPDGFFKLNTDAGFLAVPSRGGLGIII
        C          +L+V +W +          G      L  W  ++   +RVAN        S   QP   W PP  G FK+N DA F       G+G+II
Subjt:  CEESLSRAAFEELSVFLWAIWNFRNGVRNFGARPPGSLGLWASDYVRSYRVANSLRLSVAASQGDQPHVRWSPPPDGFFKLNTDAGFLAVPSRGGLGIII

Query:  CNARGEVFLAAATSLPRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGLDKMVEDLSEVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAHKL
         N +G    A ++ +  C S++  EA A   G+ LAID  +  + LESD+    R L   +   +E+G+I+     L + +  +      R  NA AH L
Subjt:  CNARGEVFLAAATSLPRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGLDKMVEDLSEVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAHKL

Query:  ATLALSRSLDCVWLEDCPPAVNRIV
        A LA S     VWL+  PP + R+V
Subjt:  ATLALSRSLDCVWLEDCPPAVNRIV

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia] (e value: 2.7e-39; identity: 32.89%)
Query:  GFLEILLSR---PDQLIWHFDKTGIFSVKSGY-IATRHLTAQAHPSSSSME--------------------FLMRA----------LKRCGVDIVDACSL
        G L I + R    D+LIW+++KTG++SV+SGY +A  +      PSSSS E                    FL R           L + GV+I + C  
Subjt:  GFLEILLSR---PDQLIWHFDKTGIFSVKSGY-IATRHLTAQAHPSSSSME--------------------FLMRA----------LKRCGVDIVDACSL

Query:  SGCPGEDSFHVFWFCE------------------------ESLSRAAFEELSVFLWAIWNFRNG-VRNFGARPPGSLGL----WASDYVRSYRVANSLRL
         G  GEDS H+FW C+                        ESLS+A FEEL V +W +WN RN    N   +    +G+    WA+ Y   +R A S  +
Subjt:  SGCPGEDSFHVFWFCE------------------------ESLSRAAFEELSVFLWAIWNFRNG-VRNFGARPPGSLGL----WASDYVRSYRVANSLRL

Query:  SVAASQGDQPHVRWSPPPDGFFKLNTDAGFLAVPSRGGLGIIICNARGEVFLAAATSLPRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGL
        +   +  +   + W PP +G +K+NTDA FLA     GLGIII N RG+V  AA   L    SVD AEA+A   G+ LA +                 G+
Subjt:  SVAASQGDQPHVRWSPPPDGFFKLNTDAGFLAVPSRGGLGIIICNARGEVFLAAATSLPRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGL

Query:  DKMVEDLSEVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAHKLATLALSRSLDCVWLEDCPPAVNRIVLSQVLWDLV
           +EDLSE G I++  ++  +      F+F  R GN AAH LA  AL      +W+ED P  +   +  + L +L+
Subjt:  DKMVEDLSEVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAHKLATLALSRSLDCVWLEDCPPAVNRIVLSQVLWDLV

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata] (e value: 1.1e-27; identity: 30.3%)
Query:  EILLSR---PDQLIWHFDKTGIFSVKSGYIATRHLTAQAHPSSSSM-----------------------------EFLMRALKRCGVDIV--DACSLSGC
        +I LSR   PD + W +   G+FSVKS Y   R +   A+   +SM                             E L  A+      I+  D CS+   
Subjt:  EILLSR---PDQLIWHFDKTGIFSVKSGYIATRHLTAQAHPSSSSM-----------------------------EFLMRALKRCGVDIV--DACSLSGC

Query:  PGEDSFHVFWFC----------------------------EESLSRAAFEELSVF---LWAIWNFRNGVRNFG-ARPPGSLGLWASDYVRSYRVANSLRL
          E + H  W C                            EE L R   +EL +F    W +W  RN + + G  + P SL L A +Y+  +R A + RL
Subjt:  PGEDSFHVFWFC----------------------------EESLSRAAFEELSVF---LWAIWNFRNGVRNFG-ARPPGSLGLWASDYVRSYRVANSLRL

Query:  SVAASQGDQPHVRWSPPPDGFFKLNTDAGFLAVPSRGGLGIIICNARGEVFLAAATSLPRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGL
         V  +Q     + W PPP G FKLN DA   +   R G G II N +GEV  A + S P   + D+AE LA    +   +DA   RL +E D++     +
Subjt:  SVAASQGDQPHVRWSPPPDGFFKLNTDAGFLAVPSRGGLGIIICNARGEVFLAAATSLPRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGL

Query:  DKMVEDLSEVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAHKLATLAL-SRSLDCVWLEDCPP
           + + S  G +L DI  L   +  V    T R GN  AH LA  A  S   D  W+ED PP
Subjt:  DKMVEDLSEVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAHKLATLAL-SRSLDCVWLEDCPP

XP_030939896.1 uncharacterized protein LOC115964783 [Quercus lobata] (e value: 5.8e-26; identity: 32.01%)
Query:  DQLIWHFDKTGIFSVKSGYIATRHLTAQ---AHPSSSSMEFLMRALKRC--GVDI----VDACSLSGCPGEDSFHVFWFCEESLSRAAFEELSVFLWAIW
        D + W   K  IF+VKS Y   R L  +   A PSS S E  +  L  C    DI    V A    G    D   +  +    + +   E + V  W IW
Subjt:  DQLIWHFDKTGIFSVKSGYIATRHLTAQ---AHPSSSSMEFLMRALKRC--GVDI----VDACSLSGCPGEDSFHVFWFCEESLSRAAFEELSVFLWAIW

Query:  NFRNGVRNFGA-RPPGSLGLWASDYVRSY-RVANSLRLSVAASQGDQ----PHVRWSPPPDGFFKLNTDAGFLAVPSRGGLGIIICNARGEVFLAAATSL
        N RN V + G    PG L   A+  +  + +  NSL+     +   Q      V W PPP+  +KLN DA      +  G G +I ++ GEV  A     
Subjt:  NFRNGVRNFGA-RPPGSLGLWASDYVRSY-RVANSLRLSVAASQGDQ----PHVRWSPPPDGFFKLNTDAGFLAVPSRGGLGIIICNARGEVFLAAATSL

Query:  PRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGLDKMVEDLSEVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAHKLATLALSRSLDCVWLE
        P     D+AE LA    +  AIDA    L +E DS+   RG+    E+ S +G ++ DIR L   +     S   R GN  AH LA  A   S D  W+E
Subjt:  PRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGLDKMVEDLSEVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAHKLATLALSRSLDCVWLE

Query:  DCP
        + P
Subjt:  DCP

TrEMBL top hits (e value, % identity, alignment)
A0A5C7HKG9 Uncharacterized protein (e value: 6.3e-26; identity: 28.89%)
Query:  GVWVMAHRYPSTLPTGFLEIL------LSRPDQLIWHFDKTGIFSVKSGYIATRHLTAQAHPSSSSMEFLMRALKRCG----------------------
        G W  A  + S LP     IL       +R D L WHFD+ G +SV+SGY     L  +A  +S  +    R L +C                       
Subjt:  GVWVMAHRYPSTLPTGFLEIL------LSRPDQLIWHFDKTGIFSVKSGYIATRHLTAQAHPSSSSMEFLMRALKRCG----------------------

Query:  -----VDIVDACSLSGCPGEDSFHVFWFCEESLSRAAFEELSVFLWAIWNFRNGVRNFGARPPGSLG-------LWASDYVRSYRVANSLRLSVAASQGD
             VD++D C   G   E   HV W C          +L+V +W +       RN      G++G        W  ++   +RVAN        S   
Subjt:  -----VDIVDACSLSGCPGEDSFHVFWFCEESLSRAAFEELSVFLWAIWNFRNGVRNFGARPPGSLG-------LWASDYVRSYRVANSLRLSVAASQGD

Query:  QPHVRWSPPPDGFFKLNTDAGFLAVPSRGGLGIIICNARGEVFLAAATSLPRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGLDKMVEDLS
        QP   W PP  G FK+N D  F       G+G+II N +G    A ++ +  C S++  EA A   G+ LAID  +  + LESD+    R L   +   +
Subjt:  QPHVRWSPPPDGFFKLNTDAGFLAVPSRGGLGIIICNARGEVFLAAATSLPRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGLDKMVEDLS

Query:  EVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAHKLATLALSRSLDCVWLEDCPPAVNRIV
        E+G+I+     L + +  +      R  N+ AH LA LA S     VWL+  PP + R+V
Subjt:  EVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAHKLATLALSRSLDCVWLEDCPPAVNRIV

A0A5C7HVU6 CCHC-type domain-containing protein (e value: 1.4e-25; identity: 30.46%)
Query:  SRPDQLIWHFDKTGIFSVKSGYIATRHLTAQAHPS----SSSMEFLMR-----------------------ALKRCGVDIVDACSLSGCPGEDSFHVFWF
        +R D L WHFD+ G +SV+SGY     L  +A  S    SS   FL +                        L R  V+++D C L G   E   HV W 
Subjt:  SRPDQLIWHFDKTGIFSVKSGYIATRHLTAQAHPS----SSSMEFLMR-----------------------ALKRCGVDIVDACSLSGCPGEDSFHVFWF

Query:  CEESLSRAAFEELSVFLWAIWNFRNGVRNFGARPPGSLGLWASDYVRSYRVANSLRLSVAASQGDQPHVRWSPPPDGFFKLNTDAGFLAVPSRGGLGIII
        C          +L+V +W +          G      L  W  ++   +RVAN        S   QP   W PP  G FK+N DA F       G+G+II
Subjt:  CEESLSRAAFEELSVFLWAIWNFRNGVRNFGARPPGSLGLWASDYVRSYRVANSLRLSVAASQGDQPHVRWSPPPDGFFKLNTDAGFLAVPSRGGLGIII

Query:  CNARGEVFLAAATSLPRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGLDKMVEDLSEVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAHKL
         N +G    A ++ +  C S++  EA A   G+ LAID  +  + LESD+    R L   +   +E+G+I+     L + +  +      R  NA AH L
Subjt:  CNARGEVFLAAATSLPRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGLDKMVEDLSEVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAHKL

Query:  ATLALSRSLDCVWLEDCPPAVNRIV
        A LA S     VWL+  PP + R+V
Subjt:  ATLALSRSLDCVWLEDCPPAVNRIV

A0A6J1DAR4 uncharacterized protein LOC111018954 (e value: 1.3e-39; identity: 32.89%)
Query:  GFLEILLSR---PDQLIWHFDKTGIFSVKSGY-IATRHLTAQAHPSSSSME--------------------FLMRA----------LKRCGVDIVDACSL
        G L I + R    D+LIW+++KTG++SV+SGY +A  +      PSSSS E                    FL R           L + GV+I + C  
Subjt:  GFLEILLSR---PDQLIWHFDKTGIFSVKSGY-IATRHLTAQAHPSSSSME--------------------FLMRA----------LKRCGVDIVDACSL

Query:  SGCPGEDSFHVFWFCE------------------------ESLSRAAFEELSVFLWAIWNFRNG-VRNFGARPPGSLGL----WASDYVRSYRVANSLRL
         G  GEDS H+FW C+                        ESLS+A FEEL V +W +WN RN    N   +    +G+    WA+ Y   +R A S  +
Subjt:  SGCPGEDSFHVFWFCE------------------------ESLSRAAFEELSVFLWAIWNFRNG-VRNFGARPPGSLGL----WASDYVRSYRVANSLRL

Query:  SVAASQGDQPHVRWSPPPDGFFKLNTDAGFLAVPSRGGLGIIICNARGEVFLAAATSLPRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGL
        +   +  +   + W PP +G +K+NTDA FLA     GLGIII N RG+V  AA   L    SVD AEA+A   G+ LA +                 G+
Subjt:  SVAASQGDQPHVRWSPPPDGFFKLNTDAGFLAVPSRGGLGIIICNARGEVFLAAATSLPRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGL

Query:  DKMVEDLSEVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAHKLATLALSRSLDCVWLEDCPPAVNRIVLSQVLWDLV
           +EDLSE G I++  ++  +      F+F  R GN AAH LA  AL      +W+ED P  +   +  + L +L+
Subjt:  DKMVEDLSEVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAHKLATLALSRSLDCVWLEDCPPAVNRIVLSQVLWDLV

A0A803PJK4 Uncharacterized protein (e value: 3.3e-27; identity: 29.67%)
Query:  SRPDQLIWHFDKTGIFSVKSGYIATRHLTAQAHPSSSSME------------------FLMRA----------LKRCGVDIVDACSLSGCPGEDSFHVFW
        S  D++IWH + TGI++VKSGYI       Q  PS SS                    FL +A          L++C +   D CSL     E   H  +
Subjt:  SRPDQLIWHFDKTGIFSVKSGYIATRHLTAQAHPSSSSME------------------FLMRA----------LKRCGVDIVDACSLSGCPGEDSFHVFW

Query:  FCE-------------------------------ESLSRAAFEELSVFLWAIWNFRN-GVRNFGARPPGSLGLWASDYVRSY---RVANSLRLSVAASQG
         C+                               E+  +   E+ +  LW+IW  RN        +P   L + A  Y+  Y   R A    L     + 
Subjt:  FCE-------------------------------ESLSRAAFEELSVFLWAIWNFRN-GVRNFGARPPGSLGLWASDYVRSY---RVANSLRLSVAASQG

Query:  DQPHV--RWSPPPDGFFKLNTDAGFLAVPSRGGLGIIICNARGEVFLAAATSLPRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGLDKMVE
        ++P V  +W  PP G  KLNTDA    V    G G I+ ++ G+V  A +   P C   +  EALAL   +      +L    +E+DSL   +GL     
Subjt:  DQPHV--RWSPPPDGFFKLNTDAGFLAVPSRGGLGIIICNARGEVFLAAATSLPRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGLDKMVE

Query:  DLSEVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAHKLATLALSRSLDCVWLEDCPPAVNRIVL
        ++S+  S+L DI  L S   GV  S   R+ N  AH LA  ALS   +CVW+E+ PP +  IVL
Subjt:  DLSEVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAHKLATLALSRSLDCVWLEDCPPAVNRIVL

M5VU98 Reverse transcriptase domain-containing protein (e value: 8.2e-26; identity: 28.61%)
Query:  PDQLIWHFDKTGIFSVKSGY-IATRHLTAQAHPSSSSME--------------------FLMRA----------LKRCGVDIVDACSLSGCPGEDSFHVF
        PD+++W++DK G+F+VKS Y +A R  +     SSSS                      F  R           L + GVD+ D C   G   E + HV 
Subjt:  PDQLIWHFDKTGIFSVKSGY-IATRHLTAQAHPSSSSME--------------------FLMRA----------LKRCGVDIVDACSLSGCPGEDSFHVF

Query:  WFCEESLSRAAFEELSVFLWAIWNFRNGVRNFGARPPGSLGLWASDYVRSYRVANSLRLSVAASQGDQPHVRWSPPPDGFFKLNTDAGFLAVPSRGGLGI
          C            +V  W I            R P  +  +A  YV  +  AN     V     D   VRW+ PP G  K N D  F     RG +G+
Subjt:  WFCEESLSRAAFEELSVFLWAIWNFRNGVRNFGARPPGSLGLWASDYVRSYRVANSLRLSVAASQGDQPHVRWSPPPDGFFKLNTDAGFLAVPSRGGLGI

Query:  IICNARGEVFLAAATSLPRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGLDKMVEDLSEVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAH
        +  +A G    A A S+    S + AE LA   G++LA+         E DS      + +  +D S +G+I+ D++ L       LF FTPR  N  AH
Subjt:  IICNARGEVFLAAATSLPRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGLDKMVEDLSEVGSILVDIRSLCSLIGGVLFSFTPRTGNAAAH

Query:  KLATLALSRSLDCVWLEDCPPAVNRIVLSQVL
        +LA   L    + +W E  P  +   +L  VL
Subjt:  KLATLALSRSLDCVWLEDCPPAVNRIVLSQVL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAATTCTTCATGTAATATTTCACCTTTTACTATTCTTGCAACACTCCTCCCACTGGTTCTTTTATGGCTGTCCAGCTTGGCTGGGATCCGTCATTCATCTGACGTAGT
TTGCTTTGGGGCTGTTCGCTTTTGCTGTCTGGTTGTCGGTGGCGTGTGGGTGATGGCACATCGATACCCGTCTACTCTGCCAACTGGATTCCTCGAGATTCTGCTCTCCA
GGCCGGATCAACTTATCTGGCATTTTGACAAAACTGGGATCTTCTCTGTTAAAAGTGGTTATATTGCTACACGACATCTTACTGCCCAGGCCCACCCCTCGTCCTCTTCT
ATGGAGTTTCTGATGCGTGCGCTCAAGCGTTGCGGGGTTGATATTGTGGATGCGTGCTCCTTGTCTGGATGCCCAGGGGAGGATAGTTTTCATGTTTTCTGGTTTTGTGA
GGAATCTTTATCCCGTGCAGCTTTTGAGGAATTGTCGGTATTTTTATGGGCGATTTGGAATTTTCGTAATGGTGTTCGGAATTTCGGGGCGCGCCCTCCGGGTTCTCTTG
GCTTATGGGCGTCGGATTATGTTCGTTCCTACCGGGTGGCCAATTCCCTGCGGTTGTCGGTTGCGGCTTCACAGGGGGATCAGCCTCATGTTCGATGGTCTCCTCCTCCG
GACGGCTTTTTCAAACTTAATACAGATGCTGGTTTCCTTGCTGTCCCGAGTCGGGGTGGTTTAGGGATAATTATTTGCAATGCTAGGGGTGAGGTTTTTCTTGCTGCTGC
TACCTCTCTCCCGCGGTGTGCTTCGGTGGATCAAGCAGAGGCTTTGGCTTTGCACCCGGGGATTTCGCTTGCTATTGATGCCGACCTCTTCCGTTTGCAACTTGAGTCAG
ACTCACTGCAGTGCTTCCGTGGGCTTGACAAGATGGTAGAGGATCTGTCGGAAGTGGGGTCTATTCTTGTTGATATTCGGTCCCTCTGCTCCCTTATTGGTGGTGTTCTG
TTTAGCTTCACGCCTCGTACAGGAAATGCTGCGGCTCACAAGCTGGCGACTTTGGCACTTTCTCGCTCTTTGGACTGTGTGTGGCTGGAGGATTGTCCTCCAGCTGTAAA
TAGAATTGTTCTCAGTCAAGTTTTGTGGGATCTTGTTTCTAGTGATGATGCGAAGGAAGTGGAAGATCAAAGTGTGGCAAAGCGTTATTTGCTTTGCGAACTTATATTGA
CAAGGAGTATATCGAGCATGTTCGTGATGAAAAGTCTCCAAAAAAAGTGTGAGATACACTTGAAAGAACTGGATGAGGAGGAGCCCATTAGTGATGCTCATTTGCGTCGT
TATCTCATTCATGGACTGCGAAAGGAGTTTATGCCATTTATTTCTTCGATACAAGATGTTCTTTATGCAAAAGACAAAGGAAAATATAATTCTTATTCCAAGCATTCTTC
AGATGACAGCAAACACTCCAAGACTGAAGGGCAGTCCAATGATGAAGCTACAGAAAGTGCAGTTCGAAAGTCTTCTAGAGAAAGAACTCGACCCAGTTATTTATCTGATT
ACGAGGAAATTTGGATATCTGGTCCATCGTATCTTGTTATGGACCTGTCGTCGGATCAAGAGTCGGGACGTGATAGGTTCGGGCCGACATGCATCGCAGTCGAGTCGGAT
CTTTTGGAGTTGTTCTCGACCGGGGGGCTGAGAACTGCTTGGCTTCGAGACCGCCCTAGTCAACGCCGAGTAGTGCTTGGTGCCGACCACTAG
mRNA sequence
ATGAATTCTTCATGTAATATTTCACCTTTTACTATTCTTGCAACACTCCTCCCACTGGTTCTTTTATGGCTGTCCAGCTTGGCTGGGATCCGTCATTCATCTGACGTAGT
TTGCTTTGGGGCTGTTCGCTTTTGCTGTCTGGTTGTCGGTGGCGTGTGGGTGATGGCACATCGATACCCGTCTACTCTGCCAACTGGATTCCTCGAGATTCTGCTCTCCA
GGCCGGATCAACTTATCTGGCATTTTGACAAAACTGGGATCTTCTCTGTTAAAAGTGGTTATATTGCTACACGACATCTTACTGCCCAGGCCCACCCCTCGTCCTCTTCT
ATGGAGTTTCTGATGCGTGCGCTCAAGCGTTGCGGGGTTGATATTGTGGATGCGTGCTCCTTGTCTGGATGCCCAGGGGAGGATAGTTTTCATGTTTTCTGGTTTTGTGA
GGAATCTTTATCCCGTGCAGCTTTTGAGGAATTGTCGGTATTTTTATGGGCGATTTGGAATTTTCGTAATGGTGTTCGGAATTTCGGGGCGCGCCCTCCGGGTTCTCTTG
GCTTATGGGCGTCGGATTATGTTCGTTCCTACCGGGTGGCCAATTCCCTGCGGTTGTCGGTTGCGGCTTCACAGGGGGATCAGCCTCATGTTCGATGGTCTCCTCCTCCG
GACGGCTTTTTCAAACTTAATACAGATGCTGGTTTCCTTGCTGTCCCGAGTCGGGGTGGTTTAGGGATAATTATTTGCAATGCTAGGGGTGAGGTTTTTCTTGCTGCTGC
TACCTCTCTCCCGCGGTGTGCTTCGGTGGATCAAGCAGAGGCTTTGGCTTTGCACCCGGGGATTTCGCTTGCTATTGATGCCGACCTCTTCCGTTTGCAACTTGAGTCAG
ACTCACTGCAGTGCTTCCGTGGGCTTGACAAGATGGTAGAGGATCTGTCGGAAGTGGGGTCTATTCTTGTTGATATTCGGTCCCTCTGCTCCCTTATTGGTGGTGTTCTG
TTTAGCTTCACGCCTCGTACAGGAAATGCTGCGGCTCACAAGCTGGCGACTTTGGCACTTTCTCGCTCTTTGGACTGTGTGTGGCTGGAGGATTGTCCTCCAGCTGTAAA
TAGAATTGTTCTCAGTCAAGTTTTGTGGGATCTTGTTTCTAGTGATGATGCGAAGGAAGTGGAAGATCAAAGTGTGGCAAAGCGTTATTTGCTTTGCGAACTTATATTGA
CAAGGAGTATATCGAGCATGTTCGTGATGAAAAGTCTCCAAAAAAAGTGTGAGATACACTTGAAAGAACTGGATGAGGAGGAGCCCATTAGTGATGCTCATTTGCGTCGT
TATCTCATTCATGGACTGCGAAAGGAGTTTATGCCATTTATTTCTTCGATACAAGATGTTCTTTATGCAAAAGACAAAGGAAAATATAATTCTTATTCCAAGCATTCTTC
AGATGACAGCAAACACTCCAAGACTGAAGGGCAGTCCAATGATGAAGCTACAGAAAGTGCAGTTCGAAAGTCTTCTAGAGAAAGAACTCGACCCAGTTATTTATCTGATT
ACGAGGAAATTTGGATATCTGGTCCATCGTATCTTGTTATGGACCTGTCGTCGGATCAAGAGTCGGGACGTGATAGGTTCGGGCCGACATGCATCGCAGTCGAGTCGGAT
CTTTTGGAGTTGTTCTCGACCGGGGGGCTGAGAACTGCTTGGCTTCGAGACCGCCCTAGTCAACGCCGAGTAGTGCTTGGTGCCGACCACTAG
Protein sequence
MNSSCNISPFTILATLLPLVLLWLSSLAGIRHSSDVVCFGAVRFCCLVVGGVWVMAHRYPSTLPTGFLEILLSRPDQLIWHFDKTGIFSVKSGYIATRHLTAQAHPSSSS
MEFLMRALKRCGVDIVDACSLSGCPGEDSFHVFWFCEESLSRAAFEELSVFLWAIWNFRNGVRNFGARPPGSLGLWASDYVRSYRVANSLRLSVAASQGDQPHVRWSPPP
DGFFKLNTDAGFLAVPSRGGLGIIICNARGEVFLAAATSLPRCASVDQAEALALHPGISLAIDADLFRLQLESDSLQCFRGLDKMVEDLSEVGSILVDIRSLCSLIGGVL
FSFTPRTGNAAAHKLATLALSRSLDCVWLEDCPPAVNRIVLSQVLWDLVSSDDAKEVEDQSVAKRYLLCELILTRSISSMFVMKSLQKKCEIHLKELDEEEPISDAHLRR
YLIHGLRKEFMPFISSIQDVLYAKDKGKYNSYSKHSSDDSKHSKTEGQSNDEATESAVRKSSRERTRPSYLSDYEEIWISGPSYLVMDLSSDQESGRDRFGPTCIAVESD
LLELFSTGGLRTAWLRDRPSQRRVVLGADH
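
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene. The function name translate is illustrative only, and the inputs are just the first 27 and last 12 nucleotides of the CDS listed above, not the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 and last 12 nucleotides of the CDS listed above.
print(translate("ATGAATTCTTCATGTAATATTTCACCT"))
print(translate("GCCGACCACTAG"))
```

The two fragments translate to MNSSCNISP and ADH, matching the start and end of the protein sequence. The full CDS is 1743 nt, which is 580 codons plus a stop codon, consistent with the 580-residue protein.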