CuGenDBv2

Lcy02g010010 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy02g010010
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: Chr02:26642087..26648491
RNA-Seq Expression: Lcy02g010010
Synteny: Lcy02g010010
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
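The genome location in this record follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the string can be parsed and the locus span computed in Python. The function name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("Chr02:26642087..26648491")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 6405 bp.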


Homology
GenBank top hits
XP_015388020.1 uncharacterized protein LOC107177951 [Citrus sinensis] (e value: 4.9e-23; identity: 28.11%)
Query:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK---------------------------
        +GFR++S FNQA++AKQSWR+I+ P +L+ +V++ +YFR+  F++A VG+N S  WR+ +WGR +  KG RW+                           
Subjt:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK---------------------------

Query:  -----------------------------------------------------DSKGIVSVKSAYHLAVSLELEKDASSSKEDELKLFWKKLWGLKIVPR
                                                             D KG  SVKS Y LA+ ++     SSSKE   K  W  +W L+I  +
Subjt:  -----------------------------------------------------DSKGIVSVKSAYHLAVSLELEKDASSSKEDELKLFWKKLWGLKIVPR

Query:  EK--------------ICLWKLLNDIIP-----TKFNLASKGMDNSLYCQLSGWSVADFWTEMINHFTAEEPNIVAIIMWNICSQVGSLGWVIRDSKGSL
         K              +  +K +   +P     +  NLAS+   N   C  +GW                + N+ A +   I +Q   LG VIR+S+G L
Subjt:  EK--------------ICLWKLLNDIIP-----TKFNLASKGMDNSLYCQLSGWSVADFWTEMINHFTAEEPNIVAIIMWNICSQVGSLGWVIRDSKGSL

Query:  WSAGSKNIRSNWSIAM--LEACAIKEGLITYLKDSISPPPHLIVESDSKHIINLLNRQSHDLSEVQEIMFEVHALADKIGVVRFVWCPRSANGIAHDLAR
         +A  K   S W   M   EA A + GL         P   LIVE+DSK + NL++ +    +E+  I+ EV    +++  V+    PR+ NG AH LA+
Subjt:  WSAGSKNIRSNWSIAM--LEACAIKEGLITYLKDSISPPPHLIVESDSKHIINLLNRQSHDLSEVQEIMFEVHALADKIGVVRFVWCPRSANGIAHDLAR

Query:  RA
         A
Subjt:  RA

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina] (e value: 1.3e-23; identity: 23.31%)
Query:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK---------------------------
        MGFR++S FNQA++AKQ WR+++ P++L+ +VL+ RYF++  F+ A +G+  S  WR+ VWGR +  KG RW+                           
Subjt:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK---------------------------

Query:  -----------------------------------------------------DSKGIVSVKSAYHLAVSLELEKDASSSKEDELKLFWKKLWGLKIVPR
                                                             D KG  SVKS Y +A+ ++  +D S S  D+    W+ +W L I  +
Subjt:  -----------------------------------------------------DSKGIVSVKSAYHLAVSLELEKDASSSKEDELKLFWKKLWGLKIVPR

Query:  EKICLWKLLNDIIPTKFNLASK-----GMDNSLYCQLSGWSVA--------------------------------DFWTEMINHFTAEEPNIVAII--MW
         KI LW+  +D++PT  NL  K      M  S +C +   S A                                 FW     H   E   + A++  +W
Subjt:  EKICLWKLLNDIIPTKFNLASK-----GMDNSLYCQLSGWSVA--------------------------------DFWTEMINHFTAEEPNIVAII--MW

Query:  -------------------------------------------------------------------NICSQVGSLGWVIRDSKGSLWSAGSKNIRSNWS
                                                                           ++ +Q+  LG V+RDS G+  +A  K++R   S
Subjt:  -------------------------------------------------------------------NICSQVGSLGWVIRDSKGSLWSAGSKNIRSNWS

Query:  IAMLEACAIKEGLITYLKDSISPPPHLIVESDSKHIINLLNRQSHDLSEVQEIMFEVHALADKIGVVRFVWCPRSANGIAHDLARRAAQ
        +AM EA A++ GL    K  I+     I ESDS  +I+L+N++S  L+E+  ++ ++          +    PR  N  AH LA+ A Q
Subjt:  IAMLEACAIKEGLITYLKDSISPPPHLIVESDSKHIINLLNRQSHDLSEVQEIMFEVHALADKIGVVRFVWCPRSANGIAHDLARRAAQ

XP_024039324.1 uncharacterized protein LOC112097962 [Citrus clementina] (e value: 2.6e-24; identity: 24.73%)
Query:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK---------------------------
        +GFR+   FNQAM+AKQ WRLI+ PN+L+ KVLR RYFR+ SFL A  G+N S  WR+ +WGR +  KG RW                            
Subjt:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK---------------------------

Query:  ----------------DSKGIVSVKSAYHLAVSLELEKDASSSKEDELKLFWKKLWGLKIVPREKICLWKLLNDIIPTKFNLASKGMDNSLYCQ------
                        D  G  SV+S Y +A+ L+   +  SS  +  +  WK +W +++  + KI +W+   +++PT  NL  +       CQ      
Subjt:  ----------------DSKGIVSVKSAYHLAVSLELEKDASSSKEDELKLFWKKLWGLKIVPREKICLWKLLNDIIPTKFNLASKGMDNSLYCQ------

Query:  LSGWSVADFWTEMINHFTAEEPNIVAIIMWNICSQVGSLGWVIRDSKGSLWSAGSKNIRSNWSIAMLEACAIKEGLITYLKDSISPPPH-----------
         +   + +F  E+ + +   E        W +        +  +       +A ++++   +  A     +    +   ++    PPP            
Subjt:  LSGWSVADFWTEMINHFTAEEPNIVAIIMWNICSQVGSLGWVIRDSKGSLWSAGSKNIRSNWSIAMLEACAIKEGLITYLKDSISPPPH-----------

Query:  -------LIVESDSKHIINLLNRQSHDLSEVQEIMFEVHALADKIGVVRFVWCPRSANGIAHDLARRA
               LI+E+D K +++LLN      + +  ++ ++         V+F   PR+ N  AH LA+ A
Subjt:  -------LIVESDSKHIINLLNRQSHDLSEVQEIMFEVHALADKIGVVRFVWCPRSANGIAHDLARRA

XP_030502610.1 uncharacterized protein LOC115717775 [Cannabis sativa] (e value: 5.3e-25; identity: 27.71%)
Query:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK---------------------------
        +GFR +   NQA+LAKQ+WR+   P +L  ++L+ RYF+N +FL+A  G++ S +W + +WGR+L   G  WK                           
Subjt:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK---------------------------

Query:  -------------------DSKGIVSVKSAYHLAVSLELEKDASSSKEDELKLFWKKLWGLKIVPREKICLWKLLNDIIPTKFNLASKGMDNSLYCQLSG
                           D  G+++VKSAYHL   + L    SSS       +WKK W L I P+ K   W+    I+PT +NL  K +  S  C    
Subjt:  -------------------DSKGIVSVKSAYHLAVSLELEKDASSSKEDELKLFWKKLWGLKIVPREKICLWKLLNDIIPTKFNLASKGMDNSLYCQLSG

Query:  WSVADFWTEMINHFTAEEPNIVAIIMWNICSQVGSLGWVIRDSKGSLWSAGSKNIRSNWSIAMLEACAIKEGLITYLKDSISPPPHLIVESDSKHIINLL
          V      +++   A +            S++G LG +I+D  G + +     I +  S  M EA A+K  L       I   P   + +DSK +I+ +
Subjt:  WSVADFWTEMINHFTAEEPNIVAIIMWNICSQVGSLGWVIRDSKGSLWSAGSKNIRSNWSIAMLEACAIKEGLITYLKDSISPPPHLIVESDSKHIINLL

Query:  NRQSHDLSEVQEIMFEVHALADKIGVVRFVWCPRSANGIAHDLARRAAQL
        N    +LS +  I+ ++ + +         + PR +N  AH +A+ A  L
Subjt:  NRQSHDLSEVQEIMFEVHALADKIGVVRFVWCPRSANGIAHDLARRAAQL

XP_030508875.1 uncharacterized protein LOC115723521 [Cannabis sativa] (e value: 2.8e-26; identity: 26.32%)
Query:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK---------------------------
        +GFR ++  NQA+LAKQ+WR+  NPN+L++ +L+ RYF++  FL+AP+G+N S TWR+  WG  L   G  WK                           
Subjt:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK---------------------------

Query:  --------------------------------------------DS-------KGIVSVKSAYHLAVSLELEKDASSSKEDELKLFWKKLWGLKIVPREK
                                                    DS        GI++VKSAYHLA S+      SSS  +  K +WK LW LKI P+ K
Subjt:  --------------------------------------------DS-------KGIVSVKSAYHLAVSLELEKDASSSKEDELKLFWKKLWGLKIVPREK

Query:  ICLWKLLNDIIPTKFNLASKGMDNSLYCQLSGWSVADFWTEM---------INHFTAEE-----PNIVAIIMWNICSQVGS---LGWVIRDSKGSLWSAG
           WK  N ++P   NL  K    S  C   G  +    +E          +  F A       P+  A+ +    SQ  S   LG V       +    
Subjt:  ICLWKLLNDIIPTKFNLASKGMDNSLYCQLSGWSVADFWTEM---------INHFTAEE-----PNIVAIIMWNICSQVGS---LGWVIRDSKGSLWSAG

Query:  SKNIRSNWSIAMLEACAIKEGLITYLKDSISPPPHLIVESDSKHIINLLNRQSHDLSEVQEIMFEVHALADKIGVVRFVWCPRSANGIAHDLARRAAQL
        S N     S    EA A+ EG+   L  ++ P     + SD  ++++ +    HD S +  ++ ++     +      +      N  AH LA++A +L
Subjt:  SKNIRSNWSIAMLEACAIKEGLITYLKDSISPPPHLIVESDSKHIINLLNRQSHDLSEVQEIMFEVHALADKIGVVRFVWCPRSANGIAHDLARRAAQL

TrEMBL top hits
A0A803P9U8 Uncharacterized protein (e value: 1.5e-22; identity: 29.3%)
Query:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK---------------------------
        +GFR  S  NQA+LAKQ+WR+  +  +L++ +L+ RYF++  FL APVG NSS TWR+ +WGR L  KG  WK                           
Subjt:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK---------------------------

Query:  ---------------------------------------------------DSKGIVSVKSAYHLAVSLELEKDASSSKEDELKLFWKKLWGLKIVPREK
                                                            S G+++VKSAYHLA SL      S S  +  + +WK+LW L + P+ K
Subjt:  ---------------------------------------------------DSKGIVSVKSAYHLAVSLELEKDASSSKEDELKLFWKKLWGLKIVPREK

Query:  ICLWKLLNDIIPTKFNLASKGMDNSLYCQLSGWSVADFWTEMINHFTAEEPNIVAI
           WK  N I+P   NL  K +    YC + G +     TE + H   + P    I
Subjt:  ICLWKLLNDIIPTKFNLASKGMDNSLYCQLSGWSVADFWTEMINHFTAEEPNIVAI

A0A803PAK3 Uncharacterized protein (e value: 3.7e-24; identity: 26.65%)
Query:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK---------------------------
        MGF+    FNQA+LAKQ+W L+ NPN+L+ ++L+ RYFR+ SFL A +G++ SLTWR  VWG+ L  KG RWK                           
Subjt:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK---------------------------

Query:  -----------------------------------------------------DSKGIVSVKSAYHLAVSLELEKDASSSKEDELKLFWKKLWGLKIVPR
                                                             +S G+ SVKS Y LA  LE ++  SS   ++ + +WKK WGLK+  +
Subjt:  -----------------------------------------------------DSKGIVSVKSAYHLAVSLELEKDASSSKEDELKLFWKKLWGLKIVPR

Query:  EKICLWKLLNDIIP----------------TKFNLASKGMDNSL-YCQ-------LSGWSVADFWTEMINHFTAE--------------EPNIVAIIMWN
         +I LW+ ++D +P                T  N A++ + ++L YC+       + G   A        H TA               + N  A I  N
Subjt:  EKICLWKLLNDIIP----------------TKFNLASKGMDNSL-YCQ-------LSGWSVADFWTEMINHFTAE--------------EPNIVAIIMWN

Query:  ICSQVGSLGWVIRDSKGSLWSAGSKNIRSNWSIAMLEACAIKEGLITYLKDSISPPPHLIVESDSKHIINLLNRQSHDLSEVQEIMFEVHALADKIGVVR
                G V+RD  G + +A S      ++  + EA A+   L  ++KD +  P H I E++S  ++  L+     +S+   ++  +  L       +
Subjt:  ICSQVGSLGWVIRDSKGSLWSAGSKNIRSNWSIAMLEACAIKEGLITYLKDSISPPPHLIVESDSKHIINLLNRQSHDLSEVQEIMFEVHALADKIGVVR

Query:  FVWCPRSANGIAHDLARRAAQLTS
             RSAN  AH LAR A  + S
Subjt:  FVWCPRSANGIAHDLARRAAQLTS

A0A803PDG2 Uncharacterized protein (e value: 1.5e-22; identity: 37.58%)
Query:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK------------DSKGIVSVKSAYHLA
        MGFR    FNQA+LAKQ+WR+    N+L+ ++L+ RYF N SFL++ +G++ SLTW+   WGR L  KG R+K             + G+ +VKS +H  
Subjt:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK------------DSKGIVSVKSAYHLA

Query:  VSLE-LEKDASSSKEDELKLFWKKLWGLKIVPREKICLWKLLNDIIPTKFNLASKGMDNSLYCQL
          LE ++K++SS+     K +WK  W LK+  + KI  W+++ + +P    L  K +  S  C L
Subjt:  VSLE-LEKDASSSKEDELKLFWKKLWGLKIVPREKICLWKLLNDIIPTKFNLASKGMDNSLYCQL

A0A803PM23 Uncharacterized protein (e value: 2.4e-23; identity: 24.78%)
Query:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK---------------------------
        MGF+ +  FNQ++LAKQ W++I NP++L+ +VL+  Y+ N +FL+A +G   S  WR+ +WGR +  KG RW+                           
Subjt:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK---------------------------

Query:  ---------------DSKGIVSVKSAYHLAVSLELEKDASSSKEDELKLFWKKLWGLKIVPREKICLWKLLNDIIPTKFNLASKGMDNSLYCQLSGWSVA
                          GI +V S Y +AV+ ELE +AS+ K    + +W K+W L+I P+ +  +W+     +  K     +     +YC  S  SV 
Subjt:  ---------------DSKGIVSVKSAYHLAVSLELEKDASSSKEDELKLFWKKLWGLKIVPREKICLWKLLNDIIPTKFNLASKGMDNSLYCQLSGWSVA

Query:  DFWTEMINHFTAEEPNIVAIIMWNICSQVGSLGWVIRDSKGSLWSAGSKNIRSNWSIAMLEACAIKEGLITYLKDSISPPPHLIVESDSKHIINLLNRQS
              +N                     G LG +IR+  G + +A  +  +  +S+ + E  A++ G+   +K    P    I+++D   ++N LN  S
Subjt:  DFWTEMINHFTAEEPNIVAIIMWNICSQVGSLGWVIRDSKGSLWSAGSKNIRSNWSIAMLEACAIKEGLITYLKDSISPPPHLIVESDSKHIINLLNRQS

Query:  HDLSEVQEIMFEVHALADKIGVVRFVWCPRSANGIAHDLARRA
           ++   ++ ++    + +  +      R  N  A+ LA+ A
Subjt:  HDLSEVQEIMFEVHALADKIGVVRFVWCPRSANGIAHDLARRA

A0A803QCJ3 Uncharacterized protein (e value: 9.1e-23; identity: 28.05%)
Query:  FNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK-----------------------------------
        + +AMLA Q+WR+  NP +L+  VL+ +YFR+  FL+A +G++ S TW + +WGR+L   G RWK                                   
Subjt:  FNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWK-----------------------------------

Query:  ------------DSKGIVSVKSAYHLAVSLELEKDASSSKEDELKLFWKKLWGLKIVPREKICLWKLLNDIIPTKFNL-ASKGMDN---SLYCQLSGWSV
                    ++ GI SV+SAYHLA S  L    SSS       +WK LW L + P+ K   W++ + I+P   NL   K +     S+Y Q +  +V
Subjt:  ------------DSKGIVSVKSAYHLAVSLELEKDASSSKEDELKLFWKKLWGLKIVPREKICLWKLLNDIIPTKFNL-ASKGMDN---SLYCQLSGWSV

Query:  ADFWTEMINHFTAEE----PNIVAIIM-WNICSQVGSLGW--VIRDSKGSLWSAGSKNIRSNWSIAMLEACAIKEGLI--TYLKDSISPPPHLIVESDSK
          +        T  E     N   + M   +C++   LG+  VI+D +G+L    S     N    M +A A++ GLI   ++K  ++     +V++DSK
Subjt:  ADFWTEMINHFTAEE----PNIVAIIM-WNICSQVGSLGW--VIRDSKGSLWSAGSKNIRSNWSIAMLEACAIKEGLI--TYLKDSISPPPHLIVESDSK

Query:  HIINLLNRQSHDLSEVQEIMFEVHALADKIGVVRFVWCPRSANGIAHDLARRA
         ++N +  +  DL  + +++ ++ +       V      +  N  AH LARRA
Subjt:  HIINLLNRQSHDLSEVQEIMFEVHALADKIGVVRFVWCPRSANGIAHDLARRA

SwissProt top hits
P93295 Uncharacterized mitochondrial protein AtMg00310 (e value: 1.2e-11; identity: 44.93%)
Query:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKG
        +GFR++  FNQA+LAKQS+R+I  P+ L+ ++LR RYF + S ++  VG   S  WR+ + GR L ++G
Subjt:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKG

Arabidopsis top hits
AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 1.9e-09; identity: 36.62%)
Query:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYR
        +GF++I  FN A+L KQ WR++  P +L+ KV + RYF     L AP+G+  S  W++    + +  +G R
Subjt:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYR

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 8.4e-13; identity: 44.93%)
Query:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKG
        +GFR++  FNQA+LAKQS+R+I  P+ L+ ++LR RYF + S ++  VG   S  WR+ + GR L ++G
Subjt:  MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKG


Sequences
CDS sequence
ATGGGATTTAGGGAGATCAGTCTCTTTAACCAAGCCATGCTTGCTAAGCAGAGCTGGAGATTAATTAGGAACCCAAACAACTTGATCTATAAAGTCCTAAGGGGTAGATA
TTTCAGGAACGGATCCTTTTTGAAAGCCCCTGTTGGAAACAACTCGTCCCTTACTTGGCGCAACAATGTGTGGGGTAGAAACCTTTTCACCAAAGGCTACAGGTGGAAAG
ATAGTAAGGGCATCGTTTCGGTTAAATCAGCGTATCACCTGGCGGTTAGCTTGGAGCTTGAAAAGGATGCATCTAGTTCGAAGGAAGATGAGCTTAAGCTGTTTTGGAAG
AAATTATGGGGTCTTAAGATTGTTCCCAGAGAAAAAATATGCCTTTGGAAGTTGTTAAATGATATCATCCCTACGAAATTCAACTTAGCTTCCAAAGGCATGGACAACTC
CCTCTACTGCCAGTTGAGTGGATGGTCTGTGGCGGACTTTTGGACCGAGATGATCAACCATTTCACGGCTGAAGAACCCAACATTGTTGCGATCATCATGTGGAACATAT
GCTCTCAAGTTGGCAGTTTGGGGTGGGTGATTCGTGACTCCAAAGGGTCTCTGTGGTCAGCGGGAAGCAAAAATATTAGGAGCAATTGGTCCATTGCTATGCTTGAAGCT
TGTGCAATTAAAGAAGGATTGATCACGTACCTCAAAGATAGCATCTCTCCTCCTCCTCATTTGATCGTTGAATCCGATTCAAAGCACATTATCAATCTCCTCAATCGCCA
GAGTCATGATCTCTCGGAAGTGCAAGAAATCATGTTCGAAGTGCATGCCCTCGCCGATAAGATTGGGGTGGTCCGTTTCGTTTGGTGCCCAAGATCTGCTAATGGGATCG
CGCACGATTTGGCGCGAAGAGCGGCCCAGCTGACGTCGTATCGAGTACTTCAGGCGGGGTCTTCGTCTTTAAGCTTAGAGTCAGTTGATTTTTGTTCTCCAGATCCCTCT
GCTTCTTCCACGAAAGTGTGTTTTTTTGGTAGGGATTCTTTTTTGTTTGTTAGGCTTTTTTCTGCTATAATTGAGGAAACCCCTTCCGGTGTTGGTTAA
mRNA sequence
ATGGGATTTAGGGAGATCAGTCTCTTTAACCAAGCCATGCTTGCTAAGCAGAGCTGGAGATTAATTAGGAACCCAAACAACTTGATCTATAAAGTCCTAAGGGGTAGATA
TTTCAGGAACGGATCCTTTTTGAAAGCCCCTGTTGGAAACAACTCGTCCCTTACTTGGCGCAACAATGTGTGGGGTAGAAACCTTTTCACCAAAGGCTACAGGTGGAAAG
ATAGTAAGGGCATCGTTTCGGTTAAATCAGCGTATCACCTGGCGGTTAGCTTGGAGCTTGAAAAGGATGCATCTAGTTCGAAGGAAGATGAGCTTAAGCTGTTTTGGAAG
AAATTATGGGGTCTTAAGATTGTTCCCAGAGAAAAAATATGCCTTTGGAAGTTGTTAAATGATATCATCCCTACGAAATTCAACTTAGCTTCCAAAGGCATGGACAACTC
CCTCTACTGCCAGTTGAGTGGATGGTCTGTGGCGGACTTTTGGACCGAGATGATCAACCATTTCACGGCTGAAGAACCCAACATTGTTGCGATCATCATGTGGAACATAT
GCTCTCAAGTTGGCAGTTTGGGGTGGGTGATTCGTGACTCCAAAGGGTCTCTGTGGTCAGCGGGAAGCAAAAATATTAGGAGCAATTGGTCCATTGCTATGCTTGAAGCT
TGTGCAATTAAAGAAGGATTGATCACGTACCTCAAAGATAGCATCTCTCCTCCTCCTCATTTGATCGTTGAATCCGATTCAAAGCACATTATCAATCTCCTCAATCGCCA
GAGTCATGATCTCTCGGAAGTGCAAGAAATCATGTTCGAAGTGCATGCCCTCGCCGATAAGATTGGGGTGGTCCGTTTCGTTTGGTGCCCAAGATCTGCTAATGGGATCG
CGCACGATTTGGCGCGAAGAGCGGCCCAGCTGACGTCGTATCGAGTACTTCAGGCGGGGTCTTCGTCTTTAAGCTTAGAGTCAGTTGATTTTTGTTCTCCAGATCCCTCT
GCTTCTTCCACGAAAGTGTGTTTTTTTGGTAGGGATTCTTTTTTGTTTGTTAGGCTTTTTTCTGCTATAATTGAGGAAACCCCTTCCGGTGTTGGTTAA
Protein sequence
MGFREISLFNQAMLAKQSWRLIRNPNNLIYKVLRGRYFRNGSFLKAPVGNNSSLTWRNNVWGRNLFTKGYRWKDSKGIVSVKSAYHLAVSLELEKDASSSKEDELKLFWK
KLWGLKIVPREKICLWKLLNDIIPTKFNLASKGMDNSLYCQLSGWSVADFWTEMINHFTAEEPNIVAIIMWNICSQVGSLGWVIRDSKGSLWSAGSKNIRSNWSIAMLEA
CAIKEGLITYLKDSISPPPHLIVESDSKHIINLLNRQSHDLSEVQEIMFEVHALADKIGVVRFVWCPRSANGIAHDLARRAAQLTSYRVLQAGSSSLSLESVDFCSPDPS
ASSTKVCFFGRDSFLFVRLFSAIIEETPSGVG
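
The protein sequence above is expected to be the in-frame translation of the CDS. The following is a minimal sketch of that check, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene, which the record does not state. `translate` is a hypothetical helper, not part of CuGenDB, and only short fragments copied from the start and end of the CDS are used here.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 24 nt of the CDS, and its last 12 nt (ending in the stop codon TAA).
print(translate("ATGGGATTTAGGGAGATCAGTCTC"))
print(translate("GGTGTTGGTTAA"))
```

The two fragments translate to `MGFREISL` and `GVG`, which match the first eight and last three residues of the listed protein. Passing the full CDS (with line breaks joined) to `translate` would extend the same check to the whole sequence.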