; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001557 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001557
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr4:32817838..32823000
RNA-Seq ExpressionLag0001557
SyntenyLag0001557
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004293042.2 PREDICTED: uncharacterized protein LOC101293660 [Fragaria vesca subsp. vesca]7.0e-3022.94Show/hide
Query:  SSFVNRRRSVPSQMVTSRKKKNEINGILDNERCWKDKEKDIGEVVGNYFSKLFSSIQPSEYLMDRAVDGVEHVLLDQQNQQLR--------KKAVKEMHP
        +SF +R+ S       +RK+KN + G+ ++   W + ++ + +VV +YF  +F++       M+  ++ ++  +  + N+QLR        K A+ +M+P
Subjt:  SSFVNRRRSVPSQMVTSRKKKNEINGILDNERCWKDKEKDIGEVVGNYFSKLFSSIQPSEYLMDRAVDGVEHVLLDQQNQQLR--------KKAVKEMHP

Query:  TNALGPDDAHALFYKKFWNTVGIKAKA-----------------------------------------------------------EELSRILGIGRTDS
        T +  PD    LF++ +W+T+G                                                                E+ S  +G+   D+
Subjt:  TNALGPDDAHALFYKKFWNTVGIKAKA-----------------------------------------------------------EELSRILGIGRTDS

Query:  LGQYLGMPSQGGRNMCRIFNQVVDK---------------GGKEILIKAIAQAIHTLNMSCFRLPLKISNEINKLCARFWSGSHGDKDKAHWLSWKKIYF
          +YLG+P+  GR     F  +  K                G++ILI+ + Q++    MS F+L  KI  ++ +LCA+FW GS  +K K HW +WK +  
Subjt:  LGQYLGMPSQGGRNMCRIFNQVVDK---------------GGKEILIKAIAQAIHTLNMSCFRLPLKISNEINKLCARFWSGSHGDKDKAHWLSWKKIYF

Query:  KDENFLEAKAGPGASLVWISIVWGKDLFRKGYRWRVGDD-------------REVREQPQRCL----------------GQVEQALDSEDFHPIDAEDIL
          E            L + S+V   +       WRV  D             R +  +P                    G    AL S  F   DA  I 
Subjt:  KDENFLEAKAGPGASLVWISIVWGKDLFRKGYRWRVGDD-------------REVREQPQRCL----------------GQVEQALDSEDFHPIDAEDIL

Query:  SIPYGSPLAKDEIIWSLDSKGRFSMKVLIGLRWSWNLNLTHPPF-----AAVW----------GEENLSPSSIEEINRDIDRVWEDLPEPLPFAELSKKM
        +IP  S    D I+W L+  G FS+K     R+S++ +  H PF      A W          G +NL    +  +   +  V   + + L    +S   
Subjt:  SIPYGSPLAKDEIIWSLDSKGRFSMKVLIGLRWSWNLNLTHPPF-----AAVW----------GEENLSPSSIEEINRDIDRVWEDLPEPLPFAELSKKM

Query:  ENHMIHQSWKPLPADCWKLNSDASISASGSSCDVGWVVRDSFSSLICRGALSIKDIWSVNSLETLAIREGL
           ++  SWKP P    K+N D S  +   S   G+V+RD+  S +  G  + K + S   +E LA ++ +
Subjt:  ENHMIHQSWKPLPADCWKLNSDASISASGSSCDVGWVVRDSFSSLICRGALSIKDIWSVNSLETLAIREGL

XP_022145148.1 uncharacterized protein LOC111014662 [Momordica charantia]1.6e-2937.56Show/hide
Query:  AEELSRILGIGRTDSLGQYLGMPSQGGRNMCRIFNQVVDK---------------GGKEILIKAIAQAIHTLNMSCFRLPLKISNEINKLCARFWSGSHG
        A  +   L + RT+ +GQYLG+PSQ  RN C++FN ++++               GGKE+LIKA+AQAI   +MSCFR P+ +  EIN L ARFW GS+ 
Subjt:  AEELSRILGIGRTDSLGQYLGMPSQGGRNMCRIFNQVVDK---------------GGKEILIKAIAQAIHTLNMSCFRLPLKISNEINKLCARFWSGSHG

Query:  DKDKAHWLSWKKI--------------------------------------------YFKDENFLEAKAGPGASLVWISIVWGKDLFRKGYRWRVGD
         + K HW SWK++                                            YFK  NF+ A+ G   S VW SI+WGK+LF KG RWR+G+
Subjt:  DKDKAHWLSWKKI--------------------------------------------YFKDENFLEAKAGPGASLVWISIVWGKDLFRKGYRWRVGD

XP_023892689.1 uncharacterized protein LOC112004687 [Quercus suber]7.8e-2924.82Show/hide
Query:  TSRKKKNEINGILDNERCWKDKEKDIGEVVGNYFSKLFSSIQPSEYLMDRAVDGVEHVLLDQQNQQLRKK--------AVKEMHPTNALGPDDAHALF--
        T R++KN I GI D    W  K + +  +   YF ++FSS  P   + +  +D +  V+  + N+ L +         A+K+M P    GPDD   LF  
Subjt:  TSRKKKNEINGILDNERCWKDKEKDIGEVVGNYFSKLFSSIQPSEYLMDRAVDGVEHVLLDQQNQQLRKK--------AVKEMHPTNALGPDDAHALF--

Query:  ---------------YKK-------------FWNTVGIKAKAEELSRILGIGRTDSLGQYLGMPSQGGRNMCRIFNQVVDK---------------GGKE
                       Y+K             F++         ++   LG+        YLG+P+  GRN    F+Q+  K                G+E
Subjt:  ---------------YKK-------------FWNTVGIKAKAEELSRILGIGRTDSLGQYLGMPSQGGRNMCRIFNQVVDK---------------GGKE

Query:  ILIKAIAQAIHTLNMSCFRLPLKISNEINKLCARFWSGSHGDKDKAHWLSW--------------------------------------------KKIYF
        +LIK++ QAI T  MSCF+LP+ + +EI  L  +FW G  GD+ K HW+ W                                            K  +F
Subjt:  ILIKAIAQAIHTLNMSCFRLPLKISNEINKLCARFWSGSHGDKDKAHWLSW--------------------------------------------KKIYF

Query:  KDENFLEAKAGPGASLVWISIVWGKDLFRKGYRWRVGDDREVR-----EQPQRCLGQV-------------------------EQALDSEDFHPIDAEDI
           +FLEAK     S  W SI+ G+++ +KG  WRVGD ++++       P + L ++                         E+ +D   F+  DA  I
Subjt:  KDENFLEAKAGPGASLVWISIVWGKDLFRKGYRWRVGDDREVR-----EQPQRCLGQV-------------------------EQALDSEDFHPIDAEDI

Query:  LSIPYGSPLAKDEIIWSLDSKGRFSMK
         +IP       D ++W     G++S+K
Subjt:  LSIPYGSPLAKDEIIWSLDSKGRFSMK

XP_024033484.1 uncharacterized protein LOC112095607 [Citrus clementina]4.1e-3025.88Show/hide
Query:  TSRKKKNEINGILDNERCWKDKEKDIGEVVGNYFSKLFSSIQPSEYLMDRAVDGVEHVLLDQQNQQLRK--------KAVKEMHPTNALGPDDAHALFYK
        +SRKKKN I GI +    W DK +++      YF+KLF++ QPS+  +  A++G+   + +  N+QL +        +A+ +M PT A GPD   A FY+
Subjt:  TSRKKKNEINGILDNERCWKDKEKDIGEVVGNYFSKLFSSIQPSEYLMDRAVDGVEHVLLDQQNQQLRK--------KAVKEMHPTNALGPDDAHALFYK

Query:  KFWNTVGIK-------------------------------------------------------------------------------------------
        K WN V  K                                                                                           
Subjt:  KFWNTVGIK-------------------------------------------------------------------------------------------

Query:  -----AKAEELSRILGIGRTDSLG---QYLGMPSQGGRNMCRIFNQV---------------VDKGGKEILIKAIAQAIHTLNMSCFRLPLKISNEINKL
              K EE+S I  I + + +    +YLG+PS  GR     F  V                  GGKE+LIKA+AQA+    MS F++PL + ++I + 
Subjt:  -----AKAEELSRILGIGRTDSLG---QYLGMPSQGGRNMCRIFNQV---------------VDKGGKEILIKAIAQAIHTLNMSCFRLPLKISNEINKL

Query:  CARFWSGSHGDKDKAHWLSWKKI---YFKDENFLEAKAGPGASLVWISIVWGKDLFRKGYRWRVGDDREVR----------------EQPQRCLGQVEQA
         A FW GS   K+K + L  + +   YFK  +FL AK G     +W SI+WG+ +   G RWR+G   +V+                  P          
Subjt:  CARFWSGSHGDKDKAHWLSWKKI---YFKDENFLEAKAGPGASLVWISIVWGKDLFRKGYRWRVGDDREVR----------------EQPQRCLGQVEQA

Query:  LDSED-----------FHPIDAEDILSIPYGSPLAKDEIIWSLDSKGRFSMK
        L ++D           F  +DA+ I+S+P       D+I+W  D +GR+S+K
Subjt:  LDSED-----------FHPIDAEDILSIPYGSPLAKDEIIWSLDSKGRFSMK

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]1.9e-3024.9Show/hide
Query:  TSRKKKNEINGILDNERCWKDKEKDIGEVVGNYFSKLFSSIQPSEYLMDRAVDGVEHVLLDQQNQQLR--------KKAVKEMHPTNALGPDDAHALFYK
        +SRK+KN I G++D    W D  + + +    YF+ LF++  PSE  ++ A+ G++  +  + N QL           A+ +M PT A GPD   A F++
Subjt:  TSRKKKNEINGILDNERCWKDKEKDIGEVVGNYFSKLFSSIQPSEYLMDRAVDGVEHVLLDQQNQQLR--------KKAVKEMHPTNALGPDDAHALFYK

Query:  KFWNTV-------------------GIKAKAEELSRILGI----------------------------------------------------------GR
        K W++V                    +  +AE+   I GI                                                           R
Subjt:  KFWNTV-------------------GIKAKAEELSRILGI----------------------------------------------------------GR

Query:  TDSL---------------GQYLGMPSQGGRNMCRIFNQV---------------VDKGGKEILIKAIAQAIHTLNMSCFRLPLKISNEINKLCARFWSG
         D +                +YLG+PS  GR     FN+V                  GGKE+LIKA+AQAI T  MS F++PL +  +I K  ARFW G
Subjt:  TDSL---------------GQYLGMPSQGGRNMCRIFNQV---------------VDKGGKEILIKAIAQAIHTLNMSCFRLPLKISNEINKLCARFWSG

Query:  SHGDKDKAHWLSWKKI--------------------------------------------YFKDENFLEAKAGPGASLVWISIVWGKDLFRKGYRWRVGD
        +  D+   HW  W++I                                            YFK   F+ A  G   S VW SIVWG+ +  KG RWR+G+
Subjt:  SHGDKDKAHWLSWKKI--------------------------------------------YFKDENFLEAKAGPGASLVWISIVWGKDLFRKGYRWRVGD

Query:  DREVREQPQRCLGQ-----------------VEQALDS----------EDFHPIDAEDILSIPYGSPLAKDEIIWSLDSKGRFSMK
         + V       + +                 V + +D           + F P DAE I+ IP      +D++IW  D KG +S+K
Subjt:  DREVREQPQRCLGQ-----------------VEQALDS----------EDFHPIDAEDILSIPYGSPLAKDEIIWSLDSKGRFSMK

TrEMBL top hitse value%identityAlignment
A0A2N9GVR0 RNase H domain-containing protein1.3e-3228.72Show/hide
Query:  RKKKNEINGI-LDNERCWKDKEKDIGEVVGNYFSKLFSSIQPSEYLMDRAVDGVEHVLLDQQNQQLRKK--------AVKEMHPTNALGPDDAHALFYKK
        RK+ N I  +  D E    D E+ IG     Y+ +LF++    +  +D  +DG++  + ++ NQ L           A+K+M P  A GPD    +FY+ 
Subjt:  RKKKNEINGI-LDNERCWKDKEKDIGEVVGNYFSKLFSSIQPSEYLMDRAVDGVEHVLLDQQNQQLRKK--------AVKEMHPTNALGPDDAHALFYKK

Query:  FWNTVGIKAKA------------------------------EELSRILGIGRTDSLGQYLGMPSQGGRNMCRIFNQ---------------VVDKGGKEI
        +W+ VG    A                              EEL  ILG+       +YLG+PS  G+     F+Q               ++ + G+EI
Subjt:  FWNTVGIKAKA------------------------------EELSRILGIGRTDSLGQYLGMPSQGGRNMCRIFNQ---------------VVDKGGKEI

Query:  LIKAIAQAIHTLNMSCFRLPLKISNEINKLCARFWSGSHGDKDKAHWLSWKKI---------YFKDENFLEAKAGPGASLVWISIVWGKDLFRKGYRWRV
        LIKA+ QAI T  M+CF+LP+ +  EI  +  RFW G + DK K HWLSW+K+         +F + N LEA      S  W SI+  K L   G  WRV
Subjt:  LIKAIAQAIHTLNMSCFRLPLKISNEINKLCARFWSGSHGDKDKAHWLSWKKI---------YFKDENFLEAKAGPGASLVWISIVWGKDLFRKGYRWRV

Query:  GDDREV--------REQPQRCL----------GQVEQALDSED-----------FHPIDAEDILSIPYGSPLAKDEIIWSLDSKGRFSMK
        GD  ++         E+  RC+           +V + +               F P DAE IL IP      +D++ W     G+++M+
Subjt:  GDDREV--------REQPQRCL----------GQVEQALDSED-----------FHPIDAEDILSIPYGSPLAKDEIIWSLDSKGRFSMK

A0A6J1CV63 uncharacterized protein LOC1110146627.6e-3037.56Show/hide
Query:  AEELSRILGIGRTDSLGQYLGMPSQGGRNMCRIFNQVVDK---------------GGKEILIKAIAQAIHTLNMSCFRLPLKISNEINKLCARFWSGSHG
        A  +   L + RT+ +GQYLG+PSQ  RN C++FN ++++               GGKE+LIKA+AQAI   +MSCFR P+ +  EIN L ARFW GS+ 
Subjt:  AEELSRILGIGRTDSLGQYLGMPSQGGRNMCRIFNQVVDK---------------GGKEILIKAIAQAIHTLNMSCFRLPLKISNEINKLCARFWSGSHG

Query:  DKDKAHWLSWKKI--------------------------------------------YFKDENFLEAKAGPGASLVWISIVWGKDLFRKGYRWRVGD
         + K HW SWK++                                            YFK  NF+ A+ G   S VW SI+WGK+LF KG RWR+G+
Subjt:  DKDKAHWLSWKKI--------------------------------------------YFKDENFLEAKAGPGASLVWISIVWGKDLFRKGYRWRVGD

A0A6J1DAR4 uncharacterized protein LOC1110189541.6e-2730.25Show/hide
Query:  LSRILGIGRTDSLGQYLGMPSQGGRNMCRIFNQVVDK---------------GGKEILIKAIAQAIHTLNMSCFRLPLKISNEINKLCARFWSGSHGDKD
        +  IL +   +   QYLG+P+   RN    FN + D+               GGKE+LIKA+AQAI    MSCFRLP ++  E + + ARFW GS  +  
Subjt:  LSRILGIGRTDSLGQYLGMPSQGGRNMCRIFNQVVDK---------------GGKEILIKAIAQAIHTLNMSCFRLPLKISNEINKLCARFWSGSHGDKD

Query:  KAHWLSWKKI--------------------------------------------YFKDENFLEAKAGPGASLVWISIVWGKDLFRKGYRWRVGD------
        K HW++W  +                                            YFKD +F+EAK     S +W SI+WG+DL +KG RWR+G+      
Subjt:  KAHWLSWKKI--------------------------------------------YFKDENFLEAKAGPGASLVWISIVWGKDLFRKGYRWRVGD------

Query:  --DREVREQPQ---------RCLGQVEQALDSE-----------DFHPIDAEDILSIPYGSPLAKDEIIWSLDSKGRFSMK
          D  V  QP            + +V   +D E           +F P +A+ ILSIP G    +D +IW+ +  G +S++
Subjt:  --DREVREQPQ---------RCLGQVEQALDSE-----------DFHPIDAEDILSIPYGSPLAKDEIIWSLDSKGRFSMK

A0A803P8L6 Uncharacterized protein1.3e-2927.96Show/hide
Query:  AEELSRILGIGRTDSLGQYLGMPSQGGRNMCRIFNQVVDK---------------GGKEILIKAIAQAIHTLNMSCFRLPLKISNEINKLCARFWSGSHG
        AE L+  LG+    +  +YLGMP+  G+N   +F ++ D+                GKEILIKAI QA+    MSCFR+   I +EI  + A FW G+  
Subjt:  AEELSRILGIGRTDSLGQYLGMPSQGGRNMCRIFNQVVDK---------------GGKEILIKAIAQAIHTLNMSCFRLPLKISNEINKLCARFWSGSHG

Query:  DKDKAHWLSWKK--------------------------------------------IYFKDENFLEAKAGPGASLVWISIVWGKDLFRKGYRWRVGDDRE
         K K HW SW+K                                            +YF + NFLEAK G   S +W  IVWG++L  KGYRW +G+   
Subjt:  DKDKAHWLSWKK--------------------------------------------IYFKDENFLEAKAGPGASLVWISIVWGKDLFRKGYRWRVGDDRE

Query:  VR--EQPQRCLGQVEQALDSEDFHPIDAEDILSIPYG---SPLAKDEIIWSLDS-KGRFSMKVLIGLRWSWNLNLTHPPFAAV-WG---EENLSPSSIEE
        +R  E P    G            P      +++P G     L   +  W  D   G F  +    + W   +N        + W        + +S  +
Subjt:  VR--EQPQRCLGQVEQALDSEDFHPIDAEDILSIPYG---SPLAKDEIIWSLDS-KGRFSMKVLIGLRWSWNLNLTHPPFAAV-WG---EENLSPSSIEE

Query:  INRDIDRVWEDLPEPLPFAELSKKMENHMIHQSWKPLPADCWKLNSDASISASGSSCDVGWVVRDSFSSLICRGALSIKDIWSVNSLETLAIREGLE
        +     ++  D   PLP  + S          SW P P+  + +N+DAS+      C +G V+RD   +++    + I    SVN  E+LAIR GL+
Subjt:  INRDIDRVWEDLPEPLPFAELSKKMENHMIHQSWKPLPADCWKLNSDASISASGSSCDVGWVVRDSFSSLICRGALSIKDIWSVNSLETLAIREGLE

A0A803PM68 Uncharacterized protein7.1e-2831.4Show/hide
Query:  FWNTVGIKAKAEELSRILGIGRTDSLGQYLGMPSQGGRNMCRIFNQVVDK---------------GGKEILIKAIAQAIHTLNMSCFRLPLKISNEINKL
        F  TV    K+ +L+ ++G+   D+ G+YLG+PS  GR   + F  +  K                GKEILIKAI QAI T  MSCFRLP K  N I+ +
Subjt:  FWNTVGIKAKAEELSRILGIGRTDSLGQYLGMPSQGGRNMCRIFNQVVDK---------------GGKEILIKAIAQAIHTLNMSCFRLPLKISNEINKL

Query:  CARFWSGSHGDKDKAHWLSW--------------------------------------------KKIYFKDENFLEAKAGPGASLVWISIVWGKDLFRKG
         ARFW GS     K HW  W                                            K  YF +   LEAK+G  AS VW S+VWGK + +KG
Subjt:  CARFWSGSHGDKDKAHWLSW--------------------------------------------KKIYFKDENFLEAKAGPGASLVWISIVWGKDLFRKG

Query:  YRWRVGDDREVR--EQP-------------------------QRCLGQVEQALDSEDFHPIDAEDILSIPYGSPLAKDEIIWSLDSKGRFSMK
        YRWR+G+   VR  E P                          +  G+ ++      F+P DAE IL +P      +D+I+W     G +S++
Subjt:  YRWRVGDDREVR--EQP-------------------------QRCLGQVEQALDSEDFHPIDAEDILSIPYGSPLAKDEIIWSLDSKGRFSMK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGATCAAGAAGACAAGGCGATACGAGATTATTTCTTGTCTGAGTTACCATTGAACTACTATGGGATAACATATGAGCCAATAGCTACTGAACATTTTGAGCTCGA
GGCTAGTCTTATTCAAATGGTTCGAGAGAGTGCTTTCAAAGGACCTCCATCAAAAGATCCACACAGTCATTTGCATTCATTTTTGGATATCTATGGAACGGATCAGAGAT
TCCAAGCTTCGCCTAAAGCAAGCAATGAAAAATCTCAAGTGGAAGTGGAGTCTCAACCAGAAGAAACAATTCATCAAGAGAAGAAAAATTCTTTGAAGGACATGTTAGGA
AAGTTTATTGAAGAGACCAAGAATTTGTCGAATGATGTTCGAGCTTGTCTCTCTCTCAGATTTGCCGCCTCCGTGAGTCCAAATGCATGTCGTCTATCGTCGTTCGTGAA
TCGCCGCCGGTCAGTCCCATCTCAGATGGTAACCTCCAGGAAGAAGAAGAATGAAATTAATGGTATCTTAGATAATGAAAGGTGTTGGAAGGATAAAGAAAAAGATATTG
GAGAGGTGGTTGGCAACTATTTCTCCAAGCTTTTTAGCTCCATTCAGCCCTCCGAATACCTAATGGACAGAGCAGTAGATGGGGTTGAGCATGTCCTTTTAGATCAGCAG
AATCAGCAGCTCAGGAAGAAAGCCGTGAAAGAGATGCATCCTACCAATGCCCTCGGGCCCGATGATGCTCATGCCCTCTTTTATAAAAAATTTTGGAATACAGTGGGGAT
CAAGGCCAAGGCTGAGGAGCTTAGTCGGATCCTTGGTATAGGGAGAACAGATTCGCTAGGTCAATACCTTGGTATGCCTTCCCAAGGGGGTCGGAACATGTGTAGAATCT
TCAACCAAGTCGTGGACAAGGGTGGTAAGGAGATCCTGATAAAAGCCATCGCCCAAGCTATCCATACCTTGAACATGAGCTGCTTTAGGCTTCCCCTCAAGATTAGCAAC
GAGATCAATAAGCTCTGTGCCAGATTCTGGTCGGGTTCCCACGGGGACAAGGACAAAGCGCACTGGCTAAGCTGGAAAAAGATATACTTTAAGGATGAGAATTTTCTAGA
GGCCAAAGCAGGTCCTGGAGCTTCGTTGGTGTGGATAAGCATTGTGTGGGGCAAAGATCTATTCCGAAAGGGATACAGGTGGAGAGTTGGGGACGACAGGGAGGTTCGTG
AGCAACCTCAAAGATGTCTCGGGCAGGTGGAACAGGCACTTGATTCAGAGGATTTCCACCCGATAGATGCAGAAGATATCCTTAGCATTCCCTATGGATCTCCCCTGGCT
AAGGATGAGATTATTTGGAGTTTAGACTCCAAGGGGAGATTCTCCATGAAAGTGCTCATAGGCTTGCGATGGAGTTGGAATCTAAATCTAACCCATCCCCCTTTTGCAGC
AGTGTGGGGAGAAGAGAATTTGAGTCCCTCTTCGATCGAAGAGATCAACAGAGACATTGATAGGGTGTGGGAAGACTTACCGGAGCCCTTACCTTTTGCGGAGCTGAGCA
AGAAGATGGAGAACCACATGATTCACCAATCGTGGAAGCCTTTGCCTGCGGATTGCTGGAAATTAAACTCAGACGCCTCAATTAGTGCTTCGGGATCGAGTTGCGATGTA
GGCTGGGTGGTTCGTGACTCCTTCAGTTCTCTGATCTGCAGAGGTGCCCTTTCGATCAAGGACATTTGGAGCGTGAATTCGCTTGAAACTCTCGCGATTAGGGAGGGGTT
GGAAACTCTGAAGGCCAAGAAAATCTTCCCTCCAAAATCCCTCTGTGTGGAATTGACATCTTCTAAGCGGGGAAGAGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGATCAAGAAGACAAGGCGATACGAGATTATTTCTTGTCTGAGTTACCATTGAACTACTATGGGATAACATATGAGCCAATAGCTACTGAACATTTTGAGCTCGA
GGCTAGTCTTATTCAAATGGTTCGAGAGAGTGCTTTCAAAGGACCTCCATCAAAAGATCCACACAGTCATTTGCATTCATTTTTGGATATCTATGGAACGGATCAGAGAT
TCCAAGCTTCGCCTAAAGCAAGCAATGAAAAATCTCAAGTGGAAGTGGAGTCTCAACCAGAAGAAACAATTCATCAAGAGAAGAAAAATTCTTTGAAGGACATGTTAGGA
AAGTTTATTGAAGAGACCAAGAATTTGTCGAATGATGTTCGAGCTTGTCTCTCTCTCAGATTTGCCGCCTCCGTGAGTCCAAATGCATGTCGTCTATCGTCGTTCGTGAA
TCGCCGCCGGTCAGTCCCATCTCAGATGGTAACCTCCAGGAAGAAGAAGAATGAAATTAATGGTATCTTAGATAATGAAAGGTGTTGGAAGGATAAAGAAAAAGATATTG
GAGAGGTGGTTGGCAACTATTTCTCCAAGCTTTTTAGCTCCATTCAGCCCTCCGAATACCTAATGGACAGAGCAGTAGATGGGGTTGAGCATGTCCTTTTAGATCAGCAG
AATCAGCAGCTCAGGAAGAAAGCCGTGAAAGAGATGCATCCTACCAATGCCCTCGGGCCCGATGATGCTCATGCCCTCTTTTATAAAAAATTTTGGAATACAGTGGGGAT
CAAGGCCAAGGCTGAGGAGCTTAGTCGGATCCTTGGTATAGGGAGAACAGATTCGCTAGGTCAATACCTTGGTATGCCTTCCCAAGGGGGTCGGAACATGTGTAGAATCT
TCAACCAAGTCGTGGACAAGGGTGGTAAGGAGATCCTGATAAAAGCCATCGCCCAAGCTATCCATACCTTGAACATGAGCTGCTTTAGGCTTCCCCTCAAGATTAGCAAC
GAGATCAATAAGCTCTGTGCCAGATTCTGGTCGGGTTCCCACGGGGACAAGGACAAAGCGCACTGGCTAAGCTGGAAAAAGATATACTTTAAGGATGAGAATTTTCTAGA
GGCCAAAGCAGGTCCTGGAGCTTCGTTGGTGTGGATAAGCATTGTGTGGGGCAAAGATCTATTCCGAAAGGGATACAGGTGGAGAGTTGGGGACGACAGGGAGGTTCGTG
AGCAACCTCAAAGATGTCTCGGGCAGGTGGAACAGGCACTTGATTCAGAGGATTTCCACCCGATAGATGCAGAAGATATCCTTAGCATTCCCTATGGATCTCCCCTGGCT
AAGGATGAGATTATTTGGAGTTTAGACTCCAAGGGGAGATTCTCCATGAAAGTGCTCATAGGCTTGCGATGGAGTTGGAATCTAAATCTAACCCATCCCCCTTTTGCAGC
AGTGTGGGGAGAAGAGAATTTGAGTCCCTCTTCGATCGAAGAGATCAACAGAGACATTGATAGGGTGTGGGAAGACTTACCGGAGCCCTTACCTTTTGCGGAGCTGAGCA
AGAAGATGGAGAACCACATGATTCACCAATCGTGGAAGCCTTTGCCTGCGGATTGCTGGAAATTAAACTCAGACGCCTCAATTAGTGCTTCGGGATCGAGTTGCGATGTA
GGCTGGGTGGTTCGTGACTCCTTCAGTTCTCTGATCTGCAGAGGTGCCCTTTCGATCAAGGACATTTGGAGCGTGAATTCGCTTGAAACTCTCGCGATTAGGGAGGGGTT
GGAAACTCTGAAGGCCAAGAAAATCTTCCCTCCAAAATCCCTCTGTGTGGAATTGACATCTTCTAAGCGGGGAAGAGGATGA
Protein sequenceShow/hide protein sequence
MADQEDKAIRDYFLSELPLNYYGITYEPIATEHFELEASLIQMVRESAFKGPPSKDPHSHLHSFLDIYGTDQRFQASPKASNEKSQVEVESQPEETIHQEKKNSLKDMLG
KFIEETKNLSNDVRACLSLRFAASVSPNACRLSSFVNRRRSVPSQMVTSRKKKNEINGILDNERCWKDKEKDIGEVVGNYFSKLFSSIQPSEYLMDRAVDGVEHVLLDQQ
NQQLRKKAVKEMHPTNALGPDDAHALFYKKFWNTVGIKAKAEELSRILGIGRTDSLGQYLGMPSQGGRNMCRIFNQVVDKGGKEILIKAIAQAIHTLNMSCFRLPLKISN
EINKLCARFWSGSHGDKDKAHWLSWKKIYFKDENFLEAKAGPGASLVWISIVWGKDLFRKGYRWRVGDDREVREQPQRCLGQVEQALDSEDFHPIDAEDILSIPYGSPLA
KDEIIWSLDSKGRFSMKVLIGLRWSWNLNLTHPPFAAVWGEENLSPSSIEEINRDIDRVWEDLPEPLPFAELSKKMENHMIHQSWKPLPADCWKLNSDASISASGSSCDV
GWVVRDSFSSLICRGALSIKDIWSVNSLETLAIREGLETLKAKKIFPPKSLCVELTSSKRGRG