; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012211 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012211
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein kinase domain-containing protein
Genome locationchr1:38663675..38667076
RNA-Seq ExpressionLag0012211
SyntenyLag0012211
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]4.2e-1936.41Show/hide
Query:  MFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYASTK-KGTLISGPTSVKKWKN
        MF++G+RLPL  F+Q+FL  TGLAPAQ+APNGW  +     L+ +    S    L+ VD  L+     R       FY  + K  G ++ GPTS+K W  
Subjt:  MFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYASTK-KGTLISGPTSVKKWKN

Query:  GWFFVSGNWLERTEDG-CFFGVPMRFGEYVPRNVRRSPTAKKFSKYVLTLEKINRH---GPFLVDQSVLEASGLARRRTINPEE
         WF+ SG WL + E G  FF VP RFG  V        T   F       E+  R    G  + D+ +LE+  L     + P E
Subjt:  GWFFVSGNWLERTEDG-CFFGVPMRFGEYVPRNVRRSPTAKKFSKYVLTLEKINRH---GPFLVDQSVLEASGLARRRTINPEE

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]2.4e-2733.33Show/hide
Query:  DSSDGEPPAHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGDVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCF
        + SD        +  S +    L  LRR + IP+++ LRLP   E  +NPP+G V  Y  MF++G+RLPL  F+Q+FL  TGLAPAQ+APNGW  +    
Subjt:  DSSDGEPPAHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGDVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCF

Query:  TLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYASTK-KGTLISGPTSVKKWKNGWFFVSGNWLERTEDG-CFFGVPMRFGEYVPRNVRRSPTA
         L+ +    S    L  VD  L+     R       FY  + K  G ++ GPTS+K W   WF+ SG WL + E G  FF VP RFG  V        T 
Subjt:  TLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYASTK-KGTLISGPTSVKKWKNGWFFVSGNWLERTEDG-CFFGVPMRFGEYVPRNVRRSPTA

Query:  KKFSKYVLTLEKINRH---GPFLVDQSVLEASGLARRRTINPEEMAFRGMYDSQRKRREARNRAGTSRASVDLTEDEAPRVTAETSRRPAT
          F       E+  R    G  + D+ +LE+  L     + P E        S R   E     G +      ++  A  + A  S +PAT
Subjt:  KKFSKYVLTLEKINRH---GPFLVDQSVLEASGLARRRTINPEEMAFRGMYDSQRKRREARNRAGTSRASVDLTEDEAPRVTAETSRRPAT

XP_031737075.1 uncharacterized protein LOC105435920 isoform X1 [Cucumis sativus]5.5e-1935.21Show/hide
Query:  EDEAPRVTAETSRRPATATRRTRYQTRSSVTETDLSTGIPVFALPEDYKSGGNEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQGMREFQRHLRDAFTKA
        +D+    T  TS       R  R   R+S +   L   I  F     Y S  +EVE L + F  W+GL     EGE G +DP+QG        R AFT+ 
Subjt:  EDEAPRVTAETSRRPATATRRTRYQTRSSVTETDLSTGIPVFALPEDYKSGGNEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQGMREFQRHLRDAFTKA

Query:  TASAISLANKINELQLNSVPRSEFIQAQERLNEANRLLEEVRAKLKSRDAELESTRAQLMEAKAHLASADFLTEEFKKTSEFYAMQDEIWNDGIKWAQKR
        + S   + +++N L LN VPR E ++A  RL EA   L   +A   S   +++  +AQL EAK+ L+ A  L E+F KT EF  MQ++I   G+ W+ ++
Subjt:  TASAISLANKINELQLNSVPRSEFIQAQERLNEANRLLEEVRAKLKSRDAELESTRAQLMEAKAHLASADFLTEEFKKTSEFYAMQDEIWNDGIKWAQKR

Query:  YNKHHPTVDGSFI
         +  HP +D SF+
Subjt:  YNKHHPTVDGSFI

XP_031737083.1 uncharacterized protein LOC105435920 isoform X3 [Cucumis sativus]5.5e-1935.21Show/hide
Query:  EDEAPRVTAETSRRPATATRRTRYQTRSSVTETDLSTGIPVFALPEDYKSGGNEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQGMREFQRHLRDAFTKA
        +D+    T  TS       R  R   R+S +   L   I  F     Y S  +EVE L + F  W+GL     EGE G +DP+QG        R AFT+ 
Subjt:  EDEAPRVTAETSRRPATATRRTRYQTRSSVTETDLSTGIPVFALPEDYKSGGNEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQGMREFQRHLRDAFTKA

Query:  TASAISLANKINELQLNSVPRSEFIQAQERLNEANRLLEEVRAKLKSRDAELESTRAQLMEAKAHLASADFLTEEFKKTSEFYAMQDEIWNDGIKWAQKR
        + S   + +++N L LN VPR E ++A  RL EA   L   +A   S   +++  +AQL EAK+ L+ A  L E+F KT EF  MQ++I   G+ W+ ++
Subjt:  TASAISLANKINELQLNSVPRSEFIQAQERLNEANRLLEEVRAKLKSRDAELESTRAQLMEAKAHLASADFLTEEFKKTSEFYAMQDEIWNDGIKWAQKR

Query:  YNKHHPTVDGSFI
         +  HP +D SF+
Subjt:  YNKHHPTVDGSFI

XP_031737089.1 uncharacterized protein LOC105435920 isoform X5 [Cucumis sativus]5.5e-1935.21Show/hide
Query:  EDEAPRVTAETSRRPATATRRTRYQTRSSVTETDLSTGIPVFALPEDYKSGGNEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQGMREFQRHLRDAFTKA
        +D+    T  TS       R  R   R+S +   L   I  F     Y S  +EVE L + F  W+GL     EGE G +DP+QG        R AFT+ 
Subjt:  EDEAPRVTAETSRRPATATRRTRYQTRSSVTETDLSTGIPVFALPEDYKSGGNEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQGMREFQRHLRDAFTKA

Query:  TASAISLANKINELQLNSVPRSEFIQAQERLNEANRLLEEVRAKLKSRDAELESTRAQLMEAKAHLASADFLTEEFKKTSEFYAMQDEIWNDGIKWAQKR
        + S   + +++N L LN VPR E ++A  RL EA   L   +A   S   +++  +AQL EAK+ L+ A  L E+F KT EF  MQ++I   G+ W+ ++
Subjt:  TASAISLANKINELQLNSVPRSEFIQAQERLNEANRLLEEVRAKLKSRDAELESTRAQLMEAKAHLASADFLTEEFKKTSEFYAMQDEIWNDGIKWAQKR

Query:  YNKHHPTVDGSFI
         +  HP +D SF+
Subjt:  YNKHHPTVDGSFI

TrEMBL top hitse value%identityAlignment
A0A2N9ETX6 Uncharacterized protein2.8e-2125.92Show/hide
Query:  MASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPPAHSSD----LSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGDVAFYH
        MA EGS    V S D+ E  SD    +      P IS SS+G  P         LS  + AD ++  RRKY IP+DV LR+P +DE   +   GDVAFY 
Subjt:  MASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPPAHSSD----LSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGDVAFYH

Query:  AMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINRNPAFGDLFYYASTKKG--TLISGPTSVKKWKNG
        A F  GVR PL   +++ L    LAP QLAPN W  ++G   +W  +  G   +T+D+ L  +   + PA    +  +  ++G   ++  P+S ++WK+ 
Subjt:  AMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINRNPAFGDLFYYASTKKG--TLISGPTSVKKWKNG

Query:  WFFVSG-NW--LERTEDGCFFGVPMRFGEYVPRNVRRSPTAKKFSKYVL-TLEKINRHGPFLVDQSVLEASGLARRRTINPEEMAFRGMYDSQRKRREAR
        + FV G NW  L+  +D  F  V   +G      +RR    +     VL  L     H    +   +L +          P E        ++++   A+
Subjt:  WFFVSG-NW--LERTEDGCFFGVPMRFGEYVPRNVRRSPTAKKFSKYVL-TLEKINRHGPFLVDQSVLEASGLARRRTINPEEMAFRGMYDSQRKRREAR

Query:  NRAGTSRASVDLTEDEAPRVTAETSRRPATATRRTRYQTRSSVTETDLSTGIPVFALPEDYKSGGNEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQG--
              +  +   +DEAP VT           ++ + +T S     + S   P    P+   +G     V     +  +  +S   + +LG  D      
Subjt:  NRAGTSRASVDLTEDEAPRVTAETSRRPATATRRTRYQTRSSVTETDLSTGIPVFALPEDYKSGGNEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQG--

Query:  MREFQRH-LRDAFTKATASAISLANKINELQLNSVPRSEFIQAQERL----NEANRLLEEVRAKLKSRDAELESTRAQLMEA--------------KAHL
        +R    H L    T+ T  A         L    V  SE  QA +RL    NE     + V  +L +  A+L      L  A              ++ L
Subjt:  MREFQRH-LRDAFTKATASAISLANKINELQLNSVPRSEFIQAQERL----NEANRLLEEVRAKLKSRDAELESTRAQLMEA--------------KAHL

Query:  ASADF-LTEEFKKTSEFYAMQDEIWNDGIKWAQKRYNKHHPTVD
        A+A+    E+FK +  F     + +  G  + +K+  + +P +D
Subjt:  ASADF-LTEEFKKTSEFYAMQDEIWNDGIKWAQKRYNKHHPTVD

A0A2N9H8T4 Uncharacterized protein2.8e-2125.92Show/hide
Query:  MASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPPAHSSD----LSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGDVAFYH
        MA EGS    V S D+ E  SD    +      P IS SS+G  P         LS  + AD ++  RRKY IP+DV LR+P +DE   +   GDVAFY 
Subjt:  MASEGS----VTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPPAHSSD----LSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGDVAFYH

Query:  AMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINRNPAFGDLFYYASTKKG--TLISGPTSVKKWKNG
        A F  GVR PL   +++ L    LAP QLAPN W  ++G   +W  +  G   +T+D+ L  +   + PA    +  +  ++G   ++  P+S ++WK+ 
Subjt:  AMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINRNPAFGDLFYYASTKKG--TLISGPTSVKKWKNG

Query:  WFFVSG-NW--LERTEDGCFFGVPMRFGEYVPRNVRRSPTAKKFSKYVL-TLEKINRHGPFLVDQSVLEASGLARRRTINPEEMAFRGMYDSQRKRREAR
        + FV G NW  L+  +D  F  V   +G      +RR    +     VL  L     H    +   +L +          P E        ++++   A+
Subjt:  WFFVSG-NW--LERTEDGCFFGVPMRFGEYVPRNVRRSPTAKKFSKYVL-TLEKINRHGPFLVDQSVLEASGLARRRTINPEEMAFRGMYDSQRKRREAR

Query:  NRAGTSRASVDLTEDEAPRVTAETSRRPATATRRTRYQTRSSVTETDLSTGIPVFALPEDYKSGGNEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQG--
              +  +   +DEAP VT           ++ + +T S     + S   P    P+   +G     V     +  +  +S   + +LG  D      
Subjt:  NRAGTSRASVDLTEDEAPRVTAETSRRPATATRRTRYQTRSSVTETDLSTGIPVFALPEDYKSGGNEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQG--

Query:  MREFQRH-LRDAFTKATASAISLANKINELQLNSVPRSEFIQAQERL----NEANRLLEEVRAKLKSRDAELESTRAQLMEA--------------KAHL
        +R    H L    T+ T  A         L    V  SE  QA +RL    NE     + V  +L +  A+L      L  A              ++ L
Subjt:  MREFQRH-LRDAFTKATASAISLANKINELQLNSVPRSEFIQAQERL----NEANRLLEEVRAKLKSRDAELESTRAQLMEA--------------KAHL

Query:  ASADF-LTEEFKKTSEFYAMQDEIWNDGIKWAQKRYNKHHPTVD
        A+A+    E+FK +  F     + +  G  + +K+  + +P +D
Subjt:  ASADF-LTEEFKKTSEFYAMQDEIWNDGIKWAQKRYNKHHPTVD

A0A2N9I8F6 Uncharacterized protein4.4e-2232.09Show/hide
Query:  GGAPDTSGVPAWLQARLGLGKPRPTS-PDGTPRSCHVKYLAWLLVRAILLLSFATH--------MASEG----SVTSPDVEESYSDDGPSSSGCFVDPEI
        G   + +G+      R    + RP S  D  P SC    L WL      L+S A +        M SEG    SV S D+ E  SD    +      P I
Subjt:  GGAPDTSGVPAWLQARLGLGKPRPTS-PDGTPRSCHVKYLAWLLVRAILLLSFATH--------MASEG----SVTSPDVEESYSDDGPSSSGCFVDPEI

Query:  SDSSDGEPPAHS----SDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGDVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCH
        S SS+   P  S    S LS  +  D L+  RRKY IP+D+ LR+P +DE   +   GDVAFY A F  GVR PL   +++ L    L+P QLAPN W  
Subjt:  SDSSDGEPPAHS----SDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGDVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCH

Query:  LIGCFTLW-AMHGGGSLMTVDDFLSLHTINRNPAFGDLFYYASTKKG--TLISGPTSVKKWKNGWFFVSG-NW--LERTEDGCFFGVPMRFGEYVPRNVR
        ++GC  +W  +  G   +T+D+ L  +   + PA    +     ++G   ++  P+S ++WK+ + FV G NW  L   +D  F  V  ++G      +R
Subjt:  LIGCFTLW-AMHGGGSLMTVDDFLSLHTINRNPAFGDLFYYASTKKG--TLISGPTSVKKWKNGWFFVSG-NW--LERTEDGCFFGVPMRFGEYVPRNVR

Query:  RSPTAKKFSKYVLTLEKINRH
        R   +      VL     N+H
Subjt:  RSPTAKKFSKYVLTLEKINRH

A0A2N9IZM5 Uncharacterized protein7.4e-2232.84Show/hide
Query:  WLLVRAILLLSFATHMASEGSVTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPPAHS----SDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFE
        W +V+ +++    T   S  SV S D+ E  SD G  S      P +  SS    P  S    S LS  +  D L+  RRK+ IP+DV LR+P +DE   
Subjt:  WLLVRAILLLSFATHMASEGSVTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPPAHS----SDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFE

Query:  NPPDGDVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINRNPAFGDLFYYASTKKG--TLIS
        +   GDVAFY A F  GVR P+   +++ L    LAP QLAPN W  ++GC  +W  +  G   +TVD+ L  +   + PA    +     ++G   ++ 
Subjt:  NPPDGDVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLW-AMHGGGSLMTVDDFLSLHTINRNPAFGDLFYYASTKKG--TLIS

Query:  GPTSVKKWKNGWFFVSG-NW--LERTEDGCFFGVPMRFGEYVPRNVRRSPTAKKFSKYVLTLEKINRH
         P+S ++WK+ + FV G NW  L   +D  F  V   +G      +RR   +      VL     N+H
Subjt:  GPTSVKKWKNGWFFVSG-NW--LERTEDGCFFGVPMRFGEYVPRNVRRSPTAKKFSKYVLTLEKINRH

A0A6J1DXS5 uncharacterized protein LOC1110255021.2e-2733.33Show/hide
Query:  DSSDGEPPAHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGDVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCF
        + SD        +  S +    L  LRR + IP+++ LRLP   E  +NPP+G V  Y  MF++G+RLPL  F+Q+FL  TGLAPAQ+APNGW  +    
Subjt:  DSSDGEPPAHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGDVAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCF

Query:  TLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYASTK-KGTLISGPTSVKKWKNGWFFVSGNWLERTEDG-CFFGVPMRFGEYVPRNVRRSPTA
         L+ +    S    L  VD  L+     R       FY  + K  G ++ GPTS+K W   WF+ SG WL + E G  FF VP RFG  V        T 
Subjt:  TLWAMHGGGS----LMTVDDFLSLHTINRNPAFGDLFYYASTK-KGTLISGPTSVKKWKNGWFFVSGNWLERTEDG-CFFGVPMRFGEYVPRNVRRSPTA

Query:  KKFSKYVLTLEKINRH---GPFLVDQSVLEASGLARRRTINPEEMAFRGMYDSQRKRREARNRAGTSRASVDLTEDEAPRVTAETSRRPAT
          F       E+  R    G  + D+ +LE+  L     + P E        S R   E     G +      ++  A  + A  S +PAT
Subjt:  KKFSKYVLTLEKINRH---GPFLVDQSVLEASGLARRRTINPEEMAFRGMYDSQRKRREARNRAGTSRASVDLTEDEAPRVTAETSRRPAT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G46696.1 Protein of unknown function, DUF6014.1e-0434.83Show/hide
Query:  DGEPPAHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGDVAFYHAMFKF-GVRLPLPLFLQDFLVCTGLAPAQLAPN
        DG+   + + LS   T+ RL  LR  + IP  + L  P      ENPP G    +   F   G+  PLP  L D +   G+A  QL PN
Subjt:  DGEPPAHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGDVAFYHAMFKF-GVRLPLPLFLQDFLVCTGLAPAQLAPN

AT1G51172.1 unknown protein4.6e-0827.84Show/hide
Query:  SSDGEPPAHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGDVAFYHAMF-KFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCF
        S DG+   + + LS   T  RL  LR  + IP  + L  P    + E+PP G    +   F + G+  PLP  L D +   G+A  QL PN    ++   
Subjt:  SSDGEPPAHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGDVAFYHAMF-KFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCF

Query:  TLWAMHGGGSLMTVDDFLSLHTINRNPAFGDLFYYASTKKGTLI--SGPTSVKKWKNGWFFVSGNWLERTEDGCFF
        TL      G  + + DFL L+ + ++    +  ++ S +KG  +    P   + W+  +FF   N L   E    F
Subjt:  TLWAMHGGGSLMTVDDFLSLHTINRNPAFGDLFYYASTKKGTLI--SGPTSVKKWKNGWFFVSGNWLERTEDGCFF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGCTTCATCAAATCGACTTCTCCCTGAAAAGTCAGGGACGGAGGGACCTCCCTCCTTTGATTTTTCAGCGAAACCTGTTGATGCCAAAATCTGGTCAAGCGCTCAC
CTGCCCTTCACGTGACACCAACAATAAATTTGTTGAGATATGGAGAAGACGTGACCTGAAAAGAAAGAAAGGTTTTTTCCTTTCCGCATCGTCCACCGCACGTAGCCTCC
TGCTACTGGGCCCAAGGGGATTGACCCGGGCAGCCCAAGCGGCCAGTGGGATTAATTGGCCTGGTCGACCTTGGCATTGGGCCGAGGCCGAGGTGTCATTCCACCTCTTG
CGGGCTTTCATCTCCTCGGGCCGATCTCAGCTGGGTCCGAAACCCTCCTTCTTTGGTGTACCTGCCAAGCCAAAATTACCCACAACATTAAGCCCCCGCTCTCTTAACCG
GGCTCCGGAAGTTCAAACTTTGAACTTGAATTTCCCGCCATTTCGATCAGACAAATGCTTCCCAAAGGGAGAGGAGAGAGAAAAAACGTGGACTGAGGCTCATCTGCCTC
GGTCTGGCGTGTCAGGCGGCGCACCTGACACGTCTGGCGTGCCTGCCTGGCTGCAGGCACGCCTCGGCCTCGGCAAGCCGAGGCCGACGTCACCTGACGGGACCCCACGG
TCCTGTCATGTCAAGTACCTTGCCTGGCTTCTTGTGAGAGCTATCCTTTTACTTTCTTTTGCTACGCATATGGCCAGCGAAGGCTCTGTTACCTCTCCGGACGTGGAGGA
ATCTTATTCCGATGACGGTCCCTCAAGCTCAGGCTGCTTTGTGGACCCGGAGATTTCGGATAGCAGTGATGGGGAGCCCCCTGCACACTCATCGGACTTATCATCCTCGT
TGACCGCAGACCGCTTAGAGTTCTTGCGGCGCAAGTATGATATTCCCGACGATGTGCATCTTCGGCTCCCTAATGCTGACGAGAACTTTGAGAATCCCCCGGATGGAGAT
GTTGCTTTTTACCATGCCATGTTTAAGTTTGGGGTTCGCTTGCCGCTGCCATTGTTTTTGCAAGACTTCCTGGTCTGTACGGGTCTAGCCCCTGCCCAGCTTGCTCCAAA
CGGGTGGTGCCACCTCATCGGTTGCTTCACCCTTTGGGCGATGCACGGTGGGGGATCCCTAATGACTGTTGACGATTTCTTGTCCTTGCATACCATCAATCGCAATCCCG
CTTTTGGTGACCTTTTTTATTACGCAAGTACCAAAAAAGGCACCTTAATCAGCGGACCCACTTCCGTAAAAAAGTGGAAAAATGGTTGGTTCTTCGTTAGTGGCAATTGG
CTTGAAAGAACGGAAGACGGTTGTTTTTTCGGGGTTCCAATGAGGTTTGGAGAATATGTGCCTCGCAACGTTCGACGCTCCCCGACTGCTAAGAAGTTTTCCAAATACGT
CCTGACCCTCGAAAAGATTAATCGCCACGGTCCCTTTTTAGTCGATCAAAGTGTCCTCGAAGCGTCTGGGCTAGCAAGGCGCCGCACCATCAATCCTGAAGAAATGGCCT
TCCGTGGAATGTACGATTCTCAGCGGAAAAGACGTGAAGCACGTAACAGGGCTGGAACCTCCCGGGCCTCTGTGGACCTAACCGAGGATGAGGCTCCACGGGTTACTGCT
GAGACCTCTCGTCGCCCTGCAACTGCTACCCGCAGAACCCGGTATCAGACGCGCTCCTCGGTCACCGAGACAGATCTTAGCACAGGCATCCCGGTCTTTGCCCTTCCCGA
GGACTACAAGAGCGGCGGCAATGAGGTAGAGGTCCTAACCCAAAACTTCATGTGCTGGCAAGGGTTGCAATCTCGGAGGCCAGAAGGTGAGCTAGGAGTTGAGGATCCTG
CCCAAGGAATGCGAGAATTCCAGAGGCATCTTCGTGACGCTTTCACGAAGGCCACTGCCTCTGCGATTAGCCTGGCAAACAAAATCAACGAGCTTCAATTGAACAGCGTC
CCACGGAGCGAGTTCATTCAAGCCCAAGAGAGGCTCAACGAGGCCAACCGCCTGCTAGAGGAAGTGCGAGCGAAACTCAAATCCAGGGATGCTGAGCTGGAGTCCACAAG
AGCTCAACTCATGGAGGCTAAAGCCCATTTGGCCAGCGCCGATTTTCTGACTGAAGAATTCAAGAAAACCAGCGAGTTCTATGCCATGCAAGACGAGATATGGAACGATG
GCATAAAGTGGGCACAAAAGAGATACAACAAACACCACCCCACCGTGGATGGTTCCTTCATCCAAGAAGACCTTGCTGCTCTCGCCGCCAATCCTGATGCCTTTGCCTCT
TCTGATGACTCCTCCGGCGGTAGAGATCATATGGACCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGGCTTCATCAAATCGACTTCTCCCTGAAAAGTCAGGGACGGAGGGACCTCCCTCCTTTGATTTTTCAGCGAAACCTGTTGATGCCAAAATCTGGTCAAGCGCTCAC
CTGCCCTTCACGTGACACCAACAATAAATTTGTTGAGATATGGAGAAGACGTGACCTGAAAAGAAAGAAAGGTTTTTTCCTTTCCGCATCGTCCACCGCACGTAGCCTCC
TGCTACTGGGCCCAAGGGGATTGACCCGGGCAGCCCAAGCGGCCAGTGGGATTAATTGGCCTGGTCGACCTTGGCATTGGGCCGAGGCCGAGGTGTCATTCCACCTCTTG
CGGGCTTTCATCTCCTCGGGCCGATCTCAGCTGGGTCCGAAACCCTCCTTCTTTGGTGTACCTGCCAAGCCAAAATTACCCACAACATTAAGCCCCCGCTCTCTTAACCG
GGCTCCGGAAGTTCAAACTTTGAACTTGAATTTCCCGCCATTTCGATCAGACAAATGCTTCCCAAAGGGAGAGGAGAGAGAAAAAACGTGGACTGAGGCTCATCTGCCTC
GGTCTGGCGTGTCAGGCGGCGCACCTGACACGTCTGGCGTGCCTGCCTGGCTGCAGGCACGCCTCGGCCTCGGCAAGCCGAGGCCGACGTCACCTGACGGGACCCCACGG
TCCTGTCATGTCAAGTACCTTGCCTGGCTTCTTGTGAGAGCTATCCTTTTACTTTCTTTTGCTACGCATATGGCCAGCGAAGGCTCTGTTACCTCTCCGGACGTGGAGGA
ATCTTATTCCGATGACGGTCCCTCAAGCTCAGGCTGCTTTGTGGACCCGGAGATTTCGGATAGCAGTGATGGGGAGCCCCCTGCACACTCATCGGACTTATCATCCTCGT
TGACCGCAGACCGCTTAGAGTTCTTGCGGCGCAAGTATGATATTCCCGACGATGTGCATCTTCGGCTCCCTAATGCTGACGAGAACTTTGAGAATCCCCCGGATGGAGAT
GTTGCTTTTTACCATGCCATGTTTAAGTTTGGGGTTCGCTTGCCGCTGCCATTGTTTTTGCAAGACTTCCTGGTCTGTACGGGTCTAGCCCCTGCCCAGCTTGCTCCAAA
CGGGTGGTGCCACCTCATCGGTTGCTTCACCCTTTGGGCGATGCACGGTGGGGGATCCCTAATGACTGTTGACGATTTCTTGTCCTTGCATACCATCAATCGCAATCCCG
CTTTTGGTGACCTTTTTTATTACGCAAGTACCAAAAAAGGCACCTTAATCAGCGGACCCACTTCCGTAAAAAAGTGGAAAAATGGTTGGTTCTTCGTTAGTGGCAATTGG
CTTGAAAGAACGGAAGACGGTTGTTTTTTCGGGGTTCCAATGAGGTTTGGAGAATATGTGCCTCGCAACGTTCGACGCTCCCCGACTGCTAAGAAGTTTTCCAAATACGT
CCTGACCCTCGAAAAGATTAATCGCCACGGTCCCTTTTTAGTCGATCAAAGTGTCCTCGAAGCGTCTGGGCTAGCAAGGCGCCGCACCATCAATCCTGAAGAAATGGCCT
TCCGTGGAATGTACGATTCTCAGCGGAAAAGACGTGAAGCACGTAACAGGGCTGGAACCTCCCGGGCCTCTGTGGACCTAACCGAGGATGAGGCTCCACGGGTTACTGCT
GAGACCTCTCGTCGCCCTGCAACTGCTACCCGCAGAACCCGGTATCAGACGCGCTCCTCGGTCACCGAGACAGATCTTAGCACAGGCATCCCGGTCTTTGCCCTTCCCGA
GGACTACAAGAGCGGCGGCAATGAGGTAGAGGTCCTAACCCAAAACTTCATGTGCTGGCAAGGGTTGCAATCTCGGAGGCCAGAAGGTGAGCTAGGAGTTGAGGATCCTG
CCCAAGGAATGCGAGAATTCCAGAGGCATCTTCGTGACGCTTTCACGAAGGCCACTGCCTCTGCGATTAGCCTGGCAAACAAAATCAACGAGCTTCAATTGAACAGCGTC
CCACGGAGCGAGTTCATTCAAGCCCAAGAGAGGCTCAACGAGGCCAACCGCCTGCTAGAGGAAGTGCGAGCGAAACTCAAATCCAGGGATGCTGAGCTGGAGTCCACAAG
AGCTCAACTCATGGAGGCTAAAGCCCATTTGGCCAGCGCCGATTTTCTGACTGAAGAATTCAAGAAAACCAGCGAGTTCTATGCCATGCAAGACGAGATATGGAACGATG
GCATAAAGTGGGCACAAAAGAGATACAACAAACACCACCCCACCGTGGATGGTTCCTTCATCCAAGAAGACCTTGCTGCTCTCGCCGCCAATCCTGATGCCTTTGCCTCT
TCTGATGACTCCTCCGGCGGTAGAGATCATATGGACCTCTGA
Protein sequenceShow/hide protein sequence
MWLHQIDFSLKSQGRRDLPPLIFQRNLLMPKSGQALTCPSRDTNNKFVEIWRRRDLKRKKGFFLSASSTARSLLLLGPRGLTRAAQAASGINWPGRPWHWAEAEVSFHLL
RAFISSGRSQLGPKPSFFGVPAKPKLPTTLSPRSLNRAPEVQTLNLNFPPFRSDKCFPKGEEREKTWTEAHLPRSGVSGGAPDTSGVPAWLQARLGLGKPRPTSPDGTPR
SCHVKYLAWLLVRAILLLSFATHMASEGSVTSPDVEESYSDDGPSSSGCFVDPEISDSSDGEPPAHSSDLSSSLTADRLEFLRRKYDIPDDVHLRLPNADENFENPPDGD
VAFYHAMFKFGVRLPLPLFLQDFLVCTGLAPAQLAPNGWCHLIGCFTLWAMHGGGSLMTVDDFLSLHTINRNPAFGDLFYYASTKKGTLISGPTSVKKWKNGWFFVSGNW
LERTEDGCFFGVPMRFGEYVPRNVRRSPTAKKFSKYVLTLEKINRHGPFLVDQSVLEASGLARRRTINPEEMAFRGMYDSQRKRREARNRAGTSRASVDLTEDEAPRVTA
ETSRRPATATRRTRYQTRSSVTETDLSTGIPVFALPEDYKSGGNEVEVLTQNFMCWQGLQSRRPEGELGVEDPAQGMREFQRHLRDAFTKATASAISLANKINELQLNSV
PRSEFIQAQERLNEANRLLEEVRAKLKSRDAELESTRAQLMEAKAHLASADFLTEEFKKTSEFYAMQDEIWNDGIKWAQKRYNKHHPTVDGSFIQEDLAALAANPDAFAS
SDDSSGGRDHMDL