CuGenDBv2

Lag0018350 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018350
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: DUF4283 domain-containing protein
Genome location: chr5:23823191..23826132
RNA-Seq Expression: Lag0018350
Synteny: Lag0018350
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits | e value | % identity
KAA0040039.1 hypothetical protein E6C27_scaffold366G00060 [Cucumis melo var. makuwa] | 1.9e-41 | 28.77
Query:  SVGLELEQSQESADSIDDLVITPSTQKFFRKTSCENGFIWLQKVSNKRGVFVEKAKVASSGNRSNLIIPRGEDSEGWRAFLTMIN---------------
        S+ + LE  +    S   L+ TP T +FF +   E   +W+QK  N++G   E  +V   G +  +++P G +  GW  F+++++               
Subjt:  SVGLELEQSQESADSIDDLVITPSTQKFFRKTSCENGFIWLQKVSNKRGVFVEKAKVASSGNRSNLIIPRGEDSEGWRAFLTMIN---------------

Query:  -----DFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKVDRPSSS---NI-------VTFPRD--------QRNEHNVKGRTVGPSQTPPIQTAECKRKDL
             D FSS +   + K+      +S+A+AV      D  S+S   NI        TF  D        +R  H+   R V         T   K    
Subjt:  -----DFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKVDRPSSS---NI-------VTFPRD--------QRNEHNVKGRTVGPSQTPPIQTAECKRKDL

Query:  IINPFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCME
           PFH DKAL+   + E A+++  NKGW+ +G F +KFE W  + H    V+PSYGGW++ R +PL  W +++F  IG+A GG++E   +   L    E
Subjt:  IINPFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCME

Query:  VVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVFFGEDDNREDLCLVDKNRIEDRKAKSQIQGVVIKTSHVS---
          IK+K NY GFIPA I +  ++    I QV+   + +    R   IHG FT EAA+ F   + N E     D   +   KA S + G      ++    
Subjt:  VVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVFFGEDDNREDLCLVDKNRIEDRKAKSQIQGVVIKTSHVS---

Query:  --------ISNNGEGSAAEREKGKSFVCRKRVIEDKVG
                ++ +G+  ++E+ K K+      VI  K G
Subjt:  --------ISNNGEGSAAEREKGKSFVCRKRVIEDKVG

KAA0041398.1 hypothetical protein E6C27_scaffold206G00440 [Cucumis melo var. makuwa] | 2.3e-42 | 29.84
Query:  SVGLELEQSQESADSIDDLVITPSTQKFFRKTSCENGFIWLQKVSNKRGVFVEKAKVASSGNRSNLIIPRGEDSEGWRAFLTMINDFFSSKEEQHEDKKV
        S+ + L+  +    +   L+ TP T +FF +    +  +W+QK+ N+RG   E  +V   G +  +++P G D  GW  F  M+        ++  DKK 
Subjt:  SVGLELEQSQESADSIDDLVITPSTQKFFRKTSCENGFIWLQKVSNKRGVFVEKAKVASSGNRSNLIIPRGEDSEGWRAFLTMINDFFSSKEEQHEDKKV

Query:  VYNPQKSFADAVKGRHKVDRPSSSN------------IVTFPRDQRNEHNVKGRTVGPSQTPPIQTAECKRK-----------------------DLIIN
        +    + + +  KG+ K+ +P  S+            +V+      ++++    T    +  P   +E +++                       D II+
Subjt:  VYNPQKSFADAVKGRHKVDRPSSSN------------IVTFPRDQRNEHNVKGRTVGPSQTPPIQTAECKRK-----------------------DLIIN

Query:  ----------------PFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIE
                        PFH DKALL   D + A+++  N GW+ +G F +KFE W   +H    V+PSYGGW RFR IPL  W ++TF  IGEA GG+I+
Subjt:  ----------------PFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIE

Query:  CDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF
           + ++ +   E +IKVK NY GF+PA I I  E+G V I Q+VT  + + L  R   IHG F   AA  F
Subjt:  CDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF

KAA0044449.1 hypothetical protein E6C27_scaffold46G001820 [Cucumis melo var. makuwa] | 1.0e-42 | 33.23
Query:  LVITPSTQKFFRKTSCENGFIWLQKVSNKRGVFVEKAKVASSGNRSNLIIPRGEDSEGWRAFLTMINDFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKV
        L+ TP T +FF +    +  +W+Q + N+RG   E  +V   G +  +++P G D  GW  F    ND  + K+    DKK    P + + +  KG+ K+
Subjt:  LVITPSTQKFFRKTSCENGFIWLQKVSNKRGVFVEKAKVASSGNRSNLIIPRGEDSEGWRAFLTMINDFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKV

Query:  DRPSSSNIVTFPRDQRNEHNVKGRTVGPSQTPPIQTAECK-----------------------------RKDLIIN--PFHPDKALLKCPDAEFARIVAH
         +   S+      D  +        V  S +     ++C                              +KD      PFH DKALL   D E A+++  
Subjt:  DRPSSSNIVTFPRDQRNEHNVKGRTVGPSQTPPIQTAECK-----------------------------RKDLIIN--PFHPDKALLKCPDAEFARIVAH

Query:  NKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGS
        N GW+ +G F +KFE W    H    V+PSYGGW RFR IPL  W ++TF  IGEAYGG+I+   + ++ +   E +IKVK NY GF+PA I I  E+G 
Subjt:  NKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGS

Query:  VAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF
          I Q VT    + L  R   IHG FT  AA  F
Subjt:  VAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF

TYK29576.1 hypothetical protein E5676_scaffold655G001820 [Cucumis melo var. makuwa] | 1.0e-42 | 33.23
Query:  LVITPSTQKFFRKTSCENGFIWLQKVSNKRGVFVEKAKVASSGNRSNLIIPRGEDSEGWRAFLTMINDFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKV
        L+ TP T +FF +    +  +W+Q + N+RG   E  +V   G +  +++P G D  GW  F    ND  + K+    DKK    P + + +  KG+ K+
Subjt:  LVITPSTQKFFRKTSCENGFIWLQKVSNKRGVFVEKAKVASSGNRSNLIIPRGEDSEGWRAFLTMINDFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKV

Query:  DRPSSSNIVTFPRDQRNEHNVKGRTVGPSQTPPIQTAECK-----------------------------RKDLIIN--PFHPDKALLKCPDAEFARIVAH
         +   S+      D  +        V  S +     ++C                              +KD      PFH DKALL   D E A+++  
Subjt:  DRPSSSNIVTFPRDQRNEHNVKGRTVGPSQTPPIQTAECK-----------------------------RKDLIIN--PFHPDKALLKCPDAEFARIVAH

Query:  NKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGS
        N GW+ +G F +KFE W    H    V+PSYGGW RFR IPL  W ++TF  IGEAYGG+I+   + ++ +   E +IKVK NY GF+PA I I  E+G 
Subjt:  NKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGS

Query:  VAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF
          I Q VT    + L  R   IHG FT  AA  F
Subjt:  VAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF

XP_022149859.1 uncharacterized protein LOC111018186 [Momordica charantia] | 6.6e-42 | 39.03
Query:  SEGWRAFLTMINDFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKVDRPSSSNIVTFPRDQRNEHNVKGRTVGPSQTPPIQTAECKRKDLIINPFHPDKAL
        S  W++   M+    ++K +  ++++++   Q+S      G  +V R +    +   R  R+ H+   R +           E      IINPF  DKAL
Subjt:  SEGWRAFLTMINDFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKVDRPSSSNIVTFPRDQRNEHNVKGRTVGPSQTPPIQTAECKRKDLIINPFHPDKAL

Query:  LKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCG
        +KCP  + A ++  NKGW   G  T+K E W+  LHGR  + PSYG WV+ RNIPL  W + TFKAIG A GG+I+ DD     + C +V IKVKSNYCG
Subjt:  LKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCG

Query:  FIPAEIDIIQEDGSVAI-AQVVTFEDPQLLESRRVYIHGGFTSEAARVFFGEDDNREDLCLVDKNRIED
        FIPAEI  +  DG +   A+VV+FED + L  + V IHGGF+SEAAR F  +    +    +D+ R+E+
Subjt:  FIPAEIDIIQEDGSVAI-AQVVTFEDPQLLESRRVYIHGGFTSEAARVFFGEDDNREDLCLVDKNRIED

TrEMBL top hits | e value | % identity
A0A5A7TEP0 DUF4283 domain-containing protein | 1.1e-42 | 29.84
Query:  SVGLELEQSQESADSIDDLVITPSTQKFFRKTSCENGFIWLQKVSNKRGVFVEKAKVASSGNRSNLIIPRGEDSEGWRAFLTMINDFFSSKEEQHEDKKV
        S+ + L+  +    +   L+ TP T +FF +    +  +W+QK+ N+RG   E  +V   G +  +++P G D  GW  F  M+        ++  DKK 
Subjt:  SVGLELEQSQESADSIDDLVITPSTQKFFRKTSCENGFIWLQKVSNKRGVFVEKAKVASSGNRSNLIIPRGEDSEGWRAFLTMINDFFSSKEEQHEDKKV

Query:  VYNPQKSFADAVKGRHKVDRPSSSN------------IVTFPRDQRNEHNVKGRTVGPSQTPPIQTAECKRK-----------------------DLIIN
        +    + + +  KG+ K+ +P  S+            +V+      ++++    T    +  P   +E +++                       D II+
Subjt:  VYNPQKSFADAVKGRHKVDRPSSSN------------IVTFPRDQRNEHNVKGRTVGPSQTPPIQTAECKRK-----------------------DLIIN

Query:  ----------------PFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIE
                        PFH DKALL   D + A+++  N GW+ +G F +KFE W   +H    V+PSYGGW RFR IPL  W ++TF  IGEA GG+I+
Subjt:  ----------------PFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIE

Query:  CDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF
           + ++ +   E +IKVK NY GF+PA I I  E+G V I Q+VT  + + L  R   IHG F   AA  F
Subjt:  CDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF

A0A5A7TFK7 DUF4283 domain-containing protein | 9.2e-42 | 28.77
Query:  SVGLELEQSQESADSIDDLVITPSTQKFFRKTSCENGFIWLQKVSNKRGVFVEKAKVASSGNRSNLIIPRGEDSEGWRAFLTMIN---------------
        S+ + LE  +    S   L+ TP T +FF +   E   +W+QK  N++G   E  +V   G +  +++P G +  GW  F+++++               
Subjt:  SVGLELEQSQESADSIDDLVITPSTQKFFRKTSCENGFIWLQKVSNKRGVFVEKAKVASSGNRSNLIIPRGEDSEGWRAFLTMIN---------------

Query:  -----DFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKVDRPSSS---NI-------VTFPRD--------QRNEHNVKGRTVGPSQTPPIQTAECKRKDL
             D FSS +   + K+      +S+A+AV      D  S+S   NI        TF  D        +R  H+   R V         T   K    
Subjt:  -----DFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKVDRPSSS---NI-------VTFPRD--------QRNEHNVKGRTVGPSQTPPIQTAECKRKDL

Query:  IINPFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCME
           PFH DKAL+   + E A+++  NKGW+ +G F +KFE W  + H    V+PSYGGW++ R +PL  W +++F  IG+A GG++E   +   L    E
Subjt:  IINPFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCME

Query:  VVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVFFGEDDNREDLCLVDKNRIEDRKAKSQIQGVVIKTSHVS---
          IK+K NY GFIPA I +  ++    I QV+   + +    R   IHG FT EAA+ F   + N E     D   +   KA S + G      ++    
Subjt:  VVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVFFGEDDNREDLCLVDKNRIEDRKAKSQIQGVVIKTSHVS---

Query:  --------ISNNGEGSAAEREKGKSFVCRKRVIEDKVG
                ++ +G+  ++E+ K K+      VI  K G
Subjt:  --------ISNNGEGSAAEREKGKSFVCRKRVIEDKVG

A0A5A7TTA1 DUF4283 domain-containing protein | 4.9e-43 | 33.23
Query:  LVITPSTQKFFRKTSCENGFIWLQKVSNKRGVFVEKAKVASSGNRSNLIIPRGEDSEGWRAFLTMINDFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKV
        L+ TP T +FF +    +  +W+Q + N+RG   E  +V   G +  +++P G D  GW  F    ND  + K+    DKK    P + + +  KG+ K+
Subjt:  LVITPSTQKFFRKTSCENGFIWLQKVSNKRGVFVEKAKVASSGNRSNLIIPRGEDSEGWRAFLTMINDFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKV

Query:  DRPSSSNIVTFPRDQRNEHNVKGRTVGPSQTPPIQTAECK-----------------------------RKDLIIN--PFHPDKALLKCPDAEFARIVAH
         +   S+      D  +        V  S +     ++C                              +KD      PFH DKALL   D E A+++  
Subjt:  DRPSSSNIVTFPRDQRNEHNVKGRTVGPSQTPPIQTAECK-----------------------------RKDLIIN--PFHPDKALLKCPDAEFARIVAH

Query:  NKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGS
        N GW+ +G F +KFE W    H    V+PSYGGW RFR IPL  W ++TF  IGEAYGG+I+   + ++ +   E +IKVK NY GF+PA I I  E+G 
Subjt:  NKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGS

Query:  VAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF
          I Q VT    + L  R   IHG FT  AA  F
Subjt:  VAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF

A0A5D3E0Y8 DUF4283 domain-containing protein | 4.9e-43 | 33.23
Query:  LVITPSTQKFFRKTSCENGFIWLQKVSNKRGVFVEKAKVASSGNRSNLIIPRGEDSEGWRAFLTMINDFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKV
        L+ TP T +FF +    +  +W+Q + N+RG   E  +V   G +  +++P G D  GW  F    ND  + K+    DKK    P + + +  KG+ K+
Subjt:  LVITPSTQKFFRKTSCENGFIWLQKVSNKRGVFVEKAKVASSGNRSNLIIPRGEDSEGWRAFLTMINDFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKV

Query:  DRPSSSNIVTFPRDQRNEHNVKGRTVGPSQTPPIQTAECK-----------------------------RKDLIIN--PFHPDKALLKCPDAEFARIVAH
         +   S+      D  +        V  S +     ++C                              +KD      PFH DKALL   D E A+++  
Subjt:  DRPSSSNIVTFPRDQRNEHNVKGRTVGPSQTPPIQTAECK-----------------------------RKDLIIN--PFHPDKALLKCPDAEFARIVAH

Query:  NKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGS
        N GW+ +G F +KFE W    H    V+PSYGGW RFR IPL  W ++TF  IGEAYGG+I+   + ++ +   E +IKVK NY GF+PA I I  E+G 
Subjt:  NKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGS

Query:  VAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF
          I Q VT    + L  R   IHG FT  AA  F
Subjt:  VAIAQVVTFEDPQLLESRRVYIHGGFTSEAARVF

A0A6J1D6X4 uncharacterized protein LOC111018186 | 3.2e-42 | 39.03
Query:  SEGWRAFLTMINDFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKVDRPSSSNIVTFPRDQRNEHNVKGRTVGPSQTPPIQTAECKRKDLIINPFHPDKAL
        S  W++   M+    ++K +  ++++++   Q+S      G  +V R +    +   R  R+ H+   R +           E      IINPF  DKAL
Subjt:  SEGWRAFLTMINDFFSSKEEQHEDKKVVYNPQKSFADAVKGRHKVDRPSSSNIVTFPRDQRNEHNVKGRTVGPSQTPPIQTAECKRKDLIINPFHPDKAL

Query:  LKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCG
        +KCP  + A ++  NKGW   G  T+K E W+  LHGR  + PSYG WV+ RNIPL  W + TFKAIG A GG+I+ DD     + C +V IKVKSNYCG
Subjt:  LKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHGRINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCG

Query:  FIPAEIDIIQEDGSVAI-AQVVTFEDPQLLESRRVYIHGGFTSEAARVFFGEDDNREDLCLVDKNRIED
        FIPAEI  +  DG +   A+VV+FED + L  + V IHGGF+SEAAR F  +    +    +D+ R+E+
Subjt:  FIPAEIDIIQEDGSVAI-AQVVTFEDPQLLESRRVYIHGGFTSEAARVFFGEDDNREDLCLVDKNRIED

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGGGAGTGGGAAAGAAGTTAGGCATTGATTCAGTTGGTTTAGAGCTTGAGCAGTCTCAAGAGAGTGCGGACTCCATTGATGACCTGGTAATCACCCCCAGTACCCAAAA
ATTCTTTCGGAAGACTTCTTGTGAGAATGGATTCATCTGGCTTCAGAAAGTGTCGAACAAAAGAGGAGTGTTCGTGGAGAAAGCGAAAGTTGCCTCGTCCGGAAATAGAA
GCAACTTGATAATCCCCCGTGGGGAAGATTCAGAAGGATGGAGAGCTTTTCTTACGATGATAAATGATTTCTTCAGTAGCAAAGAAGAGCAGCACGAAGACAAAAAGGTT
GTTTACAATCCCCAAAAGTCCTTTGCAGATGCTGTCAAGGGGAGACACAAGGTGGATAGGCCAAGTTCGTCAAATATCGTTACCTTCCCAAGAGACCAGCGCAATGAGCA
CAACGTAAAGGGGCGAACAGTGGGCCCAAGCCAGACTCCTCCTATCCAAACGGCTGAATGTAAACGAAAGGATTTGATTATCAACCCTTTTCATCCAGACAAGGCCCTCT
TAAAGTGCCCAGACGCAGAGTTTGCTCGGATAGTGGCTCATAATAAAGGATGGTCAGTGTTAGGGAATTTCACTCTGAAGTTTGAATATTGGGATTATGAGTTGCACGGA
AGGATCAACGTGGTCCCATCTTATGGGGGATGGGTGAGATTTCGAAACATTCCTCTGCAGAACTGGTGTGTGGACACATTTAAAGCGATTGGGGAAGCCTATGGAGGGTA
TATTGAATGCGATGATAAATGCCTCTCATTAGTGGGTTGTATGGAAGTGGTTATTAAAGTTAAAAGTAATTATTGTGGCTTCATTCCAGCAGAAATTGATATTATTCAGG
AAGATGGTTCAGTTGCGATTGCTCAAGTAGTCACGTTTGAAGATCCTCAATTGCTGGAAAGTAGAAGAGTTTACATCCATGGCGGTTTTACCAGTGAAGCTGCTAGGGTT
TTCTTTGGGGAAGACGATAACAGAGAAGACCTTTGTCTAGTGGATAAAAACCGAATAGAGGATCGGAAGGCCAAAAGTCAGATACAAGGGGTTGTGATAAAGACTAGTCA
TGTCAGTATTAGTAACAATGGGGAAGGTTCAGCAGCTGAGAGAGAAAAAGGCAAAAGTTTTGTATGCAGAAAACGTGTGATAGAGGACAAAGTGGGGCCCAGTGATCAGA
TTACCAGAACCAATGAAAATTGGGACAATAATAAAAAAGCAGATGAAATGGGAGTGAATACCACGCGCGTGATAGGCTACCAATGGAAAGAAAAAGTCAAAGATAAAAAA
GGGGTCTCTTTTTCCCCCAAAGCAGAGTTCAAAACGTACGTAAAAAAAGGTGTTATAGGGTCCACGTCTAATGATAGAAAAAATCAGAAAGAAGGCCCGAGAGATGAAAT
ACTTGATGACCTGGAGAACGATGGCTCGTTCTTCAGCGAATCTAGCTGGGAGGACGAAATCCAGGGATTTGAAGGCCAAAAGGAGGCGTGGAACCAGGCTGAAGCTCTCG
AAGATATAGCAGCTATGTTTGAAGATGAAGAGAAAGATGAACAGAATCCTCCTCCAAAGGACAATCTGTCCATAGTTACAAGTATGCCTCAAGAAAGTATGATTGATGCT
CCTCCTCTTAGATCAGAAGACACAGGTAAGGGGGAGATTAATGATTTTGGGAGCCCGATGACGAGATCAGAGGAGCCAGTGGGGATTAATGACTTGAATCAAAGACTGGA
TATTCCAATGCTTAAGGAAACGGTTGAAATTTTATGGCAAAATGGACTTTGTATAAGGCCAATTCCTAAGAAGGTGGGAGGAGGCCGTAACAAAGGAAAAAACAGAAAGA
ACCCAATGAAGAGGGAGATAAAAGGCCTTATAAACTCTTGGGAAAAACCGATGGAGAATGCAAATTCTTCATCTCAGGGCCAAAATGTTGATCAATGA
mRNA sequence
ATGGGAGTGGGAAAGAAGTTAGGCATTGATTCAGTTGGTTTAGAGCTTGAGCAGTCTCAAGAGAGTGCGGACTCCATTGATGACCTGGTAATCACCCCCAGTACCCAAAA
ATTCTTTCGGAAGACTTCTTGTGAGAATGGATTCATCTGGCTTCAGAAAGTGTCGAACAAAAGAGGAGTGTTCGTGGAGAAAGCGAAAGTTGCCTCGTCCGGAAATAGAA
GCAACTTGATAATCCCCCGTGGGGAAGATTCAGAAGGATGGAGAGCTTTTCTTACGATGATAAATGATTTCTTCAGTAGCAAAGAAGAGCAGCACGAAGACAAAAAGGTT
GTTTACAATCCCCAAAAGTCCTTTGCAGATGCTGTCAAGGGGAGACACAAGGTGGATAGGCCAAGTTCGTCAAATATCGTTACCTTCCCAAGAGACCAGCGCAATGAGCA
CAACGTAAAGGGGCGAACAGTGGGCCCAAGCCAGACTCCTCCTATCCAAACGGCTGAATGTAAACGAAAGGATTTGATTATCAACCCTTTTCATCCAGACAAGGCCCTCT
TAAAGTGCCCAGACGCAGAGTTTGCTCGGATAGTGGCTCATAATAAAGGATGGTCAGTGTTAGGGAATTTCACTCTGAAGTTTGAATATTGGGATTATGAGTTGCACGGA
AGGATCAACGTGGTCCCATCTTATGGGGGATGGGTGAGATTTCGAAACATTCCTCTGCAGAACTGGTGTGTGGACACATTTAAAGCGATTGGGGAAGCCTATGGAGGGTA
TATTGAATGCGATGATAAATGCCTCTCATTAGTGGGTTGTATGGAAGTGGTTATTAAAGTTAAAAGTAATTATTGTGGCTTCATTCCAGCAGAAATTGATATTATTCAGG
AAGATGGTTCAGTTGCGATTGCTCAAGTAGTCACGTTTGAAGATCCTCAATTGCTGGAAAGTAGAAGAGTTTACATCCATGGCGGTTTTACCAGTGAAGCTGCTAGGGTT
TTCTTTGGGGAAGACGATAACAGAGAAGACCTTTGTCTAGTGGATAAAAACCGAATAGAGGATCGGAAGGCCAAAAGTCAGATACAAGGGGTTGTGATAAAGACTAGTCA
TGTCAGTATTAGTAACAATGGGGAAGGTTCAGCAGCTGAGAGAGAAAAAGGCAAAAGTTTTGTATGCAGAAAACGTGTGATAGAGGACAAAGTGGGGCCCAGTGATCAGA
TTACCAGAACCAATGAAAATTGGGACAATAATAAAAAAGCAGATGAAATGGGAGTGAATACCACGCGCGTGATAGGCTACCAATGGAAAGAAAAAGTCAAAGATAAAAAA
GGGGTCTCTTTTTCCCCCAAAGCAGAGTTCAAAACGTACGTAAAAAAAGGTGTTATAGGGTCCACGTCTAATGATAGAAAAAATCAGAAAGAAGGCCCGAGAGATGAAAT
ACTTGATGACCTGGAGAACGATGGCTCGTTCTTCAGCGAATCTAGCTGGGAGGACGAAATCCAGGGATTTGAAGGCCAAAAGGAGGCGTGGAACCAGGCTGAAGCTCTCG
AAGATATAGCAGCTATGTTTGAAGATGAAGAGAAAGATGAACAGAATCCTCCTCCAAAGGACAATCTGTCCATAGTTACAAGTATGCCTCAAGAAAGTATGATTGATGCT
CCTCCTCTTAGATCAGAAGACACAGGTAAGGGGGAGATTAATGATTTTGGGAGCCCGATGACGAGATCAGAGGAGCCAGTGGGGATTAATGACTTGAATCAAAGACTGGA
TATTCCAATGCTTAAGGAAACGGTTGAAATTTTATGGCAAAATGGACTTTGTATAAGGCCAATTCCTAAGAAGGTGGGAGGAGGCCGTAACAAAGGAAAAAACAGAAAGA
ACCCAATGAAGAGGGAGATAAAAGGCCTTATAAACTCTTGGGAAAAACCGATGGAGAATGCAAATTCTTCATCTCAGGGCCAAAATGTTGATCAATGA
Protein sequence
MGVGKKLGIDSVGLELEQSQESADSIDDLVITPSTQKFFRKTSCENGFIWLQKVSNKRGVFVEKAKVASSGNRSNLIIPRGEDSEGWRAFLTMINDFFSSKEEQHEDKKV
VYNPQKSFADAVKGRHKVDRPSSSNIVTFPRDQRNEHNVKGRTVGPSQTPPIQTAECKRKDLIINPFHPDKALLKCPDAEFARIVAHNKGWSVLGNFTLKFEYWDYELHG
RINVVPSYGGWVRFRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEVVIKVKSNYCGFIPAEIDIIQEDGSVAIAQVVTFEDPQLLESRRVYIHGGFTSEAARV
FFGEDDNREDLCLVDKNRIEDRKAKSQIQGVVIKTSHVSISNNGEGSAAEREKGKSFVCRKRVIEDKVGPSDQITRTNENWDNNKKADEMGVNTTRVIGYQWKEKVKDKK
GVSFSPKAEFKTYVKKGVIGSTSNDRKNQKEGPRDEILDDLENDGSFFSESSWEDEIQGFEGQKEAWNQAEALEDIAAMFEDEEKDEQNPPPKDNLSIVTSMPQESMIDA
PPLRSEDTGKGEINDFGSPMTRSEEPVGINDLNQRLDIPMLKETVEILWQNGLCIRPIPKKVGGGRNKGKNRKNPMKREIKGLINSWEKPMENANSSSQGQNVDQ
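
As a quick consistency check, the CDS sequence of Lag0018350 can be translated and compared with the protein sequence listed above. The sketch below is a minimal illustration and is not part of the database record: it assumes the standard genetic code, and the helper name `translate` is hypothetical. Only the first and last few codons of the CDS are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the gene is nuclear and uses the standard translation table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Hypothetical helper for illustration; whitespace and line breaks in the
    input are ignored, and a trailing partial codon is dropped.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nt and last 12 nt of the CDS sequence shown above.
cds_start = "ATGGGAGTGGGAAAGAAGTTAGGCATTGATTCA"
cds_end = "GTTGATCAATGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two translated fragments can be compared against the start and end of the protein sequence; pasting in the full CDS would extend the same check to the whole record.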