; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005419 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005419
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDUF1985 domain-containing protein
Genome locationchr6:17337850..17342108
RNA-Seq ExpressionLag0005419
SyntenyLag0005419
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038883715.1 uncharacterized protein LOC120074618 isoform X1 [Benincasa hispida]2.4e-3231.07Show/hide
Query:  NTISTEETKQNTIDVVDE--MTLTQIVHTENTIEE--IHAESRNAIETTDTREQERDLEDESNKGKEVMHFDVAEKDAQVFSM-IEEAQKTDAVIAQSAE
        NT ST E   N+ D  +E  M  T++      IEE  I  E R+      TR++++       K +E   F+V E + +   +  ++  + +    Q+  
Subjt:  NTISTEETKQNTIDVVDE--MTLTQIVHTENTIEE--IHAESRNAIETTDTREQERDLEDESNKGKEVMHFDVAEKDAQVFSM-IEEAQKTDAVIAQSAE

Query:  SKAPKKNDGQRKIVKSEKFGRGEFF------RIERAHSYEGAPLLLPRDKWSNVQKLTVYGKYEVLRQIKDGLKKKHYEVF-------------------
             K DG  K  K  K G+            ++   Y+  PLLLPR  WS  Q++ +Y K +V+  IK+ L ++  + F                   
Subjt:  SKAPKKNDGQRKIVKSEKFGRGEFF------RIERAHSYEGAPLLLPRDKWSNVQKLTVYGKYEVLRQIKDGLKKKHYEVF-------------------

Query:  -------------------------------KIVVLYTGLNCGNLPSVDMTKL-SGRFLNKYFGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTN
                                       K   L TGLNCG LP +DM+K+  G+F  +YFG E+TIKR+ +  +F   D+ + KD VK++KLY L  
Subjt:  -------------------------------KIVVLYTGLNCGNLPSVDMTKL-SGRFLNKYFGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTN

Query:  FLLGKQLGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTTIDSIKRAIKNPEATVV
        F+LGKQ+ TG+  ++ +L+DD EQFD+YPWGRI+Y  TID +K+AIK+ +A+ +
Subjt:  FLLGKQLGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTTIDSIKRAIKNPEATVV

XP_038883717.1 uncharacterized protein LOC120074618 isoform X3 [Benincasa hispida]2.4e-3231.07Show/hide
Query:  NTISTEETKQNTIDVVDE--MTLTQIVHTENTIEE--IHAESRNAIETTDTREQERDLEDESNKGKEVMHFDVAEKDAQVFSM-IEEAQKTDAVIAQSAE
        NT ST E   N+ D  +E  M  T++      IEE  I  E R+      TR++++       K +E   F+V E + +   +  ++  + +    Q+  
Subjt:  NTISTEETKQNTIDVVDE--MTLTQIVHTENTIEE--IHAESRNAIETTDTREQERDLEDESNKGKEVMHFDVAEKDAQVFSM-IEEAQKTDAVIAQSAE

Query:  SKAPKKNDGQRKIVKSEKFGRGEFF------RIERAHSYEGAPLLLPRDKWSNVQKLTVYGKYEVLRQIKDGLKKKHYEVF-------------------
             K DG  K  K  K G+            ++   Y+  PLLLPR  WS  Q++ +Y K +V+  IK+ L ++  + F                   
Subjt:  SKAPKKNDGQRKIVKSEKFGRGEFF------RIERAHSYEGAPLLLPRDKWSNVQKLTVYGKYEVLRQIKDGLKKKHYEVF-------------------

Query:  -------------------------------KIVVLYTGLNCGNLPSVDMTKL-SGRFLNKYFGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTN
                                       K   L TGLNCG LP +DM+K+  G+F  +YFG E+TIKR+ +  +F   D+ + KD VK++KLY L  
Subjt:  -------------------------------KIVVLYTGLNCGNLPSVDMTKL-SGRFLNKYFGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTN

Query:  FLLGKQLGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTTIDSIKRAIKNPEATVV
        F+LGKQ+ TG+  ++ +L+DD EQFD+YPWGRI+Y  TID +K+AIK+ +A+ +
Subjt:  FLLGKQLGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTTIDSIKRAIKNPEATVV

XP_038883718.1 uncharacterized protein LOC120074618 isoform X4 [Benincasa hispida]1.9e-3231.52Show/hide
Query:  NTISTEETKQNTIDVVDE--MTLTQIVHTENTIEE--IHAESRNAIETTDTREQERDLEDESNKGKEVMHFDVAEKDAQVFSM-IEEAQKTDAVIAQSAE
        NT ST E   N+ D  +E  M  T++      IEE  I  E R+      TR++++       K +E   F+V E + +   +  ++  + +    Q+  
Subjt:  NTISTEETKQNTIDVVDE--MTLTQIVHTENTIEE--IHAESRNAIETTDTREQERDLEDESNKGKEVMHFDVAEKDAQVFSM-IEEAQKTDAVIAQSAE

Query:  SKAPKKNDGQRKIVKSEKFG-RGEFFRIERAHSYEGAPLLLPRDKWSNVQKLTVYGKYEVLRQIKDGLKKKHYEVF------------------------
             K DG  K  K  K G +         H  +  PLLLPR  WS  Q++ +Y K +V+  IK+ L ++  + F                        
Subjt:  SKAPKKNDGQRKIVKSEKFG-RGEFFRIERAHSYEGAPLLLPRDKWSNVQKLTVYGKYEVLRQIKDGLKKKHYEVF------------------------

Query:  --------------------------KIVVLYTGLNCGNLPSVDMTKL-SGRFLNKYFGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTNFLLGK
                                  K   L TGLNCG LP +DM+K+  G+F  +YFG E+TIKR+ +  +F   D+ + KD VK++KLY L  F+LGK
Subjt:  --------------------------KIVVLYTGLNCGNLPSVDMTKL-SGRFLNKYFGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTNFLLGK

Query:  QLGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTTIDSIKRAIKNPEATVV
        Q+ TG+  ++ +L+DD EQFD+YPWGRI+Y  TID +K+AIK+ +A+ +
Subjt:  QLGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTTIDSIKRAIKNPEATVV

XP_038883719.1 uncharacterized protein LOC120074618 isoform X5 [Benincasa hispida]2.4e-3231.32Show/hide
Query:  NTISTEETKQNTIDVVDE--MTLTQIVHTENTIEE--IHAESRNAIETTDTREQERDLEDESNKGKEVMHFDVAEKDAQVFSMIEEAQKTDAVIAQSAES
        NT ST E   N+ D  +E  M  T++      IEE  I  E R+      TR++++       K +E   F+V E + +   +  +      V  +  ++
Subjt:  NTISTEETKQNTIDVVDE--MTLTQIVHTENTIEE--IHAESRNAIETTDTREQERDLEDESNKGKEVMHFDVAEKDAQVFSMIEEAQKTDAVIAQSAES

Query:  KAPKKNDGQRKIVKSEKFG-RGEFFRIERAHSYEGAPLLLPRDKWSNVQKLTVYGKYEVLRQIKDGLKKKHYEVF-------------------------
         + +  DG  K  K  K G +         H  +  PLLLPR  WS  Q++ +Y K +V+  IK+ L ++  + F                         
Subjt:  KAPKKNDGQRKIVKSEKFG-RGEFFRIERAHSYEGAPLLLPRDKWSNVQKLTVYGKYEVLRQIKDGLKKKHYEVF-------------------------

Query:  -------------------------KIVVLYTGLNCGNLPSVDMTKL-SGRFLNKYFGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTNFLLGKQ
                                 K   L TGLNCG LP +DM+K+  G+F  +YFG E+TIKR+ +  +F   D+ + KD VK++KLY L  F+LGKQ
Subjt:  -------------------------KIVVLYTGLNCGNLPSVDMTKL-SGRFLNKYFGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTNFLLGKQ

Query:  LGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTTIDSIKRAIKNPEATVV
        + TG+  ++ +L+DD EQFD+YPWGRI+Y  TID +K+AIK+ +A+ +
Subjt:  LGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTTIDSIKRAIKNPEATVV

XP_038883720.1 uncharacterized protein LOC120074618 isoform X6 [Benincasa hispida]2.4e-3231.07Show/hide
Query:  NTISTEETKQNTIDVVDE--MTLTQIVHTENTIEE--IHAESRNAIETTDTREQERDLEDESNKGKEVMHFDVAEKDAQVFSM-IEEAQKTDAVIAQSAE
        NT ST E   N+ D  +E  M  T++      IEE  I  E R+      TR++++       K +E   F+V E + +   +  ++  + +    Q+  
Subjt:  NTISTEETKQNTIDVVDE--MTLTQIVHTENTIEE--IHAESRNAIETTDTREQERDLEDESNKGKEVMHFDVAEKDAQVFSM-IEEAQKTDAVIAQSAE

Query:  SKAPKKNDGQRKIVKSEKFGRGEFF------RIERAHSYEGAPLLLPRDKWSNVQKLTVYGKYEVLRQIKDGLKKKHYEVF-------------------
             K DG  K  K  K G+            ++   Y+  PLLLPR  WS  Q++ +Y K +V+  IK+ L ++  + F                   
Subjt:  SKAPKKNDGQRKIVKSEKFGRGEFF------RIERAHSYEGAPLLLPRDKWSNVQKLTVYGKYEVLRQIKDGLKKKHYEVF-------------------

Query:  -------------------------------KIVVLYTGLNCGNLPSVDMTKL-SGRFLNKYFGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTN
                                       K   L TGLNCG LP +DM+K+  G+F  +YFG E+TIKR+ +  +F   D+ + KD VK++KLY L  
Subjt:  -------------------------------KIVVLYTGLNCGNLPSVDMTKL-SGRFLNKYFGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTN

Query:  FLLGKQLGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTTIDSIKRAIKNPEATVV
        F+LGKQ+ TG+  ++ +L+DD EQFD+YPWGRI+Y  TID +K+AIK+ +A+ +
Subjt:  FLLGKQLGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTTIDSIKRAIKNPEATVV

TrEMBL top hitse value%identityAlignment
A0A0A0KI50 TF-B3 domain-containing protein5.5e-3029.18Show/hide
Query:  NTISTEETKQNTIDVVDEMTLTQIVHTENTIEEIHAESRNAIETTDTREQERDLEDESNKGK-EVMHFDVAEKDAQVFSMIEEAQKTDAVIAQSAESKAP
        N + T E   N  D + E  +    H  N  + I     N ++TT+ ++    +E    K K +   F+V E+  +  S I+  Q +   +      K  
Subjt:  NTISTEETKQNTIDVVDEMTLTQIVHTENTIEEIHAESRNAIETTDTREQERDLEDESNKGK-EVMHFDVAEKDAQVFSMIEEAQKTDAVIAQSAESKAP

Query:  KKNDGQRKIVKSEKFG-RGEFFRIERAHS--------YEGAPLLLPRDKWSNVQKLTVYGKYEVLRQIKDGLKKKHYEVF--------------------
        +++  Q +  K +K G RG+   I    S        ++  PLLLPR  W+  Q++ +Y K +V+  IK+ L ++  + F                    
Subjt:  KKNDGQRKIVKSEKFG-RGEFFRIERAHS--------YEGAPLLLPRDKWSNVQKLTVYGKYEVLRQIKDGLKKKHYEVF--------------------

Query:  ------------------------------KIVVLYTGLNCGNLPSVDMTKL-SGRFLNKYFGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTNF
                                      K   L TGLNCG LP++DM+K+  G+F  +YFG E+TI+R+ +  +F   D+ + KD VK++KLY L  F
Subjt:  ------------------------------KIVVLYTGLNCGNLPSVDMTKL-SGRFLNKYFGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTNF

Query:  LLGKQLGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTTIDSIKRAIKNPEATVV
        +LGKQ+ TG+  ++ +L+DD +QFD YPWGRI+Y  T+D +K++IK+ +A+ +
Subjt:  LLGKQLGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTTIDSIKRAIKNPEATVV

A0A1S3B1B6 uncharacterized protein LOC103484737 isoform X21.4e-3028.49Show/hide
Query:  NTISTEETKQNTIDVVDEMTLTQIVHTENTIEEIHAESRNAIETTDTREQERDLED-----ESNKGKEVMHFDVAEKDAQVFSMIEEAQKTDAVIAQSAE
        N + T E   N+ D ++E  +       N  + I     N ++TT+      D+E+     E      V      +K     S ++E  +  + I    +
Subjt:  NTISTEETKQNTIDVVDEMTLTQIVHTENTIEEIHAESRNAIETTDTREQERDLED-----ESNKGKEVMHFDVAEKDAQVFSMIEEAQKTDAVIAQSAE

Query:  SKAPKKNDGQRKIVKSEKFGRGEFFR-----------------IERAHSYEGAPLLLPRDKWSNVQKLTVYGKYEVLRQIKDGLKKKHYEVF--------
        S+   +   ++KI +  K G G+  +                  +    Y+  PLLLPR  WS  Q++ +Y K +V+  IK+ L ++  + F        
Subjt:  SKAPKKNDGQRKIVKSEKFGRGEFFR-----------------IERAHSYEGAPLLLPRDKWSNVQKLTVYGKYEVLRQIKDGLKKKHYEVF--------

Query:  ------------------------------------------KIVVLYTGLNCGNLPSVDMTKL-SGRFLNKYFGKERTIKRSDISNLFHNKDRVKKKDQ
                                                  K   L TGLNCG LP++DM+K+  G+F  +YFG E+TI+R+ +  +F   D+ + KD 
Subjt:  ------------------------------------------KIVVLYTGLNCGNLPSVDMTKL-SGRFLNKYFGKERTIKRSDISNLFHNKDRVKKKDQ

Query:  VKLSKLYFLTNFLLGKQLGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTTIDSIKRAIKNPEATVV
        VK++KLY L  F+LGKQL TG+  ++ +L+DD EQFD YPWGRI+Y  TID +K+AIK+ +A+ +
Subjt:  VKLSKLYFLTNFLLGKQLGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTTIDSIKRAIKNPEATVV

A0A5A7UZA2 DUF1985 domain-containing protein7.1e-3056.64Show/hide
Query:  TGLNCGNLPSVDMTKLSGRFLNKYFGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTNFLLGKQLGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTT
        TGLNC  LP VD +K+ G+FL+KYF  E  I RS +S LF+   ++K++D++K++K+YFL NFLLGKQ  TG + +HI LLDD++ FD YPWGRI Y   
Subjt:  TGLNCGNLPSVDMTKLSGRFLNKYFGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTNFLLGKQLGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTT

Query:  IDSIKRAIKNPEA
        IDSIK++IKNP A
Subjt:  IDSIKRAIKNPEA

A0A5D3CNI7 TF-B3 domain-containing protein1.2e-2928.16Show/hide
Query:  NTISTEETKQNTIDVVDEMTLTQIVHTENTIEEIHAESRNAIETTDTREQ-----ERDLEDESNKGKEVMHFDVAEKDAQVFSMIEEAQKTDAVIAQSAE
        N + T E   N+ D ++E  + Q   +  +++    + +    +++ +EQ     E D + +S +G E        + ++      + +K      +S  
Subjt:  NTISTEETKQNTIDVVDEMTLTQIVHTENTIEEIHAESRNAIETTDTREQ-----ERDLEDESNKGKEVMHFDVAEKDAQVFSMIEEAQKTDAVIAQSAE

Query:  SKAPKKNDGQRKIVKSEKFGRGEFFRIERAHSYEGAPLLLPRDKWSNVQKLTVYGKYEVLRQIKDGLKKKHYEVF-------------------------
        S +P ++D +  +                   Y+  PLLLPR  WS  Q++ +Y K +V+  IK+ L ++  + F                         
Subjt:  SKAPKKNDGQRKIVKSEKFGRGEFFRIERAHSYEGAPLLLPRDKWSNVQKLTVYGKYEVLRQIKDGLKKKHYEVF-------------------------

Query:  -------------------------KIVVLYTGLNCGNLPSVDMTKL-SGRFLNKYFGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTNFLLGKQ
                                 K   L TGLNCG LP++DM+K+  G+F  +YFG E+TI+R+ +  +F   D+ + KD VK++KLY L  F+LGKQ
Subjt:  -------------------------KIVVLYTGLNCGNLPSVDMTKL-SGRFLNKYFGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTNFLLGKQ

Query:  LGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTTIDSIKRAIKNPEATVV
        L TG+  ++ +L+DD EQFD YPWGRI+Y  TID +K+AIK+ +A+ +
Subjt:  LGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTTIDSIKRAIKNPEATVV

A0A6J1BX50 uncharacterized protein LOC1110055249.3e-3030.06Show/hide
Query:  ENTIEEIHAESRNAIETTDTREQERDLEDESNKGKEVMHFDVAEKDAQVFSMIEEAQKTDAVIAQSAESKAPKKNDGQRKIVKSEKFGRGEFFRIERAHS
        EN +E+   +S     TT  R+++   E ++   +E    D  +   + +    + +K    +++  E +  K+    R+   SE          E+A  
Subjt:  ENTIEEIHAESRNAIETTDTREQERDLEDESNKGKEVMHFDVAEKDAQVFSMIEEAQKTDAVIAQSAESKAPKKNDGQRKIVKSEKFGRGEFFRIERAHS

Query:  YEGAPLLLPRDKWSNVQKLTVYGKYEVLRQIKDGLKKKHYEV-----------FKIV---------------------------------------VLYT
        Y+  PLL+P+ +WS  Q++ +Y K +V+  IK+ L ++  ++           FKI                                         L T
Subjt:  YEGAPLLLPRDKWSNVQKLTVYGKYEVLRQIKDGLKKKHYEV-----------FKIV---------------------------------------VLYT

Query:  GLNCGNLPSVDMTKLSGRFLNK-YFGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTNFLLGKQLGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTT
        G+NCG LP++DM+K+   + +K YFG ERTIKR+ +  +F   D+ + KD VK++KLY L  FLLGKQ+ TG+  ++ +L+DDDEQF+ YPWGR++Y  T
Subjt:  GLNCGNLPSVDMTKLSGRFLNK-YFGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTNFLLGKQLGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTT

Query:  IDSIKRAIKNPEATVV
        ID +K+AIK+ +A+ +
Subjt:  IDSIKRAIKNPEATVV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G31150.1 Domain of unknown function (DUF1985)2.1e-0526.55Show/hide
Query:  VKSEKFGRGEFFRIER-AHSYEGAPLLLPRDKWSNV--QKLTVYGKYEVLRQIKDGLKKKHYEVFKIVVLYTGLNCGNLPSVDMTK--LSGRFL---NKY
        +KS +FG+   F + R +HS +    LL R   +    +   V+G + +   I++         F IV   TGL CG LP+ D  K     ++L   N+ 
Subjt:  VKSEKFGRGEFFRIER-AHSYEGAPLLLPRDKWSNV--QKLTVYGKYEVLRQIKDGLKKKHYEVFKIVVLYTGLNCGNLPSVDMTK--LSGRFL---NKY

Query:  FGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTNFLLGKQLGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTTI
        FG++R +   D+  +   K ++    ++ L+ +  +   ++     + V +D + +L+D + F +YPWGR A+  TI
Subjt:  FGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTNFLLGKQLGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTTI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCTGATCTGAGGGCCGACAAGCTTGATCTCAGACGGAGGAAGGTTGAATCTTCAACTTGTATCTTGATGAAGAAACGTCGTTCGTCTCCTCCCTCGCGATTTTC
TTCTTCACGCCGGTCTCCCTCTCCATTTGCCGGCCATCGTCGCTCACTCCCTCTGTGTTTCGCCGTCCATCCTCCGACCAGCGCCACTCTGTTCCCGCCGTCACTCCCTC
TGTGTTTCGCCGTCTATCCTTCGACTGCCGTCACTTCAGCTGGGTTTCGTCGTCACTTACTCTCTTCCCGTTGTCACATCCTTCAAAAGCCGTCTCATCCCTCAAACGCT
ACCGACAGTTCTAAGGAAACAACAAGTTATCTATCAATGGATAATTTTGAACAAGCCGAAACAGTAGAAGTGGTATTAACTCAGTTAGTAAATACCATTTCAACTGAGGA
AACTAAGCAGAATACAATAGATGTAGTTGATGAAATGACGTTAACTCAAATTGTACACACTGAAAACACAATTGAGGAAATTCATGCGGAATCTCGAAATGCTATTGAGA
CGACAGATACAAGAGAACAAGAACGTGATCTGGAGGATGAATCTAACAAAGGAAAAGAAGTAATGCATTTTGATGTAGCTGAGAAAGATGCACAGGTATTTTCTATGATT
GAGGAAGCACAAAAGACAGATGCAGTCATTGCCCAGTCAGCTGAATCAAAGGCACCTAAAAAAAATGATGGTCAAAGGAAGATTGTAAAGTCTGAAAAGTTTGGGAGAGG
GGAATTCTTCAGGATCGAGAGGGCACATTCTTATGAGGGTGCACCTTTATTGTTGCCTAGAGATAAATGGTCGAATGTTCAAAAGTTAACTGTATATGGGAAGTATGAGG
TCCTTAGACAAATTAAAGATGGACTGAAGAAAAAACATTATGAGGTGTTCAAAATAGTTGTTTTGTATACTGGACTGAATTGTGGAAATCTACCATCTGTAGATATGACA
AAATTAAGTGGTAGATTTTTAAACAAGTACTTTGGTAAAGAGCGCACAATCAAACGATCAGACATTAGTAACCTCTTCCACAATAAGGATAGAGTGAAAAAGAAAGACCA
AGTCAAATTGTCCAAACTTTACTTCTTGACGAACTTCCTTCTTGGAAAACAGTTGGGAACAGGGGTAGAAATAGACCATATTATGTTGTTGGATGATGATGAGCAGTTTG
ATAAATACCCGTGGGGTCGTATAGCCTATACCACAACAATAGATTCTATCAAGAGAGCGATTAAGAATCCTGAAGCCACCGTTGTAGTTTGGCTGGCTTTCCATATGCAT
TACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCTGATCTGAGGGCCGACAAGCTTGATCTCAGACGGAGGAAGGTTGAATCTTCAACTTGTATCTTGATGAAGAAACGTCGTTCGTCTCCTCCCTCGCGATTTTC
TTCTTCACGCCGGTCTCCCTCTCCATTTGCCGGCCATCGTCGCTCACTCCCTCTGTGTTTCGCCGTCCATCCTCCGACCAGCGCCACTCTGTTCCCGCCGTCACTCCCTC
TGTGTTTCGCCGTCTATCCTTCGACTGCCGTCACTTCAGCTGGGTTTCGTCGTCACTTACTCTCTTCCCGTTGTCACATCCTTCAAAAGCCGTCTCATCCCTCAAACGCT
ACCGACAGTTCTAAGGAAACAACAAGTTATCTATCAATGGATAATTTTGAACAAGCCGAAACAGTAGAAGTGGTATTAACTCAGTTAGTAAATACCATTTCAACTGAGGA
AACTAAGCAGAATACAATAGATGTAGTTGATGAAATGACGTTAACTCAAATTGTACACACTGAAAACACAATTGAGGAAATTCATGCGGAATCTCGAAATGCTATTGAGA
CGACAGATACAAGAGAACAAGAACGTGATCTGGAGGATGAATCTAACAAAGGAAAAGAAGTAATGCATTTTGATGTAGCTGAGAAAGATGCACAGGTATTTTCTATGATT
GAGGAAGCACAAAAGACAGATGCAGTCATTGCCCAGTCAGCTGAATCAAAGGCACCTAAAAAAAATGATGGTCAAAGGAAGATTGTAAAGTCTGAAAAGTTTGGGAGAGG
GGAATTCTTCAGGATCGAGAGGGCACATTCTTATGAGGGTGCACCTTTATTGTTGCCTAGAGATAAATGGTCGAATGTTCAAAAGTTAACTGTATATGGGAAGTATGAGG
TCCTTAGACAAATTAAAGATGGACTGAAGAAAAAACATTATGAGGTGTTCAAAATAGTTGTTTTGTATACTGGACTGAATTGTGGAAATCTACCATCTGTAGATATGACA
AAATTAAGTGGTAGATTTTTAAACAAGTACTTTGGTAAAGAGCGCACAATCAAACGATCAGACATTAGTAACCTCTTCCACAATAAGGATAGAGTGAAAAAGAAAGACCA
AGTCAAATTGTCCAAACTTTACTTCTTGACGAACTTCCTTCTTGGAAAACAGTTGGGAACAGGGGTAGAAATAGACCATATTATGTTGTTGGATGATGATGAGCAGTTTG
ATAAATACCCGTGGGGTCGTATAGCCTATACCACAACAATAGATTCTATCAAGAGAGCGATTAAGAATCCTGAAGCCACCGTTGTAGTTTGGCTGGCTTTCCATATGCAT
TACTAG
Protein sequenceShow/hide protein sequence
MESDLRADKLDLRRRKVESSTCILMKKRRSSPPSRFSSSRRSPSPFAGHRRSLPLCFAVHPPTSATLFPPSLPLCFAVYPSTAVTSAGFRRHLLSSRCHILQKPSHPSNA
TDSSKETTSYLSMDNFEQAETVEVVLTQLVNTISTEETKQNTIDVVDEMTLTQIVHTENTIEEIHAESRNAIETTDTREQERDLEDESNKGKEVMHFDVAEKDAQVFSMI
EEAQKTDAVIAQSAESKAPKKNDGQRKIVKSEKFGRGEFFRIERAHSYEGAPLLLPRDKWSNVQKLTVYGKYEVLRQIKDGLKKKHYEVFKIVVLYTGLNCGNLPSVDMT
KLSGRFLNKYFGKERTIKRSDISNLFHNKDRVKKKDQVKLSKLYFLTNFLLGKQLGTGVEIDHIMLLDDDEQFDKYPWGRIAYTTTIDSIKRAIKNPEATVVVWLAFHMH
Y