; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025191 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025191
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr10:9625583..9627316
RNA-Seq ExpressionLag0025191
SyntenyLag0025191
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5458622.1 hypothetical protein F2P56_022639, partial [Juglans regia]4.7e-3633.72Show/hide
Query:  VIEKL-SALKLSHKESSGINTSDLGLKANQAKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWCFDGN
        V+ KL   LKL+ +E   +  S+  +  ++AK +HC +  V++ K  N  A R TM RVW          +G N FL+ L+  + + RI++   W FD N
Subjt:  VIEKL-SALKLSHKESSGINTSDLGLKANQAKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWCFDGN

Query:  LILLQPPTTSDLPEEMVFDDPTFWVQFHHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAIKYER
        LI LQ     +  +EM F    FWVQ H LP    N      LG  +G VL ++T+D   C G FLR+RV + + +PL RG  L  G   K  +  KYER
Subjt:  LILLQPPTTSDLPEEMVFDDPTFWVQFHHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAIKYER

Query:  LPDFCYYCGMIDHNDGSCPQI-KSQVKPE---KQFGSWMRVTTPPRRNQFNTSGRGFEGRR-GGRGRF-GNRQRNWQRHFDTEEEEREALQED-------
        LP+FCY+CG I H  G C Q+ K +   E   +Q+G+W+R  T P+    +    G + R+   +GR  G+ QR   R    E+E  E  +E        
Subjt:  LPDFCYYCGMIDHNDGSCPQI-KSQVKPE---KQFGSWMRVTTPPRRNQFNTSGRGFEGRR-GGRGRF-GNRQRNWQRHFDTEEEEREALQED-------

Query:  -SSASQSADDTDEHRQAEIN--RRPTKFSIDEAVPLQSKQMTEI
           A       D   +A+I+  +R T  S++  +    K++TEI
Subjt:  -SSASQSADDTDEHRQAEIN--RRPTKFSIDEAVPLQSKQMTEI

OMP03234.1 hypothetical protein COLO4_10561 [Corchorus olitorius]6.2e-3631.7Show/hide
Query:  EKLSAL----KLSHKESSGINTSDLGLKANQAKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWCFDG
        E LSAL     L+ +E+  +  +   +  + A+ + CL+GK++S++ +NID  R  +  VW    G+ +  IGE L++   +S   ++R+ ++  W F+ 
Subjt:  EKLSAL----KLSHKESSGINTSDLGLKANQAKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWCFDG

Query:  NLILLQPPTTSDLPEEMVFDDPTFWVQFHHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAIKYE
         L++L+        EE+  +   FW+Q H LPLG   + V   +G   G V+ I+T       G FLR+R  + +N+PL RG+ L      K+LV+ +YE
Subjt:  NLILLQPPTTSDLPEEMVFDDPTFWVQFHHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAIKYE

Query:  RLPDFCYYCGMIDHNDGSCPQI----KSQVKPEKQFGSWMRVTTPPRRNQFNTSGRGFEGRRGGR
        +LPDFCY CG ++H +  C +     +   K +K++G W+R    PR N     G G    R G+
Subjt:  RLPDFCYYCGMIDHNDGSCPQI----KSQVKPEKQFGSWMRVTTPPRRNQFNTSGRGFEGRRGGR

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]2.1e-3632.54Show/hide
Query:  MDVIEKLSALKLSHKESSGINTSDLG---LKANQAKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWC
        MD I  L     S  + +   T D G   L A+  KL  C+V K+ + K I+ +A R  M  VW        + +G N+++I  +S+S + R+L    W 
Subjt:  MDVIEKLSALKLSHKESSGINTSDLG---LKANQAKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWC

Query:  FDGNLILLQPPTTSDLPEEMVFDDPTFWVQFHHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAI
        F+ +L++L  PT ++ P +M F+   FW+Q H++P    +  +A+ LG +LG V  IE +  D   GPF+RVRVK+ +++PL RG+ L +     +   +
Subjt:  FDGNLILLQPPTTSDLPEEMVFDDPTFWVQFHHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAI

Query:  KYERLPDFCYYCGMIDHNDGSCPQIKSQV--KPEKQFGSWMRVT------TPPRRNQF---NTSGRGFE--GRRGGRGRFGNRQRNWQRHFDTEEEEREA
        +YE+LPDFCY CG I H+   C Q    V     +Q+G W+R T      + P    F      GRG +  G RGGRG +  R  NW+     E   R A
Subjt:  KYERLPDFCYYCGMIDHNDGSCPQIKSQV--KPEKQFGSWMRVT------TPPRRNQF---NTSGRGFE--GRRGGRGRFGNRQRNWQRHFDTEEEEREA

Query:  LQEDSSASQSADDTDEHRQAEINRRPTKFSIDEAV
        ++E         + +    AEI  R  K +    V
Subjt:  LQEDSSASQSADDTDEHRQAEINRRPTKFSIDEAV

XP_030942000.1 uncharacterized protein At4g02000-like [Quercus lobata]4.7e-3632.06Show/hide
Query:  DVIEKLSALKLSHKESSGINTSDLGLKANQAKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWCFDGN
        D+  + + L L  KE+S   T DL L        H L+ K+ +++ +N++A  RT+  +W       + ++G N  LI   + +   +I+ +  W FD  
Subjt:  DVIEKLSALKLSHKESSGINTSDLGLKANQAKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWCFDGN

Query:  LILLQPPTTSDLPEEMVFDDPTFWVQFHHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAIKYER
        LI L  P   +  ++  FD  +FWVQ H+LPL   N + A  +G  LGT+ +++ +   ECRG +LRVRV++ I +PLCRG  +N G  +   +A +YE+
Subjt:  LILLQPPTTSDLPEEMVFDDPTFWVQFHHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAIKYER

Query:  LPDFCYYCGMIDHNDGSC----PQIKSQVKPEKQFGSWMRV-------------TTPPRRNQ
        LP FCY+CG+++H++  C        +    ++Q+G+W+R              TT P+ NQ
Subjt:  LPDFCYYCGMIDHNDGSC----PQIKSQVKPEKQFGSWMRV-------------TTPPRRNQ

XP_040990949.1 uncharacterized protein LOC121238174 [Juglans microcarpa x Juglans regia]2.1e-3632.68Show/hide
Query:  DVIEKLSALKLSHKESSGINTSDLGLKANQAKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWCFDGN
        D+ E  +   L+ +E   I  +D  ++ N  + + CLVG+V++ K IN +AF+ TM +VW  C G  I  +G+NL++   Q+    +RI     W FD N
Subjt:  DVIEKLSALKLSHKESSGINTSDLGLKANQAKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWCFDGN

Query:  LILLQPPTTSDLPEEMVFDDPTFWVQFHHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAIKYER
        L+ L+       P E+ F+  T WVQ H+LPLG  N  +   +G  +G V+ +E +    C G ++RV+++M + +PL RG  + +    K  +  KYER
Subjt:  LILLQPPTTSDLPEEMVFDDPTFWVQFHHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAIKYER

Query:  LPDFCYYCGMIDHNDGSCPQIKSQVKPEK-----QFGSWMRVTTPPRRNQFNTS
        LP FC+ CG I H    C  + +  + +K     Q+GSW+R ++    + + +S
Subjt:  LPDFCYYCGMIDHNDGSCPQIKSQVKPEK-----QFGSWMRVTTPPRRNQFNTS

TrEMBL top hitse value%identityAlignment
A0A1R3GTB5 Reverse transcriptase9.6e-3530.82Show/hide
Query:  AKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWCFDGNLILLQPPTTSDLPEEMVFDDPTFWVQFHHL
        A+  +CL+GK++S++ +N++  R  M  VW    G+ +  IGENLF+   +S   ++R+ +++ W F+  L++L+     D  E++  D  +FW Q H L
Subjt:  AKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWCFDGNLILLQPPTTSDLPEEMVFDDPTFWVQFHHL

Query:  PLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAIKYERLPDFCYYCGMIDHNDGSCPQI----KSQVK
        PLG  N+ +   +G   GTV  I+T       G FLR R ++ + +PL RG+ L      K+L++ +YE+LPDFCY CG + H +  C +     + + K
Subjt:  PLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAIKYERLPDFCYYCGMIDHNDGSCPQI----KSQVK

Query:  PEKQFGSWMRVTTPPRRNQFNTSGRGFEGRRGGRGRFGNRQRNWQRHFDTEEEER-------EALQEDSSASQSADDTD
          K++G W+R    PR       G G    R GR        N +R  +  +  +       +A +++  +S   DDTD
Subjt:  PEKQFGSWMRVTTPPRRNQFNTSGRGFEGRRGGRGRFGNRQRNWQRHFDTEEEER-------EALQEDSSASQSADDTD

A0A1R3K847 Uncharacterized protein3.0e-3631.7Show/hide
Query:  EKLSAL----KLSHKESSGINTSDLGLKANQAKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWCFDG
        E LSAL     L+ +E+  +  +   +  + A+ + CL+GK++S++ +NID  R  +  VW    G+ +  IGE L++   +S   ++R+ ++  W F+ 
Subjt:  EKLSAL----KLSHKESSGINTSDLGLKANQAKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWCFDG

Query:  NLILLQPPTTSDLPEEMVFDDPTFWVQFHHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAIKYE
         L++L+        EE+  +   FW+Q H LPLG   + V   +G   G V+ I+T       G FLR+R  + +N+PL RG+ L      K+LV+ +YE
Subjt:  NLILLQPPTTSDLPEEMVFDDPTFWVQFHHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAIKYE

Query:  RLPDFCYYCGMIDHNDGSCPQI----KSQVKPEKQFGSWMRVTTPPRRNQFNTSGRGFEGRRGGR
        +LPDFCY CG ++H +  C +     +   K +K++G W+R    PR N     G G    R G+
Subjt:  RLPDFCYYCGMIDHNDGSCPQI----KSQVKPEKQFGSWMRVTTPPRRNQFNTSGRGFEGRRGGR

A0A5C7GPP6 CCHC-type domain-containing protein9.6e-3535.86Show/hide
Query:  AKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWCFDGNLILLQPPTTSDLPEEMVFDDPTFWVQFHHL
        A + HCL+GKV+S K +N +AF   + ++WS    + I+ I +N+F+        RD +     W FD +LI+L+ P  +    E+ FD    WVQ H+L
Subjt:  AKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWCFDGNLILLQPPTTSDLPEEMVFDDPTFWVQFHHL

Query:  PLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVA-IKYERLPDFCYYCGMIDHNDGSCPQIKSQVKP--
        PL   N   A +L   +G V+ I   D  ECRG FLRV+V++ +N+PL R + L    T++++VA + YERLP FCY CG + H    CP  ++++    
Subjt:  PLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVA-IKYERLPDFCYYCGMIDHNDGSCPQIKSQVKP--

Query:  --EKQFGSWMRVTTPPRRNQFNTSGRGFEGRRGGRGR
            QFG+W+RV  P    Q      G +    G G+
Subjt:  --EKQFGSWMRVTTPPRRNQFNTSGRGFEGRRGGRGR

A0A6J1D765 uncharacterized protein LOC1110179021.0e-3632.54Show/hide
Query:  MDVIEKLSALKLSHKESSGINTSDLG---LKANQAKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWC
        MD I  L     S  + +   T D G   L A+  KL  C+V K+ + K I+ +A R  M  VW        + +G N+++I  +S+S + R+L    W 
Subjt:  MDVIEKLSALKLSHKESSGINTSDLG---LKANQAKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWC

Query:  FDGNLILLQPPTTSDLPEEMVFDDPTFWVQFHHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAI
        F+ +L++L  PT ++ P +M F+   FW+Q H++P    +  +A+ LG +LG V  IE +  D   GPF+RVRVK+ +++PL RG+ L +     +   +
Subjt:  FDGNLILLQPPTTSDLPEEMVFDDPTFWVQFHHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAI

Query:  KYERLPDFCYYCGMIDHNDGSCPQIKSQV--KPEKQFGSWMRVT------TPPRRNQF---NTSGRGFE--GRRGGRGRFGNRQRNWQRHFDTEEEEREA
        +YE+LPDFCY CG I H+   C Q    V     +Q+G W+R T      + P    F      GRG +  G RGGRG +  R  NW+     E   R A
Subjt:  KYERLPDFCYYCGMIDHNDGSCPQIKSQV--KPEKQFGSWMRVT------TPPRRNQF---NTSGRGFE--GRRGGRGRFGNRQRNWQRHFDTEEEEREA

Query:  LQEDSSASQSADDTDEHRQAEINRRPTKFSIDEAV
        ++E         + +    AEI  R  K +    V
Subjt:  LQEDSSASQSADDTDEHRQAEINRRPTKFSIDEAV

A0A6J5TXP7 CCHC-type domain-containing protein6.7e-3632.57Show/hide
Query:  LKLSHKESSGINTSDLGLKANQAKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWCFDGNLILLQPPT
        L L++KE  G++           +L  CLVG V++ +  N +AF++TM R W     + + ++ +NLFL    ++  ++ ++    W FD  L+LL+ P 
Subjt:  LKLSHKESSGINTSDLGLKANQAKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWCFDGNLILLQPPT

Query:  TSDLPEEMVFDDPTFWVQFHHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAIKYERLPDFCYYC
         +  P  MV  +  FWVQ H+LPL          +G RLG  + +      EC G +L +RV++ + +PL R + L    + K ++  KYERLPDFCY C
Subjt:  TSDLPEEMVFDDPTFWVQFHHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAIKYERLPDFCYYC

Query:  GMIDHNDGSCPQIKSQVK--PEKQFGSWMRVTTPPRRNQFNTSGRGFEGRRGGRGRFGNRQ
        G I H    C  + +  K   EK +GSW+       R +    G+  EGR   R  FGN +
Subjt:  GMIDHNDGSCPQIKSQVK--PEKQFGSWMRVTTPPRRNQFNTSGRGFEGRRGGRGRFGNRQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G31430.1 unknown protein4.6e-1328.8Show/hide
Query:  AKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENL---FLINLQSVSRRDRILEESSWCFDGNLILLQPPTTSDLPEEMVFDDPTFWVQF
        A+ +  L G+ V  +  N+ +   +M R+W    G+    I E     F+  L+     + +L    W F+  +ILLQ       P+  +F    FWVQ 
Subjt:  AKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENL---FLINLQSVSRRDRILEESSWCFDGNLILLQPPTTSDLPEEMVFDDPTFWVQF

Query:  HHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAIKYERLPDFCYYCGMIDHNDGSC
          +P    N  V   +G  LG VL  + N     R  F RV +   I  PL              L+  +YERL  FC  CGM+ H+ G+C
Subjt:  HHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAIKYERLPDFCYYCGMIDHNDGSC

AT3G42140.1 zinc ion binding;nucleic acid binding1.6e-0525.69Show/hide
Query:  QSVSRRDRILEESSWCFDGNLILLQPPTTSDLPEEMVFDDPTFWVQFHHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCR
        QS      IL    W F+  + ++Q  T   L  +  F    FW+Q   +PL      +  ++G R+G  L +ETN                     L R
Subjt:  QSVSRRDRILEESSWCFDGNLILLQPPTTSDLPEEMVFDDPTFWVQFHHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCR

Query:  GLTLNDGPTDKLLVAIKYERLPDFCYYCGMIDHNDGSCPQIKSQ
                 D  ++  +YE+L +FC  CGM+ H+   CP   +Q
Subjt:  GLTLNDGPTDKLLVAIKYERLPDFCYYCGMIDHNDGSCPQIKSQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTTATAGAAAAACTCTCAGCGCTCAAGCTTTCACACAAAGAATCATCTGGAATCAACACTTCGGACCTTGGCCTGAAAGCAAATCAGGCTAAGCTCCAACATTG
CTTAGTAGGAAAGGTTGTTTCCCAAAAAGTTATCAACATTGACGCGTTTAGGCGAACTATGACACGAGTTTGGAGTTTTTGTTGCGGGATAACCATTGATAATATTGGAG
AAAACCTTTTTCTAATTAATCTGCAATCCGTTTCAAGAAGGGATAGAATTTTGGAAGAGAGCTCGTGGTGCTTCGATGGCAATCTGATACTTCTCCAACCACCAACAACT
TCTGATCTACCAGAAGAAATGGTCTTTGATGACCCAACTTTCTGGGTTCAATTCCATCACCTACCTCTAGGCCTAAGAAACGATATTGTGGCACACACCCTAGGGATCCG
ACTAGGGACAGTACTGCGAATTGAGACCAACGATATGGATGAATGCCGGGGTCCTTTTCTTCGAGTTCGGGTCAAGATGAAGATAAATGAACCACTATGTCGAGGCCTAA
CTCTAAATGACGGACCAACAGACAAGCTTTTGGTAGCAATCAAATATGAGAGATTACCAGATTTTTGCTACTACTGTGGGATGATCGACCATAATGATGGATCTTGCCCC
CAAATAAAATCACAAGTTAAACCAGAGAAGCAGTTCGGGTCTTGGATGAGAGTCACAACTCCGCCGAGACGAAACCAATTTAACACCAGTGGCAGAGGCTTCGAGGGACG
AAGAGGGGGCCGAGGTCGGTTTGGTAACAGACAGAGGAATTGGCAAAGGCATTTTGATACGGAGGAAGAGGAAAGGGAAGCGTTGCAAGAAGACTCATCAGCCTCTCAAA
GCGCCGACGACACCGATGAACATAGACAGGCTGAAATCAATCGTCGGCCGACCAAATTTTCCATTGATGAAGCGGTTCCACTGCAATCAAAGCAAATGACGGAAATCACG
GGTATTAATGACCCTCTTTCCTTTGATAACACAAATAATGTGGATATTTTACCTATTAATGCAGATAGAGCGAATCAAGGAATGGATGTTGGAGACATTAATGGAGATAA
CGACCAAAAGAAAGGAAAAACCAGCGAGCTAAATATTGAGAAGGGAAGAAACGCCAATGATATGCTAGGGCAAGGAGTGGATGATGGAACAATTTCTTACGATGGGCCTA
AATTAGAGAAAGTAATCTCTACAGCCCATTTACCATTTAACCCACTCCAGCCCAATGTCACCATGCCAACCAACGAAAGAATCAATATGATGAAGCCCAACCGAGATGTT
ATTTTTCAGAAACAAAGCCAAGCCCATTTGCATGATGCAGATTTGATGGATGAGGATTATTTTAAATTCACGCCACTCAAGCCCACACATCAAACCAGCCCAGAACAGCT
ACAACAGACATCTAAATTAGCCCAAAAAGTAGCTCAAACAAATGATGGAGAAGATTTTCAACCCACGGTGCCTAGGGACGGAAGAAACAAAAATCATGCAAAAGGGTGGA
AGAGATTAAACAGAAAAGACTGTGATCAAAAAAATCTGCCTACAAAACAAACCATGCAGTTTGCTCAAAATAAGAGGGTTGCAGACGATGACAGCTTGGAAATTCTCTCT
AGCAAGAAACTAAAAACTATAATTCCAACAACCGGTGAAATTCTATCGGCGGAGCCTGATCAGCAGGCCCGCCGAGAGCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGTTATAGAAAAACTCTCAGCGCTCAAGCTTTCACACAAAGAATCATCTGGAATCAACACTTCGGACCTTGGCCTGAAAGCAAATCAGGCTAAGCTCCAACATTG
CTTAGTAGGAAAGGTTGTTTCCCAAAAAGTTATCAACATTGACGCGTTTAGGCGAACTATGACACGAGTTTGGAGTTTTTGTTGCGGGATAACCATTGATAATATTGGAG
AAAACCTTTTTCTAATTAATCTGCAATCCGTTTCAAGAAGGGATAGAATTTTGGAAGAGAGCTCGTGGTGCTTCGATGGCAATCTGATACTTCTCCAACCACCAACAACT
TCTGATCTACCAGAAGAAATGGTCTTTGATGACCCAACTTTCTGGGTTCAATTCCATCACCTACCTCTAGGCCTAAGAAACGATATTGTGGCACACACCCTAGGGATCCG
ACTAGGGACAGTACTGCGAATTGAGACCAACGATATGGATGAATGCCGGGGTCCTTTTCTTCGAGTTCGGGTCAAGATGAAGATAAATGAACCACTATGTCGAGGCCTAA
CTCTAAATGACGGACCAACAGACAAGCTTTTGGTAGCAATCAAATATGAGAGATTACCAGATTTTTGCTACTACTGTGGGATGATCGACCATAATGATGGATCTTGCCCC
CAAATAAAATCACAAGTTAAACCAGAGAAGCAGTTCGGGTCTTGGATGAGAGTCACAACTCCGCCGAGACGAAACCAATTTAACACCAGTGGCAGAGGCTTCGAGGGACG
AAGAGGGGGCCGAGGTCGGTTTGGTAACAGACAGAGGAATTGGCAAAGGCATTTTGATACGGAGGAAGAGGAAAGGGAAGCGTTGCAAGAAGACTCATCAGCCTCTCAAA
GCGCCGACGACACCGATGAACATAGACAGGCTGAAATCAATCGTCGGCCGACCAAATTTTCCATTGATGAAGCGGTTCCACTGCAATCAAAGCAAATGACGGAAATCACG
GGTATTAATGACCCTCTTTCCTTTGATAACACAAATAATGTGGATATTTTACCTATTAATGCAGATAGAGCGAATCAAGGAATGGATGTTGGAGACATTAATGGAGATAA
CGACCAAAAGAAAGGAAAAACCAGCGAGCTAAATATTGAGAAGGGAAGAAACGCCAATGATATGCTAGGGCAAGGAGTGGATGATGGAACAATTTCTTACGATGGGCCTA
AATTAGAGAAAGTAATCTCTACAGCCCATTTACCATTTAACCCACTCCAGCCCAATGTCACCATGCCAACCAACGAAAGAATCAATATGATGAAGCCCAACCGAGATGTT
ATTTTTCAGAAACAAAGCCAAGCCCATTTGCATGATGCAGATTTGATGGATGAGGATTATTTTAAATTCACGCCACTCAAGCCCACACATCAAACCAGCCCAGAACAGCT
ACAACAGACATCTAAATTAGCCCAAAAAGTAGCTCAAACAAATGATGGAGAAGATTTTCAACCCACGGTGCCTAGGGACGGAAGAAACAAAAATCATGCAAAAGGGTGGA
AGAGATTAAACAGAAAAGACTGTGATCAAAAAAATCTGCCTACAAAACAAACCATGCAGTTTGCTCAAAATAAGAGGGTTGCAGACGATGACAGCTTGGAAATTCTCTCT
AGCAAGAAACTAAAAACTATAATTCCAACAACCGGTGAAATTCTATCGGCGGAGCCTGATCAGCAGGCCCGCCGAGAGCCATGA
Protein sequenceShow/hide protein sequence
MDVIEKLSALKLSHKESSGINTSDLGLKANQAKLQHCLVGKVVSQKVINIDAFRRTMTRVWSFCCGITIDNIGENLFLINLQSVSRRDRILEESSWCFDGNLILLQPPTT
SDLPEEMVFDDPTFWVQFHHLPLGLRNDIVAHTLGIRLGTVLRIETNDMDECRGPFLRVRVKMKINEPLCRGLTLNDGPTDKLLVAIKYERLPDFCYYCGMIDHNDGSCP
QIKSQVKPEKQFGSWMRVTTPPRRNQFNTSGRGFEGRRGGRGRFGNRQRNWQRHFDTEEEEREALQEDSSASQSADDTDEHRQAEINRRPTKFSIDEAVPLQSKQMTEIT
GINDPLSFDNTNNVDILPINADRANQGMDVGDINGDNDQKKGKTSELNIEKGRNANDMLGQGVDDGTISYDGPKLEKVISTAHLPFNPLQPNVTMPTNERINMMKPNRDV
IFQKQSQAHLHDADLMDEDYFKFTPLKPTHQTSPEQLQQTSKLAQKVAQTNDGEDFQPTVPRDGRNKNHAKGWKRLNRKDCDQKNLPTKQTMQFAQNKRVADDDSLEILS
SKKLKTIIPTTGEILSAEPDQQARREP