; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032163 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032163
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr11:26393268..26394747
RNA-Seq ExpressionLag0032163
SyntenyLag0032163
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]2.9e-6247.76Show/hide
Query:  LLEGWKKLSLTVAEEEIAADVDAAAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVEA-GLSIEKMGKNVFLFSFDRAQDRFRVVESGPWYFDK
        LLE WK   LT  E++IA D+D++A+      L   L+ KLL+ R I+  V++ T+ IAW+++    S++ +G N+FLF+F+R+ DR R++  GPW FD+
Subjt:  LLEGWKKLSLTVAEEEIAADVDAAAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVEA-GLSIEKMGKNVFLFSFDRAQDRFRVVESGPWYFDK

Query:  FLLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITGPIGGRWIPMQY
         L++++   ++ +P  M F  V+ WVHFFDL L CMN  MA RLGNA+G FE+V+ N+   CWGS L VR+R D+ KPL RGIKLN+ GP+GG WIP+QY
Subjt:  FLLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITGPIGGRWIPMQY

Query:  ERLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLRFEG
        ERLP F  HCG  DH   DC        D   +  QYG WLRF+G
Subjt:  ERLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLRFEG

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]8.9e-6748.35Show/hide
Query:  LLEGWKKLSLTVAEEEIAADVDAAAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVEAGLSIEKMGKNVFLFSFDRAQDRFRVVESGPWYFDKF
        LL  W+K  LT  E+EIA DVD  AV      L   LVGKLLA R I+  V+ R + +AW+VE  L++E +GKN+FLF F R  D  RV+++GPW+FDK 
Subjt:  LLEGWKKLSLTVAEEEIAADVDAAAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVEAGLSIEKMGKNVFLFSFDRAQDRFRVVESGPWYFDKF

Query:  LLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITGPIGGRWIPMQYE
        L+VL+   +    S + FN+VAFW+H FDLP+  +N  MA RLGNA+G F +VDCN  G  WG+SL +R+ +DI KPLRRGIK+NI GP+GG WIP+QYE
Subjt:  LLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITGPIGGRWIPMQYE

Query:  RLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLRFEGSIKSLGKESIGQCSPEVDQGSSPGVSVVE
        RLP FC  CG+  H S DC        D      +YG WLRF GS     K   G+     D   S  ++  E
Subjt:  RLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLRFEGSIKSLGKESIGQCSPEVDQGSSPGVSVVE

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]9.2e-6443.41Show/hide
Query:  LLEGWKKLSLTVAEEEIAADVDAAAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVE-AGLSIEKMGKNVFLFSFDRAQDRFRVVESGPWYFDK
        LLE WK   LT  EEE A DVDA+A     S+L   LVGKL   RPI   VM+ TM  AW++E     ++ +G N+FLFSF RA DR ++ +SGPW FD+
Subjt:  LLEGWKKLSLTVAEEEIAADVDAAAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVE-AGLSIEKMGKNVFLFSFDRAQDRFRVVESGPWYFDK

Query:  FLLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITGPIGGRWIPMQY
         L+++    A++ PS + F K+  WV FFDLPLGC+   MA RLGNA+G FE  DC+ +   WGS+L VR+ +DI KPLRRGIKLN+ GPIGG WIP+QY
Subjt:  FLLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITGPIGGRWIPMQY

Query:  ERLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLRFEGSIKSLGKESIGQCSPEVDQGSSPGVSVVEVVSDPPRKQGDVSSSSTIGTDAVRVV
        ERLP FC HCGL    SS              ++ QYGSWLR++G++K           P + Q   P   +++       K G+ S SS+         
Subjt:  ERLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLRFEGSIKSLGKESIGQCSPEVDQGSSPGVSVVEVVSDPPRKQGDVSSSSTIGTDAVRVV

Query:  GENWSPVGGLEVDTSDVVVTQSGKTRGAGSSSRG
        G   +P  G      +  VT++ K +GA  S +G
Subjt:  GENWSPVGGLEVDTSDVVVTQSGKTRGAGSSSRG

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis]4.3e-4532.39Show/hide
Query:  MAVALLEGWKKLSLTVAEEEIAADVDAAAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVEAGLSIEKMGKNVFLFSFDRAQDRFRVVESGPWY
        MA +LL+  + LSLT +EE+    +   +   +  +   CLVGKLL  RP   + M+ T+   W+   G+ +  +G N+F+F F    D+ RV+  GPW 
Subjt:  MAVALLEGWKKLSLTVAEEEIAADVDAAAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVEAGLSIEKMGKNVFLFSFDRAQDRFRVVESGPWY

Query:  FDKFLLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITG--PIGGRW
        FDK LL+L  M   VQPS +   +V FWVH  +LPL  MN  + + +GNAVG+F ++D    G+ WG ++ +R+ +D++KPLRRG+KL ++   PI   W
Subjt:  FDKFLLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITG--PIGGRW

Query:  IPMQYERLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLRFEGSIKSLGKESIGQCSPEVDQGSSPGVSVVEVVSDPPRKQGD----------
        +  +YERLP +C  CG   H   +C   LS      V   QYG+WLR + +IKS G    G     V  G+ PG   + + ++  +   +          
Subjt:  IPMQYERLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLRFEGSIKSLGKESIGQCSPEVDQGSSPGVSVVEVVSDPPRKQGD----------

Query:  -VSSSSTIGTDAVRVVGENWSPV-GGLEVDTSDVVVTQSG--------------------KTRGAGSSSR--GLGLTSSVEVQAELGACHGLATMDG---
         V   S    D   + G+  +PV  G +  T+    T SG                    +  G G  S+   LGL  +       G    L +MD    
Subjt:  -VSSSSTIGTDAVRVVGENWSPV-GGLEVDTSDVVVTQSG--------------------KTRGAGSSSR--GLGLTSSVEVQAELGACHGLATMDG---

Query:  -PSESGLLKGIASSGKKWKRKAR
           E   L G     KKWKR AR
Subjt:  -PSESGLLKGIASSGKKWKRKAR

XP_028124075.1 uncharacterized protein LOC114321128 [Camellia sinensis]6.6e-4631.58Show/hide
Query:  MAVALLEGWKKLSLTVAEEEIAADVDAAAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVEAGLSIEKMGKNVFLFSFDRAQDRFRVVESGPWY
        MA +L++  + LSLT +EE+    +   +   +  +   CLVGKLL  RP   + M+ T+   W+   G+ +  +G N+F+F F    D+ RV+ +GPW 
Subjt:  MAVALLEGWKKLSLTVAEEEIAADVDAAAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVEAGLSIEKMGKNVFLFSFDRAQDRFRVVESGPWY

Query:  FDKFLLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITG--PIGGRW
        FDK LL+L  M   VQPS +    V FWVH  +LPL  MN  + + +GNAVG+F ++D    G+ WG ++ +R+ +D++KPLRRG+KL ++   PI   W
Subjt:  FDKFLLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITG--PIGGRW

Query:  IPMQYERLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLRFEGSIKSLGKESIGQCSPEVDQGSSPGVSVV----------------------
        +  +YERLP +C  CG   H   +C   LS      V   QYG+WLR + +IKS G    G     V  G+ PG   +                      
Subjt:  IPMQYERLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLRFEGSIKSLGKESIGQCSPEVDQGSSPGVSVV----------------------

Query:  --EVVSDPPRKQGDVSSSSTI--------GTDA--VRVVGENWSP--VGGLEVDTSDVVVTQSGKTRGAGSSSRGLGLTSSVEVQAELGACHGLATMDG-
           V +D  R + ++S  S          GT      + G N SP    G E      V+  +G    + +S  GL       + + +GA   L +MD  
Subjt:  --EVVSDPPRKQGDVSSSSTI--------GTDA--VRVVGENWSP--VGGLEVDTSDVVVTQSGKTRGAGSSSRGLGLTSSVEVQAELGACHGLATMDG-

Query:  ---PSESGLLKGIASSGKKWKRKAR--FGSVSQSVGVDDLKRKVVDEVSDQAG---------KKSKVVASVIACDDVVPSVVFPAEADVQPRRA
             E   L       KKWKR AR    S+     V   KR V++EV  Q G         K  K +      D    + +   EAD+QPRR+
Subjt:  ---PSESGLLKGIASSGKKWKRKAR--FGSVSQSVGVDDLKRKVVDEVSDQAG---------KKSKVVASVIACDDVVPSVVFPAEADVQPRRA

TrEMBL top hitse value%identityAlignment
A0A2N9FJK9 CCHC-type domain-containing protein3.8e-3937.8Show/hide
Query:  MAVALLEGWKKLSLTVAEEEIAADVDA-AAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVEAGLSIEKMGKNVFLFSFDRAQDRFRVVESGPW
        MA  L+E W++ SLT  E+E    + A  A+ D     ++CL+GKLL  +      ++ TM   W   +G+  + MG+N+FLF F    D  RV+   PW
Subjt:  MAVALLEGWKKLSLTVAEEEIAADVDA-AAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVEAGLSIEKMGKNVFLFSFDRAQDRFRVVESGPW

Query:  YFDKFLLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITGPIGGRWI
         FD  LLVL +       + + FN   FWV F  +PL  M     +RLG A+G  E VD ++ G+ WG  L VRI +D+ KPL+RG +L   G  G +WI
Subjt:  YFDKFLLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITGPIGGRWI

Query:  PMQYERLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLR
          +YERLP  C HCG   HG  +C + +     +  +   YG WLR
Subjt:  PMQYERLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLR

A0A2N9GF83 CCHC-type domain-containing protein3.8e-3937.96Show/hide
Query:  MAVALLEGWKKLSLTVAEEEIAADVDAAAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVEAGLSIEKMGKNVFLFSFDRAQDRFRVVESGPWY
        M   L+E W++ SLT  +E     +D  A+ +  +  ++CL+GKLL  +      ++ TM   W V  G+  + MG N+FLF F    D  RV +  PW 
Subjt:  MAVALLEGWKKLSLTVAEEEIAADVDAAAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVEAGLSIEKMGKNVFLFSFDRAQDRFRVVESGPWY

Query:  FDKFLLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITGPIGGRWIP
        FD  LLVL         + + FN   FWV    +PL  M     +R+G A+G  E VD +  G+ WG  L VRI VDI KP++RG +L   G  G  WI 
Subjt:  FDKFLLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITGPIGGRWIP

Query:  MQYERLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLR
         +YERLP FC HCG   HG  +C + L           QYG WLR
Subjt:  MQYERLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLR

A0A6J1BSZ1 uncharacterized protein LOC1110054811.4e-6247.76Show/hide
Query:  LLEGWKKLSLTVAEEEIAADVDAAAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVEA-GLSIEKMGKNVFLFSFDRAQDRFRVVESGPWYFDK
        LLE WK   LT  E++IA D+D++A+      L   L+ KLL+ R I+  V++ T+ IAW+++    S++ +G N+FLF+F+R+ DR R++  GPW FD+
Subjt:  LLEGWKKLSLTVAEEEIAADVDAAAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVEA-GLSIEKMGKNVFLFSFDRAQDRFRVVESGPWYFDK

Query:  FLLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITGPIGGRWIPMQY
         L++++   ++ +P  M F  V+ WVHFFDL L CMN  MA RLGNA+G FE+V+ N+   CWGS L VR+R D+ KPL RGIKLN+ GP+GG WIP+QY
Subjt:  FLLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITGPIGGRWIPMQY

Query:  ERLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLRFEG
        ERLP F  HCG  DH   DC        D   +  QYG WLRF+G
Subjt:  ERLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLRFEG

A0A6J1DU55 uncharacterized protein LOC1110231354.3e-6748.35Show/hide
Query:  LLEGWKKLSLTVAEEEIAADVDAAAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVEAGLSIEKMGKNVFLFSFDRAQDRFRVVESGPWYFDKF
        LL  W+K  LT  E+EIA DVD  AV      L   LVGKLLA R I+  V+ R + +AW+VE  L++E +GKN+FLF F R  D  RV+++GPW+FDK 
Subjt:  LLEGWKKLSLTVAEEEIAADVDAAAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVEAGLSIEKMGKNVFLFSFDRAQDRFRVVESGPWYFDKF

Query:  LLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITGPIGGRWIPMQYE
        L+VL+   +    S + FN+VAFW+H FDLP+  +N  MA RLGNA+G F +VDCN  G  WG+SL +R+ +DI KPLRRGIK+NI GP+GG WIP+QYE
Subjt:  LLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITGPIGGRWIPMQYE

Query:  RLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLRFEGSIKSLGKESIGQCSPEVDQGSSPGVSVVE
        RLP FC  CG+  H S DC        D      +YG WLRF GS     K   G+     D   S  ++  E
Subjt:  RLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLRFEGSIKSLGKESIGQCSPEVDQGSSPGVSVVE

A0A6J1DX30 uncharacterized protein LOC1110248744.4e-6443.41Show/hide
Query:  LLEGWKKLSLTVAEEEIAADVDAAAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVE-AGLSIEKMGKNVFLFSFDRAQDRFRVVESGPWYFDK
        LLE WK   LT  EEE A DVDA+A     S+L   LVGKL   RPI   VM+ TM  AW++E     ++ +G N+FLFSF RA DR ++ +SGPW FD+
Subjt:  LLEGWKKLSLTVAEEEIAADVDAAAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVE-AGLSIEKMGKNVFLFSFDRAQDRFRVVESGPWYFDK

Query:  FLLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITGPIGGRWIPMQY
         L+++    A++ PS + F K+  WV FFDLPLGC+   MA RLGNA+G FE  DC+ +   WGS+L VR+ +DI KPLRRGIKLN+ GPIGG WIP+QY
Subjt:  FLLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITGPIGGRWIPMQY

Query:  ERLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLRFEGSIKSLGKESIGQCSPEVDQGSSPGVSVVEVVSDPPRKQGDVSSSSTIGTDAVRVV
        ERLP FC HCGL    SS              ++ QYGSWLR++G++K           P + Q   P   +++       K G+ S SS+         
Subjt:  ERLPQFCGHCGLFDHGSSDCPMLLSPVTDVPVQRFQYGSWLRFEGSIKSLGKESIGQCSPEVDQGSSPGVSVVEVVSDPPRKQGDVSSSSTIGTDAVRVV

Query:  GENWSPVGGLEVDTSDVVVTQSGKTRGAGSSSRG
        G   +P  G      +  VT++ K +GA  S +G
Subjt:  GENWSPVGGLEVDTSDVVVTQSGKTRGAGSSSRG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42140.1 zinc ion binding;nucleic acid binding5.8e-0820.98Show/hide
Query:  FSFDRAQDRFRVVESGPWYFDKFLLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKP
        F F   +  F ++  GPW F+ ++ V++    +   +   F ++ FW+    +PL  +   +   +G  +G F                       ++  
Subjt:  FSFDRAQDRFRVVESGPWYFDKFLLVLELMGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKP

Query:  LRRGIKLNITGPIGGRWIPMQYERLPQFCGHCGLFDHGSSDCP
        L R + +          +  QYE+L  FC  CG+  H +S+CP
Subjt:  LRRGIKLNITGPIGGRWIPMQYERLPQFCGHCGLFDHGSSDCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGTGGCATTGTTGGAGGGTTGGAAGAAACTGAGTTTGACAGTGGCCGAGGAGGAGATCGCAGCTGATGTTGATGCGGCTGCAGTTTTTGATGTCGCATCTCAGTT
GACCAACTGTTTGGTGGGAAAGCTTCTTGCCCCGCGACCTATTGCTCCTAAAGTCATGCGACGAACAATGGCTATTGCATGGCGTGTTGAGGCTGGTCTGTCGATTGAGA
AAATGGGTAAGAATGTGTTCTTGTTTTCTTTTGACAGGGCACAGGACCGGTTTCGTGTTGTTGAGTCAGGTCCGTGGTATTTTGATAAGTTCCTGCTCGTCCTTGAGCTG
ATGGGTGCCATGGTTCAACCTTCGGCAATGTGCTTCAATAAAGTGGCATTTTGGGTGCATTTCTTTGATCTTCCTTTGGGATGTATGAATGTTGGAATGGCAAAACGGCT
GGGCAATGCGGTTGGTGAGTTTGAAAATGTGGATTGTAACTCGGTCGGACTGTGTTGGGGCTCAAGTTTGCATGTGCGGATTCGTGTGGATATTAAAAAGCCATTGCGTC
GTGGGATAAAGTTGAATATTACTGGTCCCATTGGCGGTCGCTGGATTCCGATGCAATATGAGAGGTTGCCTCAGTTTTGCGGCCACTGTGGGTTGTTTGATCATGGTTCC
AGCGACTGTCCTATGCTGTTGTCGCCAGTGACTGATGTTCCTGTGCAGCGTTTCCAGTATGGGAGTTGGCTGCGATTTGAAGGGTCGATCAAATCCTTGGGTAAGGAGTC
GATTGGGCAGTGTTCGCCAGAGGTGGATCAAGGAAGTTCGCCGGGGGTGTCGGTGGTTGAGGTCGTTTCAGACCCTCCGAGAAAGCAGGGGGACGTGTCGTCATCTTCGA
CTATAGGAACTGACGCAGTCAGGGTAGTTGGTGAGAATTGGTCGCCTGTTGGTGGGCTGGAGGTTGACACATCTGATGTTGTGGTGACCCAGTCTGGGAAGACACGCGGT
GCTGGGTCGAGTAGTCGTGGGCTCGGGTTGACTTCTTCTGTGGAAGTGCAAGCTGAATTGGGTGCATGCCATGGTCTGGCTACCATGGATGGGCCGAGCGAGTCTGGTTT
ATTGAAGGGAATTGCTTCTTCGGGTAAGAAATGGAAGCGTAAGGCAAGGTTTGGTTCAGTGTCCCAATCGGTTGGTGTGGATGATCTAAAGCGTAAGGTAGTGGATGAGG
TTAGTGACCAAGCTGGGAAGAAGAGCAAAGTGGTGGCTAGTGTTATTGCATGTGATGATGTTGTTCCAAGTGTTGTATTTCCGGCGGAGGCTGATGTTCAGCCCCGCCGA
GCATTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGTGGCATTGTTGGAGGGTTGGAAGAAACTGAGTTTGACAGTGGCCGAGGAGGAGATCGCAGCTGATGTTGATGCGGCTGCAGTTTTTGATGTCGCATCTCAGTT
GACCAACTGTTTGGTGGGAAAGCTTCTTGCCCCGCGACCTATTGCTCCTAAAGTCATGCGACGAACAATGGCTATTGCATGGCGTGTTGAGGCTGGTCTGTCGATTGAGA
AAATGGGTAAGAATGTGTTCTTGTTTTCTTTTGACAGGGCACAGGACCGGTTTCGTGTTGTTGAGTCAGGTCCGTGGTATTTTGATAAGTTCCTGCTCGTCCTTGAGCTG
ATGGGTGCCATGGTTCAACCTTCGGCAATGTGCTTCAATAAAGTGGCATTTTGGGTGCATTTCTTTGATCTTCCTTTGGGATGTATGAATGTTGGAATGGCAAAACGGCT
GGGCAATGCGGTTGGTGAGTTTGAAAATGTGGATTGTAACTCGGTCGGACTGTGTTGGGGCTCAAGTTTGCATGTGCGGATTCGTGTGGATATTAAAAAGCCATTGCGTC
GTGGGATAAAGTTGAATATTACTGGTCCCATTGGCGGTCGCTGGATTCCGATGCAATATGAGAGGTTGCCTCAGTTTTGCGGCCACTGTGGGTTGTTTGATCATGGTTCC
AGCGACTGTCCTATGCTGTTGTCGCCAGTGACTGATGTTCCTGTGCAGCGTTTCCAGTATGGGAGTTGGCTGCGATTTGAAGGGTCGATCAAATCCTTGGGTAAGGAGTC
GATTGGGCAGTGTTCGCCAGAGGTGGATCAAGGAAGTTCGCCGGGGGTGTCGGTGGTTGAGGTCGTTTCAGACCCTCCGAGAAAGCAGGGGGACGTGTCGTCATCTTCGA
CTATAGGAACTGACGCAGTCAGGGTAGTTGGTGAGAATTGGTCGCCTGTTGGTGGGCTGGAGGTTGACACATCTGATGTTGTGGTGACCCAGTCTGGGAAGACACGCGGT
GCTGGGTCGAGTAGTCGTGGGCTCGGGTTGACTTCTTCTGTGGAAGTGCAAGCTGAATTGGGTGCATGCCATGGTCTGGCTACCATGGATGGGCCGAGCGAGTCTGGTTT
ATTGAAGGGAATTGCTTCTTCGGGTAAGAAATGGAAGCGTAAGGCAAGGTTTGGTTCAGTGTCCCAATCGGTTGGTGTGGATGATCTAAAGCGTAAGGTAGTGGATGAGG
TTAGTGACCAAGCTGGGAAGAAGAGCAAAGTGGTGGCTAGTGTTATTGCATGTGATGATGTTGTTCCAAGTGTTGTATTTCCGGCGGAGGCTGATGTTCAGCCCCGCCGA
GCATTATGA
Protein sequenceShow/hide protein sequence
MAVALLEGWKKLSLTVAEEEIAADVDAAAVFDVASQLTNCLVGKLLAPRPIAPKVMRRTMAIAWRVEAGLSIEKMGKNVFLFSFDRAQDRFRVVESGPWYFDKFLLVLEL
MGAMVQPSAMCFNKVAFWVHFFDLPLGCMNVGMAKRLGNAVGEFENVDCNSVGLCWGSSLHVRIRVDIKKPLRRGIKLNITGPIGGRWIPMQYERLPQFCGHCGLFDHGS
SDCPMLLSPVTDVPVQRFQYGSWLRFEGSIKSLGKESIGQCSPEVDQGSSPGVSVVEVVSDPPRKQGDVSSSSTIGTDAVRVVGENWSPVGGLEVDTSDVVVTQSGKTRG
AGSSSRGLGLTSSVEVQAELGACHGLATMDGPSESGLLKGIASSGKKWKRKARFGSVSQSVGVDDLKRKVVDEVSDQAGKKSKVVASVIACDDVVPSVVFPAEADVQPRR
AL