; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031425 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031425
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr11:8326223..8327982
RNA-Seq ExpressionLag0031425
SyntenyLag0031425
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]5.0e-6848.06Show/hide
Query:  MEETTILDDWKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRVGV-GLQVERIGQNVFLFSFSNVADRNRVFHTGP
        M  + +L++WK  +LTS+E+++AVD+DS A+  T   ++  L  KLLS R I   +L+ TL IAW++      V+ IG N+FLF+F+  +DRNR+   GP
Subjt:  MEETTILDDWKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRVGV-GLQVERIGQNVFLFSFSNVADRNRVFHTGP

Query:  WFFDKFLLVLEMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGGCW
        W FD+ L++++    L KP D+ F  V+ W+HF+DL +ACMN TMA +LGNA+G FEDV+ NA+  CWG+ LRVRVR D+ KPL RGIK+N+DGP+GGCW
Subjt:  WFFDKFLLVLEMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGGCW

Query:  IPIKFEKLPEFCSRCGLLGHGANDCSLVFSSADGGSVLKH-EYGSWLRYEGPPRNQGI
        IPI++E+LP+F   CG L H   DC    S     SV K+ +YG WLR++G   +  I
Subjt:  IPIKFEKLPEFCSRCGLLGHGANDCSLVFSSADGGSVLKH-EYGSWLRYEGPPRNQGI

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]3.1e-7850.89Show/hide
Query:  MEETTILDDWKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRVGVGLQVERIGQNVFLFSFSNVADRNRVFHTGPW
        M+   +L DW+K +LTS+E+E+A+DVD +A+      + + L GKLL+ R I +++L R L +AW+V   L VE IG+N+FLF F    D NRV  TGPW
Subjt:  MEETTILDDWKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRVGVGLQVERIGQNVFLFSFSNVADRNRVFHTGPW

Query:  FFDKFLLVLEMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGGCWI
        FFDK L+VL+        S+L F+ VAFW+H +DLPM+ +N TMA +LGNA+G F DVDCN  G  WGASLR+RV IDITKPLRRGIKIN+DGP+GGCWI
Subjt:  FFDKFLLVLEMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGGCWI

Query:  PIKFEKLPEFCSRCGLLGHGANDCSLVFSSADGGSVLKHEYGSWLRYEGPPRNQGIARLGKKVAVEDAITPQISSPSSEDR
        PI++E+LP+FC  CG++GH ++DC   + +A   S    EYG WLR+ G        R GK  A ED+     SS +S++R
Subjt:  PIKFEKLPEFCSRCGLLGHGANDCSLVFSSADGGSVLKHEYGSWLRYEGPPRNQGIARLGKKVAVEDAITPQISSPSSEDR

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]4.1e-6241.31Show/hide
Query:  ILDDWKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRV-GVGLQVERIGQNVFLFSFSNVADRNRVFHTGPWFFDK
        +L++WK  +LTS+EEE A+DVD+ A + T  R++  L GKL   R I   +++ T+  AW++     +V+ +G N+FLFSF+   DRN+++ +GPW FD+
Subjt:  ILDDWKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRV-GVGLQVERIGQNVFLFSFSNVADRNRVFHTGPWFFDK

Query:  FLLVLEMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGGCWIPIKF
         L+++     L  PS+L F  +  W+ F+DLP+ C+   MA +LGNA+G FE+ DC+     WG++LRVRV +DI+KPLRRGIK+N+DGPIGG WIPI++
Subjt:  FLLVLEMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGGCWIPIKF

Query:  EKLPEFCSRCGLLGHGANDCSLVFSSADGGSVLKHEYGSWLRYEGPPRNQGIARLGKKVAVEDAITPQISSPSSEDRSPAVGAGHDVVGEVPIIPAIVEP
        E+LP+FC  CGL                  S  KH+YGSWLRY+G  +         K   ED +    ++  S   SP VGAG   V   P    I  P
Subjt:  EKLPEFCSRCGLLGHGANDCSLVFSSADGGSVLKHEYGSWLRYEGPPRNQGIARLGKKVAVEDAITPQISSPSSEDRSPAVGAGHDVVGEVPIIPAIVEP

Query:  KQDSV
         +  V
Subjt:  KQDSV

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis]7.2e-5136.69Show/hide
Query:  TILDDWKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRVGVGLQVERIGQNVFLFSFSNVADRNRVFHTGPWFFDK
        ++LD  + L LTS EE+  V +  E+ S  +G+   CL GKLL+ R    E ++ TL   W+   G+QV  IG N+F+F F +V D+ RV   GPW FDK
Subjt:  TILDDWKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRVGVGLQVERIGQNVFLFSFSNVADRNRVFHTGPWFFDK

Query:  FLLVLEMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDG--PIGGCWIPI
         LL+L  M P  +PSD++   V FW+H  +LP+  MN  +   +GNAVG+F D+D    G+ WG ++R+RV +D+ KPLRRG+K+ +    PI   W+  
Subjt:  FLLVLEMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDG--PIGGCWIPI

Query:  KFEKLPEFCSRCGLLGHGANDCSLVFSSADGGSVLKHEYGSWLRYEGPPRNQGIARLGKKVAVEDAITPQISSPSSEDRSPAVGAGHDVVGEVPIIPAIV
        K+E+LP +C  CG LGH   +C    SSADG  V   +YG+WLR +   +++G  R G       +I   +       +   +   H  V +    P   
Subjt:  KFEKLPEFCSRCGLLGHGANDCSLVFSSADGGSVLKHEYGSWLRYEGPPRNQGIARLGKKVAVEDAITPQISSPSSEDRSPAVGAGHDVVGEVPIIPAIV

Query:  EPKQDSVKSVIGGD----VGAAVETVQEGTSVGPAVDE
         P +  V+S    D     G  +  V EG   G AV E
Subjt:  EPKQDSVKSVIGGD----VGAAVETVQEGTSVGPAVDE

XP_028124075.1 uncharacterized protein LOC114321128 [Camellia sinensis]1.2e-5041.09Show/hide
Query:  TILDDWKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRVGVGLQVERIGQNVFLFSFSNVADRNRVFHTGPWFFDK
        +++D  + L LTS EE+  V +  ++ S  +G+   CL GKLL+ R    E ++ TL   W+   G+QV  IG N+F+F F +V D+ RV   GPW FDK
Subjt:  TILDDWKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRVGVGLQVERIGQNVFLFSFSNVADRNRVFHTGPWFFDK

Query:  FLLVLEMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDG--PIGGCWIPI
         LL+L  M P  +PSD++   V FW+H  +LP+  MN  + + +GNAVG+F D+D    G+ WG ++R+RV ID+ KPLRRG+K+ +    PI   W+  
Subjt:  FLLVLEMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDG--PIGGCWIPI

Query:  KFEKLPEFCSRCGLLGHGANDCSLVFSSADGGSVLKHEYGSWLRYEGPPRNQGIARLG
        K+E+LP +C  CG LGH   +C    S ADG  V   +YG+WLR +   +++G  R G
Subjt:  KFEKLPEFCSRCGLLGHGANDCSLVFSSADGGSVLKHEYGSWLRYEGPPRNQGIARLG

TrEMBL top hitse value%identityAlignment
A0A1R3GTB5 Reverse transcriptase1.2e-4639.68Show/hide
Query:  ILDDWKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRVGVGLQVERIGQNVFLFSFSNVADRNRVFHTGPWFFDKF
        + D W+   LT +EE + V VD   +  T+     CL GKLLS R +  E++R  + + W++  GLQV  IG+N+F+F F +  ++ RV+   PW F+K 
Subjt:  ILDDWKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRVGVGLQVERIGQNVFLFSFSNVADRNRVFHTGPWFFDKF

Query:  LLVLEMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGG-CWIPIKF
        LLVL+     D   D++ D  +FW   +DLP+  MN ++ R +G + G  E++D   D + WG  LR R R+++TKPLRRG+ +    P GG   I  ++
Subjt:  LLVLEMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGG-CWIPIKF

Query:  EKLPEFCSRCGLLGHGANDC-SLVFSSADGGSVLKHEYGSWLRYEGP
        EKLP+FC  CG L H  N+C   V    D G ++K EYG WLR E P
Subjt:  EKLPEFCSRCGLLGHGANDC-SLVFSSADGGSVLKHEYGSWLRYEGP

A0A1R3K847 Uncharacterized protein1.6e-4334.69Show/hide
Query:  WKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRVGVGLQVERIGQNVFLFSFSNVADRNRVFHTGPWFFDKFLLVL
        W+   LT  EE + + V+   ++ ++   K CL GKLLS R +  +++R  L + W++  GLQV  IG+ +++F F +  ++ RV+  GPW F+K LLVL
Subjt:  WKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRVGVGLQVERIGQNVFLFSFSNVADRNRVFHTGPWFFDKFLLVL

Query:  EMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGG-CWIPIKFEKLP
        +         +++ ++ AFW+  +DLP+  M  ++ + +G++ G   ++D   D + WG  LR+R  +++ KPLRRG+ +    P GG   +  ++EKLP
Subjt:  EMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGG-CWIPIKFEKLP

Query:  EFCSRCGLLGHGANDC-SLVFSSADGGSVLKHEYGSWLRYEGPPRN------QGIA--RLGKKVAVEDAIT
        +FC  CG L H  N+C   V    D G V K EYG WLR E P  N       GI+  R GK+   +  +T
Subjt:  EFCSRCGLLGHGANDC-SLVFSSADGGSVLKHEYGSWLRYEGPPRN------QGIA--RLGKKVAVEDAIT

A0A6J1BSZ1 uncharacterized protein LOC1110054812.4e-6848.06Show/hide
Query:  MEETTILDDWKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRVGV-GLQVERIGQNVFLFSFSNVADRNRVFHTGP
        M  + +L++WK  +LTS+E+++AVD+DS A+  T   ++  L  KLLS R I   +L+ TL IAW++      V+ IG N+FLF+F+  +DRNR+   GP
Subjt:  MEETTILDDWKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRVGV-GLQVERIGQNVFLFSFSNVADRNRVFHTGP

Query:  WFFDKFLLVLEMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGGCW
        W FD+ L++++    L KP D+ F  V+ W+HF+DL +ACMN TMA +LGNA+G FEDV+ NA+  CWG+ LRVRVR D+ KPL RGIK+N+DGP+GGCW
Subjt:  WFFDKFLLVLEMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGGCW

Query:  IPIKFEKLPEFCSRCGLLGHGANDCSLVFSSADGGSVLKH-EYGSWLRYEGPPRNQGI
        IPI++E+LP+F   CG L H   DC    S     SV K+ +YG WLR++G   +  I
Subjt:  IPIKFEKLPEFCSRCGLLGHGANDCSLVFSSADGGSVLKH-EYGSWLRYEGPPRNQGI

A0A6J1DU55 uncharacterized protein LOC1110231351.5e-7850.89Show/hide
Query:  MEETTILDDWKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRVGVGLQVERIGQNVFLFSFSNVADRNRVFHTGPW
        M+   +L DW+K +LTS+E+E+A+DVD +A+      + + L GKLL+ R I +++L R L +AW+V   L VE IG+N+FLF F    D NRV  TGPW
Subjt:  MEETTILDDWKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRVGVGLQVERIGQNVFLFSFSNVADRNRVFHTGPW

Query:  FFDKFLLVLEMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGGCWI
        FFDK L+VL+        S+L F+ VAFW+H +DLPM+ +N TMA +LGNA+G F DVDCN  G  WGASLR+RV IDITKPLRRGIKIN+DGP+GGCWI
Subjt:  FFDKFLLVLEMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGGCWI

Query:  PIKFEKLPEFCSRCGLLGHGANDCSLVFSSADGGSVLKHEYGSWLRYEGPPRNQGIARLGKKVAVEDAITPQISSPSSEDR
        PI++E+LP+FC  CG++GH ++DC   + +A   S    EYG WLR+ G        R GK  A ED+     SS +S++R
Subjt:  PIKFEKLPEFCSRCGLLGHGANDCSLVFSSADGGSVLKHEYGSWLRYEGPPRNQGIARLGKKVAVEDAITPQISSPSSEDR

A0A6J1DX30 uncharacterized protein LOC1110248742.0e-6241.31Show/hide
Query:  ILDDWKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRV-GVGLQVERIGQNVFLFSFSNVADRNRVFHTGPWFFDK
        +L++WK  +LTS+EEE A+DVD+ A + T  R++  L GKL   R I   +++ T+  AW++     +V+ +G N+FLFSF+   DRN+++ +GPW FD+
Subjt:  ILDDWKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRV-GVGLQVERIGQNVFLFSFSNVADRNRVFHTGPWFFDK

Query:  FLLVLEMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGGCWIPIKF
         L+++     L  PS+L F  +  W+ F+DLP+ C+   MA +LGNA+G FE+ DC+     WG++LRVRV +DI+KPLRRGIK+N+DGPIGG WIPI++
Subjt:  FLLVLEMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGGCWIPIKF

Query:  EKLPEFCSRCGLLGHGANDCSLVFSSADGGSVLKHEYGSWLRYEGPPRNQGIARLGKKVAVEDAITPQISSPSSEDRSPAVGAGHDVVGEVPIIPAIVEP
        E+LP+FC  CGL                  S  KH+YGSWLRY+G  +         K   ED +    ++  S   SP VGAG   V   P    I  P
Subjt:  EKLPEFCSRCGLLGHGANDCSLVFSSADGGSVLKHEYGSWLRYEGPPRNQGIARLGKKVAVEDAITPQISSPSSEDRSPAVGAGHDVVGEVPIIPAIVEP

Query:  KQDSV
         +  V
Subjt:  KQDSV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G42140.1 zinc ion binding;nucleic acid binding5.5e-0921.33Show/hide
Query:  EEVAVDVDSEAISNTIGRIKFCLAGKLLS------PRQIGSEILRRTLSIAWRVGVGLQVERIGQNVFLFSFSNVADRNRVFHTGPWFFDKFLLVLEMMG
        +E+A+ V  EAI+  I      L    L+      P+ +  E++ R L I               +   F F +      +   GPW F+ ++ V++   
Subjt:  EEVAVDVDSEAISNTIGRIKFCLAGKLLS------PRQIGSEILRRTLSIAWRVGVGLQVERIGQNVFLFSFSNVADRNRVFHTGPWFFDKFLLVLEMMG

Query:  PLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGGCWIPIKFEKLPEFCSR
         L   SD  F  + FW+    +P+  + A +   +G  +G F + +   D             + + K                     ++EKL  FC+ 
Subjt:  PLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGGCWIPIKFEKLPEFCSR

Query:  CGLLGHGANDC
        CG+L H A++C
Subjt:  CGLLGHGANDC

AT5G36228.1 nucleic acid binding;zinc ion binding1.3e-1322.38Show/hide
Query:  EEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRVGVGLQVER--IGQNVFLFSFSNVADRNRVFHTGPWFFDKFLLVLEMMGPLDK
        EE  + +   A    +   +  L G++L+P+     + R  L + ++ G+G QV    +    F   F +  D        PW F+++ + L+     D 
Subjt:  EEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRVGVGLQVER--IGQNVFLFSFSNVADRNRVFHTGPWFFDKFLLVLEMMGPLDK

Query:  PSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGGCWIPIKFEKLPEFCSRCGLL
        P++    F+  W+H   +P+  ++      + + +G    +D N +       +RV+VR+D T+PLR   ++          I  ++EKL   C+ C  +
Subjt:  PSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRRGIKINVDGPIGGCWIPIKFEKLPEFCSRCGLL

Query:  GHGANDCSLV
         H  + C  V
Subjt:  GHGANDCSLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGTGGCGGCAGGCGACGGCAACGGCAAGGGCGGCGGCTGCAAGAGGCGATGGCTGCAAGAGGCGACGGCTAGTGTGGGTTGGACGGTTCGCTGGTGCGCTATGGA
GGAGACAACGATTCTTGATGACTGGAAGAAACTGAGGTTGACATCGGATGAAGAAGAAGTTGCGGTGGATGTGGACAGTGAGGCGATTTCTAACACGATCGGCCGCATTA
AGTTTTGCCTGGCAGGGAAGTTGCTTAGTCCTCGTCAGATTGGAAGTGAGATTCTACGTCGGACACTTTCCATTGCGTGGAGGGTGGGTGTGGGTCTCCAGGTTGAAAGA
ATTGGGCAGAATGTATTTCTGTTTTCTTTCTCTAATGTTGCAGATCGTAATCGAGTGTTTCACACGGGCCCATGGTTCTTTGATAAATTCTTGCTCGTGTTGGAGATGAT
GGGGCCGCTTGACAAGCCGTCTGATCTTCGGTTTGATTTTGTGGCTTTTTGGCTACACTTTTATGATTTACCTATGGCCTGTATGAATGCTACCATGGCTCGAAAACTGG
GTAATGCAGTTGGTCGTTTTGAGGATGTGGACTGTAATGCGGATGGACTTTGTTGGGGTGCTAGCTTGCGGGTTCGTGTGCGCATTGATATTACGAAACCTTTGCGGCGT
GGGATCAAGATTAATGTTGATGGTCCTATCGGTGGTTGTTGGATCCCAATTAAGTTTGAAAAGCTTCCGGAATTTTGTTCGCGTTGCGGTTTGTTGGGTCATGGTGCAAA
TGATTGCTCTCTGGTTTTCTCCTCGGCTGATGGTGGCTCTGTGTTGAAGCATGAGTATGGTTCTTGGCTGCGTTATGAGGGACCTCCAAGAAATCAAGGTATTGCACGCC
TGGGGAAGAAAGTCGCGGTTGAGGATGCTATAACTCCCCAGATTTCATCGCCGTCGTCGGAGGATCGGTCGCCGGCTGTAGGCGCTGGCCATGATGTAGTTGGCGAGGTC
CCTATAATCCCGGCGATCGTGGAGCCGAAGCAAGATAGTGTGAAATCGGTTATTGGGGGAGACGTGGGTGCTGCTGTTGAGACAGTTCAGGAGGGTACCTCTGTGGGGCC
TGCGGTTGATGAGCATTGTTTGGGTAGTGGGCCGAAGGCTGCGGTGATGGGGCGTGTGGTTCCGATTGGGCCGCACTATTTGATTGGGGCAACCACTAAATGGAAGCGCA
ATGCGCGTATGGGTCCCACGGTTTCTTCAAATGATGGTGACGTGGCTTCCAAGCGTAAGGGCCTTCCGTTGACTCCTAGTACTGTCTGCAAAAGGGCAAAAACGTCTGAT
GTTGCTTGTGTTGATGCTTGTGTTGATGATGGTACCGATCTTTCTGGTGTATCGGCGGTGGCTGAAGAGCAGCCCCGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGGTGGCGGCAGGCGACGGCAACGGCAAGGGCGGCGGCTGCAAGAGGCGATGGCTGCAAGAGGCGACGGCTAGTGTGGGTTGGACGGTTCGCTGGTGCGCTATGGA
GGAGACAACGATTCTTGATGACTGGAAGAAACTGAGGTTGACATCGGATGAAGAAGAAGTTGCGGTGGATGTGGACAGTGAGGCGATTTCTAACACGATCGGCCGCATTA
AGTTTTGCCTGGCAGGGAAGTTGCTTAGTCCTCGTCAGATTGGAAGTGAGATTCTACGTCGGACACTTTCCATTGCGTGGAGGGTGGGTGTGGGTCTCCAGGTTGAAAGA
ATTGGGCAGAATGTATTTCTGTTTTCTTTCTCTAATGTTGCAGATCGTAATCGAGTGTTTCACACGGGCCCATGGTTCTTTGATAAATTCTTGCTCGTGTTGGAGATGAT
GGGGCCGCTTGACAAGCCGTCTGATCTTCGGTTTGATTTTGTGGCTTTTTGGCTACACTTTTATGATTTACCTATGGCCTGTATGAATGCTACCATGGCTCGAAAACTGG
GTAATGCAGTTGGTCGTTTTGAGGATGTGGACTGTAATGCGGATGGACTTTGTTGGGGTGCTAGCTTGCGGGTTCGTGTGCGCATTGATATTACGAAACCTTTGCGGCGT
GGGATCAAGATTAATGTTGATGGTCCTATCGGTGGTTGTTGGATCCCAATTAAGTTTGAAAAGCTTCCGGAATTTTGTTCGCGTTGCGGTTTGTTGGGTCATGGTGCAAA
TGATTGCTCTCTGGTTTTCTCCTCGGCTGATGGTGGCTCTGTGTTGAAGCATGAGTATGGTTCTTGGCTGCGTTATGAGGGACCTCCAAGAAATCAAGGTATTGCACGCC
TGGGGAAGAAAGTCGCGGTTGAGGATGCTATAACTCCCCAGATTTCATCGCCGTCGTCGGAGGATCGGTCGCCGGCTGTAGGCGCTGGCCATGATGTAGTTGGCGAGGTC
CCTATAATCCCGGCGATCGTGGAGCCGAAGCAAGATAGTGTGAAATCGGTTATTGGGGGAGACGTGGGTGCTGCTGTTGAGACAGTTCAGGAGGGTACCTCTGTGGGGCC
TGCGGTTGATGAGCATTGTTTGGGTAGTGGGCCGAAGGCTGCGGTGATGGGGCGTGTGGTTCCGATTGGGCCGCACTATTTGATTGGGGCAACCACTAAATGGAAGCGCA
ATGCGCGTATGGGTCCCACGGTTTCTTCAAATGATGGTGACGTGGCTTCCAAGCGTAAGGGCCTTCCGTTGACTCCTAGTACTGTCTGCAAAAGGGCAAAAACGTCTGAT
GTTGCTTGTGTTGATGCTTGTGTTGATGATGGTACCGATCTTTCTGGTGTATCGGCGGTGGCTGAAGAGCAGCCCCGCTAG
Protein sequenceShow/hide protein sequence
MKVAAGDGNGKGGGCKRRWLQEATASVGWTVRWCAMEETTILDDWKKLRLTSDEEEVAVDVDSEAISNTIGRIKFCLAGKLLSPRQIGSEILRRTLSIAWRVGVGLQVER
IGQNVFLFSFSNVADRNRVFHTGPWFFDKFLLVLEMMGPLDKPSDLRFDFVAFWLHFYDLPMACMNATMARKLGNAVGRFEDVDCNADGLCWGASLRVRVRIDITKPLRR
GIKINVDGPIGGCWIPIKFEKLPEFCSRCGLLGHGANDCSLVFSSADGGSVLKHEYGSWLRYEGPPRNQGIARLGKKVAVEDAITPQISSPSSEDRSPAVGAGHDVVGEV
PIIPAIVEPKQDSVKSVIGGDVGAAVETVQEGTSVGPAVDEHCLGSGPKAAVMGRVVPIGPHYLIGATTKWKRNARMGPTVSSNDGDVASKRKGLPLTPSTVCKRAKTSD
VACVDACVDDGTDLSGVSAVAEEQPR