; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005291 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005291
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr6:13641084..13643057
RNA-Seq ExpressionLag0005291
SyntenyLag0005291
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]8.0e-5644.53Show/hide
Query:  MDPMSLVQDWSRLSLTSVEEEVSVAADRSAVERTGLLLGCCLIGKLQSHRFLAAEVMRKTFAPAWKV-THGLRVELLGRNLFIFRFDNEDDRRRVLKQGP
        M   +L+++W    LTS E++++V  D SA+E TG  L   LI KL S R ++  V++ T   AWK+      V+++G N+F+F F+   DR R+L+ GP
Subjt:  MDPMSLVQDWSRLSLTSVEEEVSVAADRSAVERTGLLLGCCLIGKLQSHRFLAAEVMRKTFAPAWKV-THGLRVELLGRNLFIFRFDNEDDRRRVLKQGP

Query:  WIFDKFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSW
        W FD+ L+I++ P+   KP D  F +VS WVHF+DL L   N+ MA RLGNAIG FEDV+        G+ LRVR+R D+  PL RG+K+ LD  MGG W
Subjt:  WIFDKFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSW

Query:  IPIQYEKLPEYCFFCGIIGHLHKDCGQLLSADRSHCSLMNYGDWLRF
        IPIQYE+LP++ + CG + H+ KDC        S    + YG WLRF
Subjt:  IPIQYEKLPEYCFFCGIIGHLHKDCGQLLSADRSHCSLMNYGDWLRF

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]7.2e-6547.77Show/hide
Query:  MDPMSLVQDWSRLSLTSVEEEVSVAADRSAVERTGLLLGCCLIGKLQSHRFLAAEVMRKTFAPAWKVTHGLRVELLGRNLFIFRFDNEDDRRRVLKQGPW
        MD  +L+ DW +  LTS E+E+++  D  AV+     L   L+GKL + R ++A+V+ +    AWKV H L VE +G+NLF+F F  E D  RV+K GPW
Subjt:  MDPMSLVQDWSRLSLTSVEEEVSVAADRSAVERTGLLLGCCLIGKLQSHRFLAAEVMRKTFAPAWKVTHGLRVELLGRNLFIFRFDNEDDRRRVLKQGPW

Query:  IFDKFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSWI
         FDK L++L+ P      S+  F+ V+FW+H +DLP+ W N+ MA RLGNAIG F DVD        GASLR+R+ +DI+ PLRRG+KI +D  MGG WI
Subjt:  IFDKFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSWI

Query:  PIQYEKLPEYCFFCGIIGHLHKDC-GQLLSADRSHCSLMNYGDWLRF
        PIQYE+LP++C+FCG+IGH   DC  + L+A     +   YG WLRF
Subjt:  PIQYEKLPEYCFFCGIIGHLHKDC-GQLLSADRSHCSLMNYGDWLRF

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]5.0e-5040.7Show/hide
Query:  MDPMSLVQDWSRLSLTSVEEEVSVAADRSAVERTGLLLGCCLIGKLQSHRFLAAEVMRKTFAPAWKV-THGLRVELLGRNLFIFRFDNEDDRRRVLKQGP
        M    L+++W    LTS EEE ++  D SA   TG  L   L+GKL   R +   VM+ T   AWK+  +   V+ LG NLF+F F    DR ++ K GP
Subjt:  MDPMSLVQDWSRLSLTSVEEEVSVAADRSAVERTGLLLGCCLIGKLQSHRFLAAEVMRKTFAPAWKV-THGLRVELLGRNLFIFRFDNEDDRRRVLKQGP

Query:  WIFDKFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSW
        W FD+ L+++  P+  + PS+  F+ +  WV F+DLPL    ++MA RLGNA+G FE+ D  +     G++LRVR+ +DIS PLRRG+K+ LD  +GG+W
Subjt:  WIFDKFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSW

Query:  IPIQYEKLPEYCFFCGIIGHLHKDCGQLLSADRSHCSLMNYGDWLRFDAKGVLLASIP
        IPIQYE+LP++C+ CG           L S+ + H     YG WLR+  +G +  ++P
Subjt:  IPIQYEKLPEYCFFCGIIGHLHKDCGQLLSADRSHCSLMNYGDWLRFDAKGVLLASIP

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis]1.6e-4837.01Show/hide
Query:  SLVQDWSRLSLTSVEEEVSVAADRSAVERTGLLLG---CCLIGKLQSHRFLAAEVMRKTFAPAWKVTHGLRVELLGRNLFIFRFDNEDDRRRVLKQGPWI
        SL+     LSLTS E+    A  R   E T L++G    CL+GKL + R    E M+ T    W+ T G++V ++G NLF+F F +  D+RRVL  GPW 
Subjt:  SLVQDWSRLSLTSVEEEVSVAADRSAVERTGLLLG---CCLIGKLQSHRFLAAEVMRKTFAPAWKVTHGLRVELLGRNLFIFRFDNEDDRRRVLKQGPWI

Query:  FDKFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSWIP
        FDK LL+L      ++PSD + + V FWVH  +LPL   N+++ E +GNA+G+F D+D  +G    G ++R+R+ +D+  PLRRG+K+ L  S    W+ 
Subjt:  FDKFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSWIP

Query:  IQYEKLPEYCFFCGIIGHLHKDC-GQLLSADRSHCSLMNYGDWLRFD---AKGVLLASIPAGGSATGLGIPHIQTRCPAGLASMVRFNRPAYGGLRLSPR
         +YE+LP YC+FCG +GH  ++C  +L SAD S    + YG WLR D   +KG    S  AG    G+G  H   +      +  + N+     LR SP 
Subjt:  IQYEKLPEYCFFCGIIGHLHKDC-GQLLSADRSHCSLMNYGDWLRFD---AKGVLLASIPAGGSATGLGIPHIQTRCPAGLASMVRFNRPAYGGLRLSPR

Query:  R-----DSGHVRVGAAWYSLSGNVRAMDKGKGKIVVENQV-GSKVSWGASSSME
        R     D    R   +  +L+  +   D+G    V E  + GS +S   S S E
Subjt:  R-----DSGHVRVGAAWYSLSGNVRAMDKGKGKIVVENQV-GSKVSWGASSSME

XP_028124075.1 uncharacterized protein LOC114321128 [Camellia sinensis]7.3e-4936.72Show/hide
Query:  SLVQDWSRLSLTSVEEEVSVAADRSAVERTGLLLG---CCLIGKLQSHRFLAAEVMRKTFAPAWKVTHGLRVELLGRNLFIFRFDNEDDRRRVLKQGPWI
        SLV     LSLTS E+    A  R   + T L++G    CL+GKL + R    E M+ T    W+ T G++V ++G NLF+F F +  D+RRVL  GPW 
Subjt:  SLVQDWSRLSLTSVEEEVSVAADRSAVERTGLLLG---CCLIGKLQSHRFLAAEVMRKTFAPAWKVTHGLRVELLGRNLFIFRFDNEDDRRRVLKQGPWI

Query:  FDKFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSWIP
        FDK LL+L      ++PSD + + V FWVH  +LPL   N+++ + +GNA+G+F D+D  +G    G ++R+R+ +D+  PLRRG+K+ L  S    W+ 
Subjt:  FDKFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSWIP

Query:  IQYEKLPEYCFFCGIIGHLHKDCGQLLS-ADRSHCSLMNYGDWLRFD---AKGVLLASIPAGGSATGLGIPHIQTRCPAGLASMVRFNRPAYGGLRLSPR
         +YE+LP YC+FCG +GH  ++C   LS AD +    + YG WLR D   +KG    S  AG    G+G  H   +      +  + N+     LR SP 
Subjt:  IQYEKLPEYCFFCGIIGHLHKDCGQLLS-ADRSHCSLMNYGDWLRFD---AKGVLLASIPAGGSATGLGIPHIQTRCPAGLASMVRFNRPAYGGLRLSPR

Query:  R-----DSGHVRVGAAWYSLSGNVRAMDKGKGKIVVENQV-GSKVSWGASSSME
        R     D    R   +  SL+  +     G+G  V E  + GS +S   SS  E
Subjt:  R-----DSGHVRVGAAWYSLSGNVRAMDKGKGKIVVENQV-GSKVSWGASSSME

TrEMBL top hitse value%identityAlignment
A0A2N9GWE9 Uncharacterized protein3.0e-4036.99Show/hide
Query:  LVQDWSRLSLTSVE-EEVSVAADRSAVERTGLLLGCCLIGKLQSHRFLAAEVMRKTFAPAW-KVTHGLRVELLGRNLFIFRFDNEDDRRRVLKQGPWIFD
        LV++W + SLT  E   V++AAD  A+E + ++   CL+GKL + ++   E ++ T    W  V  G+    +G NLF+F+F ++ +RRRV+   PW+FD
Subjt:  LVQDWSRLSLTSVE-EEVSVAADRSAVERTGLLLGCCLIGKLQSHRFLAAEVMRKTFAPAW-KVTHGLRVELLGRNLFIFRFDNEDDRRRVLKQGPWIFD

Query:  KFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSWIPIQ
          LL+L+        S  +FS   FWV FY +PL +  ++  E++G+  G+ E+VD        G  LRVRI +DI+ P+ RG +++  +S+G  W+  +
Subjt:  KFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSWIPIQ

Query:  YEKLPEYCFFCGIIGHLHKDCGQLLSADRSHCS----LMNYGDWLR
        YE+LP  CF CG+IGHL +DC   +S  +++ S    +  YG WLR
Subjt:  YEKLPEYCFFCGIIGHLHKDCGQLLSADRSHCS----LMNYGDWLR

A0A6J1BSZ1 uncharacterized protein LOC1110054813.9e-5644.53Show/hide
Query:  MDPMSLVQDWSRLSLTSVEEEVSVAADRSAVERTGLLLGCCLIGKLQSHRFLAAEVMRKTFAPAWKV-THGLRVELLGRNLFIFRFDNEDDRRRVLKQGP
        M   +L+++W    LTS E++++V  D SA+E TG  L   LI KL S R ++  V++ T   AWK+      V+++G N+F+F F+   DR R+L+ GP
Subjt:  MDPMSLVQDWSRLSLTSVEEEVSVAADRSAVERTGLLLGCCLIGKLQSHRFLAAEVMRKTFAPAWKV-THGLRVELLGRNLFIFRFDNEDDRRRVLKQGP

Query:  WIFDKFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSW
        W FD+ L+I++ P+   KP D  F +VS WVHF+DL L   N+ MA RLGNAIG FEDV+        G+ LRVR+R D+  PL RG+K+ LD  MGG W
Subjt:  WIFDKFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSW

Query:  IPIQYEKLPEYCFFCGIIGHLHKDCGQLLSADRSHCSLMNYGDWLRF
        IPIQYE+LP++ + CG + H+ KDC        S    + YG WLRF
Subjt:  IPIQYEKLPEYCFFCGIIGHLHKDCGQLLSADRSHCSLMNYGDWLRF

A0A6J1D765 uncharacterized protein LOC1110179025.1e-4035.17Show/hide
Query:  WSRLSLTSVEEEVSVAADRSAVERTGLLLGCCLIGKLQSHRFLAAEVMRKTFAPAWKVTHGLRVELLGRNLFIFRFDNEDDRRRVLKQGPWIFDKFLLIL
        W     T+ E E +V  DR     T   +  C++ KL + + ++AE +R      W+V +  R E LG N+++  F +  ++ RVL  GPW F+K LL+L
Subjt:  WSRLSLTSVEEEVSVAADRSAVERTGLLLGCCLIGKLQSHRFLAAEVMRKTFAPAWKVTHGLRVELLGRNLFIFRFDNEDDRRRVLKQGPWIFDKFLLIL

Query:  ELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSWIPIQYEKLPE
          P    +P D  F+  +FW+  +++P +  + EMA  LG  +G+ E+++G       G  +RVR+++D+S PLRRG+K+   D     W P++YEKLP+
Subjt:  ELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSWIPIQYEKLPE

Query:  YCFFCGIIGHLHKDCGQLLSADRSHCSLMNYGDWLR
        +C+ CG IGH  ++C Q      ++ S   YGDWLR
Subjt:  YCFFCGIIGHLHKDCGQLLSADRSHCSLMNYGDWLR

A0A6J1DU55 uncharacterized protein LOC1110231353.5e-6547.77Show/hide
Query:  MDPMSLVQDWSRLSLTSVEEEVSVAADRSAVERTGLLLGCCLIGKLQSHRFLAAEVMRKTFAPAWKVTHGLRVELLGRNLFIFRFDNEDDRRRVLKQGPW
        MD  +L+ DW +  LTS E+E+++  D  AV+     L   L+GKL + R ++A+V+ +    AWKV H L VE +G+NLF+F F  E D  RV+K GPW
Subjt:  MDPMSLVQDWSRLSLTSVEEEVSVAADRSAVERTGLLLGCCLIGKLQSHRFLAAEVMRKTFAPAWKVTHGLRVELLGRNLFIFRFDNEDDRRRVLKQGPW

Query:  IFDKFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSWI
         FDK L++L+ P      S+  F+ V+FW+H +DLP+ W N+ MA RLGNAIG F DVD        GASLR+R+ +DI+ PLRRG+KI +D  MGG WI
Subjt:  IFDKFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSWI

Query:  PIQYEKLPEYCFFCGIIGHLHKDC-GQLLSADRSHCSLMNYGDWLRF
        PIQYE+LP++C+FCG+IGH   DC  + L+A     +   YG WLRF
Subjt:  PIQYEKLPEYCFFCGIIGHLHKDC-GQLLSADRSHCSLMNYGDWLRF

A0A6J1DX30 uncharacterized protein LOC1110248742.4e-5040.7Show/hide
Query:  MDPMSLVQDWSRLSLTSVEEEVSVAADRSAVERTGLLLGCCLIGKLQSHRFLAAEVMRKTFAPAWKV-THGLRVELLGRNLFIFRFDNEDDRRRVLKQGP
        M    L+++W    LTS EEE ++  D SA   TG  L   L+GKL   R +   VM+ T   AWK+  +   V+ LG NLF+F F    DR ++ K GP
Subjt:  MDPMSLVQDWSRLSLTSVEEEVSVAADRSAVERTGLLLGCCLIGKLQSHRFLAAEVMRKTFAPAWKV-THGLRVELLGRNLFIFRFDNEDDRRRVLKQGP

Query:  WIFDKFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSW
        W FD+ L+++  P+  + PS+  F+ +  WV F+DLPL    ++MA RLGNA+G FE+ D  +     G++LRVR+ +DIS PLRRG+K+ LD  +GG+W
Subjt:  WIFDKFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSW

Query:  IPIQYEKLPEYCFFCGIIGHLHKDCGQLLSADRSHCSLMNYGDWLRFDAKGVLLASIP
        IPIQYE+LP++C+ CG           L S+ + H     YG WLR+  +G +  ++P
Subjt:  IPIQYEKLPEYCFFCGIIGHLHKDCGQLLSADRSHCSLMNYGDWLRFDAKGVLLASIP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G13450.1 unknown protein4.4e-1227.95Show/hide
Query:  AWKVTHGLRVELLGRNLFIFRFDNEDDRRRVLKQGPWIFDKFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNG
        AW +T+ +   +L      F F +E D   VL++ PW+++ + +  +     L  + +  +S+  WV    +PL +  +E A  + + +GE   +D  + 
Subjt:  AWKVTHGLRVELLGRNLFIFRFDNEDDRRRVLKQGPWIFDKFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNG

Query:  FHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSWIPIQYEKLPEYCFFCGIIGHLHKDC
             A +RVRIR  I+  LR  ++II  DS   + I  QYE+L   C  C  + H    C
Subjt:  FHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSWIPIQYEKLPEYCFFCGIIGHLHKDC

AT3G31430.1 unknown protein4.0e-1324.35Show/hide
Query:  DRSAVERTGLLLGCCLIGKLQSHRFLAAEVMRKTFAPAWKVTHGLRVELLGRNLFIFRFDNEDDRRRVLKQGPWIFDKFLLILELPIRGLKPSDYRFSSV
        + +  E   +L G  ++ + Q+ R + A + R      W  +  +   ++    F F F  E+    VL++GPW F+ ++++L+      +P    F  +
Subjt:  DRSAVERTGLLLGCCLIGKLQSHRFLAAEVMRKTFAPAWKVTHGLRVELLGRNLFIFRFDNEDDRRRVLKQGPWIFDKFLLILELPIRGLKPSDYRFSSV

Query:  SFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSWIPIQYEKLPEYCFFCGIIGH
         FWV    +P  + N+ + E +G A+G+  D D            RV +  DI+HPLR          +  + +  +YE+L  +C  CG++ H
Subjt:  SFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSWIPIQYEKLPEYCFFCGIIGH

AT3G42140.1 zinc ion binding;nucleic acid binding1.7e-0826.28Show/hide
Query:  ELLGRNLFI----FRFDNEDDRRRVLKQGPWIFDKFLLILELPIRGLK-PSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMG
        E++GR L I    F F +E+    +L++GPW F+ ++ +++   R  K  SD  F  + FW+    +PL +    +  R+  +IGE            MG
Subjt:  ELLGRNLFI----FRFDNEDDRRRVLKQGPWIFDKFLLILELPIRGLK-PSDYRFSSVSFWVHFYDLPLDWYNQEMAERLGNAIGEFEDVDGRNGFHHMG

Query:  ASLRVRIRVDISHPLRRGVKIILDDSMGGSWIPIQYEKLPEYCFFCGIIGHLHKDC
          L   +  D+                  S +  QYEKL  +C  CG++ H   +C
Subjt:  ASLRVRIRVDISHPLRRGVKIILDDSMGGSWIPIQYEKLPEYCFFCGIIGHLHKDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCGGAAATCTGGTAATTCCGATGATGTGGTTTCCGTTTTCATTCTTTTTCCATCCGTTTCCAAATCTGCCAGGCTGGGGCAGTGATTTCTTAGCCGTGAAGCGGCT
ATTTCAGGAGGTTCGAGGTGGGCATCGTTCATCAAATCTGGTGGTAATTCCTATCTCTCTATCTGCCTGCTTTTTTCCTCTTCATTGTTTTTTTTCGCTTTGGGGAGGTT
TTTTTCTGGGGACAATGGATCCTATGTCGCTTGTCCAAGATTGGTCGAGATTGAGTTTGACTTCGGTGGAGGAGGAAGTATCGGTGGCGGCTGACCGTTCAGCTGTGGAG
CGAACTGGGCTTTTGCTCGGGTGTTGCTTGATTGGGAAGCTCCAATCTCACCGCTTTCTAGCTGCGGAGGTGATGAGAAAGACTTTTGCTCCCGCTTGGAAAGTTACCCA
TGGCCTGAGGGTTGAATTGTTGGGCAGAAATCTCTTTATTTTCAGATTCGATAATGAGGATGATAGGAGGCGGGTATTGAAGCAGGGGCCCTGGATTTTTGATAAGTTCC
TTTTAATTTTGGAGCTTCCAATTCGTGGTCTCAAACCTTCAGATTATCGGTTTTCATCGGTTTCATTTTGGGTCCATTTCTATGACCTCCCTCTTGACTGGTATAATCAG
GAAATGGCGGAGAGGCTTGGAAATGCGATCGGTGAGTTCGAAGATGTCGACGGCAGGAATGGTTTCCATCACATGGGAGCAAGCTTGCGAGTTCGTATTCGGGTAGACAT
TTCGCATCCTCTTAGGCGAGGCGTTAAAATTATTCTTGATGATTCCATGGGAGGTAGTTGGATTCCGATTCAATATGAAAAATTACCGGAATATTGTTTCTTTTGTGGGA
TTATTGGTCATCTTCATAAAGATTGTGGGCAATTGTTATCAGCTGATAGATCGCATTGTTCGTTGATGAACTATGGTGATTGGCTTCGTTTTGATGCTAAAGGGGTCCTT
CTGGCAAGCATTCCGGCAGGCGGATCGGCGACGGGTCTTGGAATTCCTCATATTCAGACCCGTTGTCCGGCGGGTTTGGCATCTATGGTTCGGTTCAATCGTCCAGCGTA
TGGTGGTTTGCGGCTTAGTCCTCGTCGTGATTCTGGACATGTTCGTGTTGGTGCGGCTTGGTACTCGCTTTCTGGCAACGTGCGTGCGATGGACAAAGGGAAAGGGAAGA
TTGTTGTTGAGAATCAGGTTGGTTCGAAGGTCTCGTGGGGTGCATCTTCGTCGATGGAGGGTCGGCTATCATCTTACTCTGTTCCGGCGATTGGTACGGCGGCTGAAGCT
CCCTCTCTGGTCAAGGATGTGAACGAAGGCGTTACGTGGAGAGAGATTGTTACTAAAGGGAAATTAAAAGCCTTTAATGAGGCTTTTAAAGCTAAATTTCCAACTCCACC
GGTTGTCCCTGAATCTTCTCAAAACCGTTCAGAGGCTATTAATGGAGAAGTGAATGGGGCCAAGCTGTTTTCGGTTGAGGACTCTTTATTGGGAGAATTATGTGCTGCAA
AATTTGGGGAGGTGAATGAGAAGCCGTTGGATTGTTTTAGTGGTTTGCCAAGGAGTGGGATGAATGGGCCTTTAGCTGATAATGGGCCTTTTATGGATGAAGATCTTCCA
AGTGAAACTCCCATTGGGTCTGTTGAATATCACTCCAACAATTTCGAGTCCTTGATGGATCATATTAAGCCCATTGTTGATGCTTCTGAGAATGGGCTTTCGGTGAAGAA
GGAAGGGAAAAAACAAGGCCAAGGGTATCAATGGAAGAAACGTGCTCGTGCGGGGTTTGTTCCAGCCGGTCTGAATCTTTCAGTGCTCGAGGAATTTAACAAGCGGAAGA
ATGGACCTATTTTATTTTCGCCAGAAAATTTGAAGCGTCCGCGAATTGAGTCCAATGAGTGTAATCAGGCGGGGACTGCAGAGCAGCCCCGCCCGAAACCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCGGAAATCTGGTAATTCCGATGATGTGGTTTCCGTTTTCATTCTTTTTCCATCCGTTTCCAAATCTGCCAGGCTGGGGCAGTGATTTCTTAGCCGTGAAGCGGCT
ATTTCAGGAGGTTCGAGGTGGGCATCGTTCATCAAATCTGGTGGTAATTCCTATCTCTCTATCTGCCTGCTTTTTTCCTCTTCATTGTTTTTTTTCGCTTTGGGGAGGTT
TTTTTCTGGGGACAATGGATCCTATGTCGCTTGTCCAAGATTGGTCGAGATTGAGTTTGACTTCGGTGGAGGAGGAAGTATCGGTGGCGGCTGACCGTTCAGCTGTGGAG
CGAACTGGGCTTTTGCTCGGGTGTTGCTTGATTGGGAAGCTCCAATCTCACCGCTTTCTAGCTGCGGAGGTGATGAGAAAGACTTTTGCTCCCGCTTGGAAAGTTACCCA
TGGCCTGAGGGTTGAATTGTTGGGCAGAAATCTCTTTATTTTCAGATTCGATAATGAGGATGATAGGAGGCGGGTATTGAAGCAGGGGCCCTGGATTTTTGATAAGTTCC
TTTTAATTTTGGAGCTTCCAATTCGTGGTCTCAAACCTTCAGATTATCGGTTTTCATCGGTTTCATTTTGGGTCCATTTCTATGACCTCCCTCTTGACTGGTATAATCAG
GAAATGGCGGAGAGGCTTGGAAATGCGATCGGTGAGTTCGAAGATGTCGACGGCAGGAATGGTTTCCATCACATGGGAGCAAGCTTGCGAGTTCGTATTCGGGTAGACAT
TTCGCATCCTCTTAGGCGAGGCGTTAAAATTATTCTTGATGATTCCATGGGAGGTAGTTGGATTCCGATTCAATATGAAAAATTACCGGAATATTGTTTCTTTTGTGGGA
TTATTGGTCATCTTCATAAAGATTGTGGGCAATTGTTATCAGCTGATAGATCGCATTGTTCGTTGATGAACTATGGTGATTGGCTTCGTTTTGATGCTAAAGGGGTCCTT
CTGGCAAGCATTCCGGCAGGCGGATCGGCGACGGGTCTTGGAATTCCTCATATTCAGACCCGTTGTCCGGCGGGTTTGGCATCTATGGTTCGGTTCAATCGTCCAGCGTA
TGGTGGTTTGCGGCTTAGTCCTCGTCGTGATTCTGGACATGTTCGTGTTGGTGCGGCTTGGTACTCGCTTTCTGGCAACGTGCGTGCGATGGACAAAGGGAAAGGGAAGA
TTGTTGTTGAGAATCAGGTTGGTTCGAAGGTCTCGTGGGGTGCATCTTCGTCGATGGAGGGTCGGCTATCATCTTACTCTGTTCCGGCGATTGGTACGGCGGCTGAAGCT
CCCTCTCTGGTCAAGGATGTGAACGAAGGCGTTACGTGGAGAGAGATTGTTACTAAAGGGAAATTAAAAGCCTTTAATGAGGCTTTTAAAGCTAAATTTCCAACTCCACC
GGTTGTCCCTGAATCTTCTCAAAACCGTTCAGAGGCTATTAATGGAGAAGTGAATGGGGCCAAGCTGTTTTCGGTTGAGGACTCTTTATTGGGAGAATTATGTGCTGCAA
AATTTGGGGAGGTGAATGAGAAGCCGTTGGATTGTTTTAGTGGTTTGCCAAGGAGTGGGATGAATGGGCCTTTAGCTGATAATGGGCCTTTTATGGATGAAGATCTTCCA
AGTGAAACTCCCATTGGGTCTGTTGAATATCACTCCAACAATTTCGAGTCCTTGATGGATCATATTAAGCCCATTGTTGATGCTTCTGAGAATGGGCTTTCGGTGAAGAA
GGAAGGGAAAAAACAAGGCCAAGGGTATCAATGGAAGAAACGTGCTCGTGCGGGGTTTGTTCCAGCCGGTCTGAATCTTTCAGTGCTCGAGGAATTTAACAAGCGGAAGA
ATGGACCTATTTTATTTTCGCCAGAAAATTTGAAGCGTCCGCGAATTGAGTCCAATGAGTGTAATCAGGCGGGGACTGCAGAGCAGCCCCGCCCGAAACCATGA
Protein sequenceShow/hide protein sequence
MVGNLVIPMMWFPFSFFFHPFPNLPGWGSDFLAVKRLFQEVRGGHRSSNLVVIPISLSACFFPLHCFFSLWGGFFLGTMDPMSLVQDWSRLSLTSVEEEVSVAADRSAVE
RTGLLLGCCLIGKLQSHRFLAAEVMRKTFAPAWKVTHGLRVELLGRNLFIFRFDNEDDRRRVLKQGPWIFDKFLLILELPIRGLKPSDYRFSSVSFWVHFYDLPLDWYNQ
EMAERLGNAIGEFEDVDGRNGFHHMGASLRVRIRVDISHPLRRGVKIILDDSMGGSWIPIQYEKLPEYCFFCGIIGHLHKDCGQLLSADRSHCSLMNYGDWLRFDAKGVL
LASIPAGGSATGLGIPHIQTRCPAGLASMVRFNRPAYGGLRLSPRRDSGHVRVGAAWYSLSGNVRAMDKGKGKIVVENQVGSKVSWGASSSMEGRLSSYSVPAIGTAAEA
PSLVKDVNEGVTWREIVTKGKLKAFNEAFKAKFPTPPVVPESSQNRSEAINGEVNGAKLFSVEDSLLGELCAAKFGEVNEKPLDCFSGLPRSGMNGPLADNGPFMDEDLP
SETPIGSVEYHSNNFESLMDHIKPIVDASENGLSVKKEGKKQGQGYQWKKRARAGFVPAGLNLSVLEEFNKRKNGPILFSPENLKRPRIESNECNQAGTAEQPRPKP