CuGenDBv2

Lag0034671 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0034671
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr3:9658111..9659643
RNA-Seq Expression: Lag0034671
Synteny: Lag0034671
Gene Ontology terms: NA
InterPro domains:
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology
GenBank top hits (e value, % identity, alignment)
TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense], e value 5.7e-33, identity 28.5%
Query:  DRSGQKLTTSILCKIHTKKKINPEMFSSKMPKIWSQEHTI-IDKVGFNIFICRFKNAKIKGWILESGPWFYDKAMLLFEEPKAGVSCEDMEFRYVSLWTH
        +R  Q L+ S++ K  T K IN E F S +  IW  ++ + ++ +G NIF  RF+N   +  ILE GPW +DK +L+  E        D++FRYV  W  
Subjt:  DRSGQKLTTSILCKIHTKKKINPEMFSSKMPKIWSQEHTI-IDKVGFNIFICRFKNAKIKGWILESGPWFYDKAMLLFEEPKAGVSCEDMEFRYVSLWTH

Query:  FHKLPYACFSRNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEKLPDFCYGCGCLGHTIKE----GED
         H LP AC +R     +G L+G+V+++D  +  +   G  +RI++ IDV  PLKRG+ +  G       + + YE+LP+FCY CG +GH +++     ++
Subjt:  FHKLPYACFSRNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEKLPDFCYGCGCLGHTIKE----GED

Query:  IMGTSDGELRYGAWLREPVKLWS-----TEGSPLGNNVYG--------RGRGSF-WGGGRGR------GRMAAFDSEYGENLDNQPPTQIQRTEEISLPA
        I  TS    ++G W+R   +  S      + SP G+   G        R +GS  W  G+        G      +E       +  T + RT  +S   
Subjt:  IMGTSDGELRYGAWLREPVKLWS-----TEGSPLGNNVYG--------RGRGSF-WGGGRGR------GRMAAFDSEYGENLDNQPPTQIQRTEEISLPA

Query:  TKVDTSVTMNPEKEKVAAEASD----MELEINLVSAESMENCSN-----GTENLIKVMEDNKKWKEVSGNHGMEELQNVNEDSKLLNKKGKLKDMNLKEM
              +  +  KEK+  +++     ME  + + +    EN SN     G+E+    + + K+WK ++     E+  +VNE ++ L KK    DM+++E 
Subjt:  TKVDTSVTMNPEKEKVAAEASD----MELEINLVSAESMENCSN-----GTENLIKVMEDNKKWKEVSGNHGMEELQNVNEDSKLLNKKGKLKDMNLKEM

Query:  ESLHSVEEKKVENL
             +    V+ +
Subjt:  ESLHSVEEKKVENL

TXG66887.1 hypothetical protein EZV62_008162 [Acer yangbiense], e value 1.6e-32, identity 33.46%
Query:  SSMLLLRKKLVYSNYRKSEIDRSGQKLTTSILCKIHTKKKINPEMFSSKMPKIWSQEHTIIDKVGFNIFICRFKNAKIKGWILESGPWFYDKAMLLFEEP
        +S+ L  K     + R++  D +G+KL   ++ K+ T K +N E F + MPKIW Q    ++ V  N F+  F+N   + WIL  GPW +D  +L+ E+P
Subjt:  SSMLLLRKKLVYSNYRKSEIDRSGQKLTTSILCKIHTKKKINPEMFSSKMPKIWSQEHTIIDKVGFNIFICRFKNAKIKGWILESGPWFYDKAMLLFEEP

Query:  KAGVSCEDMEFRYVSLWTHFHKLPYACFSRNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEKLPDFC
                + F  V+ W      P  C ++   E IG+L+G++  +D+    +  +G  +R+K+ +DV  PLKR + L+   KG +  + + YEKLP++C
Subjt:  KAGVSCEDMEFRYVSLWTHFHKLPYACFSRNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEKLPDFC

Query:  YGCGCLGHTIKE-----GEDIMGTSDGELRYGAWLREPVKLWSTEGSPLGNNVYG----RGRGSFWGGGRGR
        + CG +GH+ ++     GEDI G+ D E  YG W+R           P  NNV G    RG GS  G G  R
Subjt:  YGCGCLGHTIKE-----GEDIMGTSDGELRYGAWLREPVKLWSTEGSPLGNNVYG----RGRGSFWGGGRGR

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia], e value 1.8e-34, identity 33.05%
Query:  WPSSMLLLRKKLVYSNYRKSEIDRSGQKLTTSILCKIHTKKKINPEMFSSKMPKIWSQEHTI--IDKVGFNIFICRFKNAKIKGWILESGPWFYDKAMLL
        W +  L   +  +  +   S ++ +G+ L  S++CK+ +K+ I+  +  + +   W  +     +D +GFNIF+  F  +  +  IL  GPW +D+A+++
Subjt:  WPSSMLLLRKKLVYSNYRKSEIDRSGQKLTTSILCKIHTKKKINPEMFSSKMPKIWSQEHTI--IDKVGFNIFICRFKNAKIKGWILESGPWFYDKAMLL

Query:  FEEPKAGVSCEDMEFRYVSLWTHFHKLPYACFSRNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEKL
         + P +     DM+FR VSLW HF  L  AC ++  A  +GN +G  E V+  +  +  WG  LR++++ DV  PL RGI L         WI + YE+L
Subjt:  FEEPKAGVSCEDMEFRYVSLWTHFHKLPYACFSRNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEKL

Query:  PDFCYGCGCLGHTIKEGED-IMGTSDGELRYGAWLR
        PDF Y CG L H +K+  D  + +    L+YG WLR
Subjt:  PDFCYGCGCLGHTIKEGED-IMGTSDGELRYGAWLR

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia], e value 1.3e-34, identity 34.68%
Query:  IDRSGQKLTTS-----ILCKIHTKKKINPEMFSSKMPKIWS-QEHTIIDKVGFNIFICRFKNAKIKGWILESGPWFYDKAMLLFEEPKAGVSCEDMEFRY
        IDR    LT       ++ K+HT K+I+ E   S M  +W     T  + +G NI++  FK+   K  +L SGPW ++K++L+   P A     DM F +
Subjt:  IDRSGQKLTTS-----ILCKIHTKKKINPEMFSSKMPKIWS-QEHTIIDKVGFNIFICRFKNAKIKGWILESGPWFYDKAMLLFEEPKAGVSCEDMEFRY

Query:  VSLWTHFHKLPYACFSRNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEKLPDFCYGCGCLGHTIKEG
         + W   H +P+ C S   A  +G  LG VE+++  D  D   G  +R++++IDV  PL+RGI L++ S G D W  + YEKLPDFCY CG +GH+ +E 
Subjt:  VSLWTHFHKLPYACFSRNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEKLPDFCYGCGCLGHTIKEG

Query:  E--DIMGTSDGELRYGAWLREPVKLWSTEGSPLGNNVYGR----GRGSFWGGGR-GRGRMAAFDSEYGENLDNQPPTQIQRTEEISLPATKVDTSVT
        E    + T++   +YG WLR    L     S     V+ R    GRG    GGR GRG     D  +  ++D  P +  +R  E  +     + SVT
Subjt:  E--DIMGTSDGELRYGAWLREPVKLWSTEGSPLGNNVYGR----GRGSFWGGGR-GRGRMAAFDSEYGENLDNQPPTQIQRTEEISLPATKVDTSVT

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia], e value 1.8e-31, identity 29.92%
Query:  SNWPSSMLLLRKKLVYSNYRKSEIDRSGQKLTTSILCKIHTKKKINPEMFSSKMPKIWSQEHTI-IDKVGFNIFICRFKNAKIKGWILESGPWFYDKAML
        ++W    L   +  +  +     +  + Q L  S++ K+  K+ I+ ++ S  +   W  EH + ++ +G N+F+  F        ++++GPWF+DKA++
Subjt:  SNWPSSMLLLRKKLVYSNYRKSEIDRSGQKLTTSILCKIHTKKKINPEMFSSKMPKIWSQEHTI-IDKVGFNIFICRFKNAKIKGWILESGPWFYDKAML

Query:  LFEEPKAGVSCEDMEFRYVSLWTHFHKLPYACFSRNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEK
        + ++P +  +  ++EF  V+ W H   LP +  ++  A  +GN +G    VD  +++  SWG SLRI++ ID+  PL+RGI +         WI + YE+
Subjt:  LFEEPKAGVSCEDMEFRYVSLWTHFHKLPYACFSRNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEK

Query:  LPDFCYGCGCLGHT--------IKEGEDIMGTSDGELRYGAWLR
        LPDFCY CG +GH+        +   +D   TS+    YG WLR
Subjt:  LPDFCYGCGCLGHT--------IKEGEDIMGTSDGELRYGAWLR

TrEMBL top hits (e value, % identity, alignment)
A0A5C7H9Y2 CCHC-type domain-containing protein, e value 2.7e-33, identity 28.5%
Query:  DRSGQKLTTSILCKIHTKKKINPEMFSSKMPKIWSQEHTI-IDKVGFNIFICRFKNAKIKGWILESGPWFYDKAMLLFEEPKAGVSCEDMEFRYVSLWTH
        +R  Q L+ S++ K  T K IN E F S +  IW  ++ + ++ +G NIF  RF+N   +  ILE GPW +DK +L+  E        D++FRYV  W  
Subjt:  DRSGQKLTTSILCKIHTKKKINPEMFSSKMPKIWSQEHTI-IDKVGFNIFICRFKNAKIKGWILESGPWFYDKAMLLFEEPKAGVSCEDMEFRYVSLWTH

Query:  FHKLPYACFSRNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEKLPDFCYGCGCLGHTIKE----GED
         H LP AC +R     +G L+G+V+++D  +  +   G  +RI++ IDV  PLKRG+ +  G       + + YE+LP+FCY CG +GH +++     ++
Subjt:  FHKLPYACFSRNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEKLPDFCYGCGCLGHTIKE----GED

Query:  IMGTSDGELRYGAWLREPVKLWS-----TEGSPLGNNVYG--------RGRGSF-WGGGRGR------GRMAAFDSEYGENLDNQPPTQIQRTEEISLPA
        I  TS    ++G W+R   +  S      + SP G+   G        R +GS  W  G+        G      +E       +  T + RT  +S   
Subjt:  IMGTSDGELRYGAWLREPVKLWS-----TEGSPLGNNVYG--------RGRGSF-WGGGRGR------GRMAAFDSEYGENLDNQPPTQIQRTEEISLPA

Query:  TKVDTSVTMNPEKEKVAAEASD----MELEINLVSAESMENCSN-----GTENLIKVMEDNKKWKEVSGNHGMEELQNVNEDSKLLNKKGKLKDMNLKEM
              +  +  KEK+  +++     ME  + + +    EN SN     G+E+    + + K+WK ++     E+  +VNE ++ L KK    DM+++E 
Subjt:  TKVDTSVTMNPEKEKVAAEASD----MELEINLVSAESMENCSN-----GTENLIKVMEDNKKWKEVSGNHGMEELQNVNEDSKLLNKKGKLKDMNLKEM

Query:  ESLHSVEEKKVENL
             +    V+ +
Subjt:  ESLHSVEEKKVENL

A0A5C7ISG5 CCHC-type domain-containing protein, e value 1.5e-31, identity 32.54%
Query:  ILCKIHTKKKINPEMFSSKMPKIWSQEHTI-IDKVGFNIFICRFKNAKIKGWILESGPWFYDKAMLLFEEPKAGVSCEDMEFRYVSLWTHFHKLPYACFS
        ++ KI + KK+N E F   + ++WS    + I+ V FNIF+ +F N + +  I + GPW+++K++++ E+P+   S   ++F  V LW   H +P  C +
Subjt:  ILCKIHTKKKINPEMFSSKMPKIWSQEHTI-IDKVGFNIFICRFKNAKIKGWILESGPWFYDKAMLLFEEPKAGVSCEDMEFRYVSLWTHFHKLPYACFS

Query:  RNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEKLPDFCYGCGCLGHTIKEGED----IMGTSDGELR
        + +A+ +   +G+V  ++I  +    WG  L++K+QID+  PLKR + L+         +S+ YE+LP+FCY CG +GH  KE  D    I        +
Subjt:  RNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEKLPDFCYGCGCLGHTIKEGED----IMGTSDGELR

Query:  YGAWLREPV
        +G+W+R  V
Subjt:  YGAWLREPV

A0A6J1BSZ1 uncharacterized protein LOC111005481, e value 8.5e-35, identity 33.05%
Query:  WPSSMLLLRKKLVYSNYRKSEIDRSGQKLTTSILCKIHTKKKINPEMFSSKMPKIWSQEHTI--IDKVGFNIFICRFKNAKIKGWILESGPWFYDKAMLL
        W +  L   +  +  +   S ++ +G+ L  S++CK+ +K+ I+  +  + +   W  +     +D +GFNIF+  F  +  +  IL  GPW +D+A+++
Subjt:  WPSSMLLLRKKLVYSNYRKSEIDRSGQKLTTSILCKIHTKKKINPEMFSSKMPKIWSQEHTI--IDKVGFNIFICRFKNAKIKGWILESGPWFYDKAMLL

Query:  FEEPKAGVSCEDMEFRYVSLWTHFHKLPYACFSRNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEKL
         + P +     DM+FR VSLW HF  L  AC ++  A  +GN +G  E V+  +  +  WG  LR++++ DV  PL RGI L         WI + YE+L
Subjt:  FEEPKAGVSCEDMEFRYVSLWTHFHKLPYACFSRNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEKL

Query:  PDFCYGCGCLGHTIKEGED-IMGTSDGELRYGAWLR
        PDF Y CG L H +K+  D  + +    L+YG WLR
Subjt:  PDFCYGCGCLGHTIKEGED-IMGTSDGELRYGAWLR

A0A6J1D765 uncharacterized protein LOC111017902, e value 6.5e-35, identity 34.68%
Query:  IDRSGQKLTTS-----ILCKIHTKKKINPEMFSSKMPKIWS-QEHTIIDKVGFNIFICRFKNAKIKGWILESGPWFYDKAMLLFEEPKAGVSCEDMEFRY
        IDR    LT       ++ K+HT K+I+ E   S M  +W     T  + +G NI++  FK+   K  +L SGPW ++K++L+   P A     DM F +
Subjt:  IDRSGQKLTTS-----ILCKIHTKKKINPEMFSSKMPKIWS-QEHTIIDKVGFNIFICRFKNAKIKGWILESGPWFYDKAMLLFEEPKAGVSCEDMEFRY

Query:  VSLWTHFHKLPYACFSRNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEKLPDFCYGCGCLGHTIKEG
         + W   H +P+ C S   A  +G  LG VE+++  D  D   G  +R++++IDV  PL+RGI L++ S G D W  + YEKLPDFCY CG +GH+ +E 
Subjt:  VSLWTHFHKLPYACFSRNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEKLPDFCYGCGCLGHTIKEG

Query:  E--DIMGTSDGELRYGAWLREPVKLWSTEGSPLGNNVYGR----GRGSFWGGGR-GRGRMAAFDSEYGENLDNQPPTQIQRTEEISLPATKVDTSVT
        E    + T++   +YG WLR    L     S     V+ R    GRG    GGR GRG     D  +  ++D  P +  +R  E  +     + SVT
Subjt:  E--DIMGTSDGELRYGAWLREPVKLWSTEGSPLGNNVYGR----GRGSFWGGGR-GRGRMAAFDSEYGENLDNQPPTQIQRTEEISLPATKVDTSVT

A0A6J1DU55 uncharacterized protein LOC111023135, e value 8.8e-32, identity 29.92%
Query:  SNWPSSMLLLRKKLVYSNYRKSEIDRSGQKLTTSILCKIHTKKKINPEMFSSKMPKIWSQEHTI-IDKVGFNIFICRFKNAKIKGWILESGPWFYDKAML
        ++W    L   +  +  +     +  + Q L  S++ K+  K+ I+ ++ S  +   W  EH + ++ +G N+F+  F        ++++GPWF+DKA++
Subjt:  SNWPSSMLLLRKKLVYSNYRKSEIDRSGQKLTTSILCKIHTKKKINPEMFSSKMPKIWSQEHTI-IDKVGFNIFICRFKNAKIKGWILESGPWFYDKAML

Query:  LFEEPKAGVSCEDMEFRYVSLWTHFHKLPYACFSRNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEK
        + ++P +  +  ++EF  V+ W H   LP +  ++  A  +GN +G    VD  +++  SWG SLRI++ ID+  PL+RGI +         WI + YE+
Subjt:  LFEEPKAGVSCEDMEFRYVSLWTHFHKLPYACFSRNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEK

Query:  LPDFCYGCGCLGHT--------IKEGEDIMGTSDGELRYGAWLR
        LPDFCY CG +GH+        +   +D   TS+    YG WLR
Subjt:  LPDFCYGCGCLGHT--------IKEGEDIMGTSDGELRYGAWLR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCGTTGGCGAAGCAATTGGCCGAGCTCCATGTTACTCCTGCGGAAAAAGCTAGTATATTCAAATTACCGGAAAAGTGAGATCGATCGCTCAGGACAGAAACTGACGAC
TTCTATTTTGTGCAAAATTCATACAAAGAAGAAAATTAACCCCGAGATGTTTAGCTCCAAGATGCCGAAGATATGGAGTCAAGAACACACTATCATTGATAAGGTGGGAT
TCAACATTTTTATATGCAGGTTCAAAAATGCGAAAATTAAAGGATGGATCCTGGAATCGGGACCATGGTTTTACGACAAAGCCATGCTTCTTTTCGAGGAGCCGAAGGCA
GGGGTTAGCTGCGAGGATATGGAATTCAGGTATGTTTCTCTATGGACTCATTTTCATAAGCTTCCATATGCGTGTTTTTCCAGGAATTCTGCGGAAACCATTGGAAATTT
ACTAGGAAAGGTAGAAAAAGTGGACATACGGGATGATGAAGACCCTTCTTGGGGATGTTCATTAAGGATTAAAATTCAAATCGATGTTTACATGCCTCTTAAACGAGGTA
TCTTTTTGCAATCAGGGAGTAAGGGGATTGATAAATGGATTTCGGTTACGTACGAAAAACTACCGGATTTCTGTTATGGTTGTGGTTGCTTGGGTCATACTATTAAGGAG
GGTGAGGATATAATGGGTACAAGTGATGGCGAACTTAGATATGGAGCTTGGTTAAGAGAGCCTGTCAAGTTATGGAGCACTGAAGGTAGTCCATTGGGGAACAATGTTTA
TGGTCGAGGAAGAGGTAGTTTTTGGGGAGGAGGTAGAGGGAGGGGTCGGATGGCGGCTTTTGACAGTGAGTATGGTGAGAATCTTGATAATCAACCACCCACTCAGATAC
AGAGGACTGAAGAGATTAGTTTGCCGGCGACAAAAGTAGACACGTCGGTGACCATGAATCCAGAAAAGGAAAAAGTAGCGGCTGAAGCTTCTGATATGGAATTAGAAATC
AATTTGGTTAGTGCTGAATCAATGGAGAATTGCTCCAACGGGACAGAGAATCTCATTAAAGTTATGGAGGACAATAAAAAATGGAAGGAAGTAAGCGGTAATCATGGGAT
GGAGGAGTTACAAAATGTTAATGAGGATTCAAAGTTGCTGAATAAAAAAGGAAAGTTGAAAGATATGAATTTAAAGGAGATGGAAAGTTTACATTCGGTGGAAGAAAAGA
AAGTTGAAAATTTGAATCTGGAAGTGTCCAAGATGGATGAATCTCTAAGTGTGGAGATGGTGAATGGGCCTCCAATTAAGGATGAATTGCCGCCAACGGAGGAAATTAAA
GCTAAAAAATGGAAGAGAGTTATACAGGGAGACATTTCACTGGCGACATCCATACAAGACAAACAAGACATCCAAGTTTTTAGTGGATCAAAACATAAAGCAGAGGTGGA
ACTGGAACAATTGGAACCAAGTAAACGAAGGTTAGTGACTGAGGGTAATGATTCAAATGCGAGATCGGTGGAGGCTACGAGACAGCCCCGCCGGGCACAATGA
mRNA sequence
ATGCGTTGGCGAAGCAATTGGCCGAGCTCCATGTTACTCCTGCGGAAAAAGCTAGTATATTCAAATTACCGGAAAAGTGAGATCGATCGCTCAGGACAGAAACTGACGAC
TTCTATTTTGTGCAAAATTCATACAAAGAAGAAAATTAACCCCGAGATGTTTAGCTCCAAGATGCCGAAGATATGGAGTCAAGAACACACTATCATTGATAAGGTGGGAT
TCAACATTTTTATATGCAGGTTCAAAAATGCGAAAATTAAAGGATGGATCCTGGAATCGGGACCATGGTTTTACGACAAAGCCATGCTTCTTTTCGAGGAGCCGAAGGCA
GGGGTTAGCTGCGAGGATATGGAATTCAGGTATGTTTCTCTATGGACTCATTTTCATAAGCTTCCATATGCGTGTTTTTCCAGGAATTCTGCGGAAACCATTGGAAATTT
ACTAGGAAAGGTAGAAAAAGTGGACATACGGGATGATGAAGACCCTTCTTGGGGATGTTCATTAAGGATTAAAATTCAAATCGATGTTTACATGCCTCTTAAACGAGGTA
TCTTTTTGCAATCAGGGAGTAAGGGGATTGATAAATGGATTTCGGTTACGTACGAAAAACTACCGGATTTCTGTTATGGTTGTGGTTGCTTGGGTCATACTATTAAGGAG
GGTGAGGATATAATGGGTACAAGTGATGGCGAACTTAGATATGGAGCTTGGTTAAGAGAGCCTGTCAAGTTATGGAGCACTGAAGGTAGTCCATTGGGGAACAATGTTTA
TGGTCGAGGAAGAGGTAGTTTTTGGGGAGGAGGTAGAGGGAGGGGTCGGATGGCGGCTTTTGACAGTGAGTATGGTGAGAATCTTGATAATCAACCACCCACTCAGATAC
AGAGGACTGAAGAGATTAGTTTGCCGGCGACAAAAGTAGACACGTCGGTGACCATGAATCCAGAAAAGGAAAAAGTAGCGGCTGAAGCTTCTGATATGGAATTAGAAATC
AATTTGGTTAGTGCTGAATCAATGGAGAATTGCTCCAACGGGACAGAGAATCTCATTAAAGTTATGGAGGACAATAAAAAATGGAAGGAAGTAAGCGGTAATCATGGGAT
GGAGGAGTTACAAAATGTTAATGAGGATTCAAAGTTGCTGAATAAAAAAGGAAAGTTGAAAGATATGAATTTAAAGGAGATGGAAAGTTTACATTCGGTGGAAGAAAAGA
AAGTTGAAAATTTGAATCTGGAAGTGTCCAAGATGGATGAATCTCTAAGTGTGGAGATGGTGAATGGGCCTCCAATTAAGGATGAATTGCCGCCAACGGAGGAAATTAAA
GCTAAAAAATGGAAGAGAGTTATACAGGGAGACATTTCACTGGCGACATCCATACAAGACAAACAAGACATCCAAGTTTTTAGTGGATCAAAACATAAAGCAGAGGTGGA
ACTGGAACAATTGGAACCAAGTAAACGAAGGTTAGTGACTGAGGGTAATGATTCAAATGCGAGATCGGTGGAGGCTACGAGACAGCCCCGCCGGGCACAATGA
Protein sequence
MRWRSNWPSSMLLLRKKLVYSNYRKSEIDRSGQKLTTSILCKIHTKKKINPEMFSSKMPKIWSQEHTIIDKVGFNIFICRFKNAKIKGWILESGPWFYDKAMLLFEEPKA
GVSCEDMEFRYVSLWTHFHKLPYACFSRNSAETIGNLLGKVEKVDIRDDEDPSWGCSLRIKIQIDVYMPLKRGIFLQSGSKGIDKWISVTYEKLPDFCYGCGCLGHTIKE
GEDIMGTSDGELRYGAWLREPVKLWSTEGSPLGNNVYGRGRGSFWGGGRGRGRMAAFDSEYGENLDNQPPTQIQRTEEISLPATKVDTSVTMNPEKEKVAAEASDMELEI
NLVSAESMENCSNGTENLIKVMEDNKKWKEVSGNHGMEELQNVNEDSKLLNKKGKLKDMNLKEMESLHSVEEKKVENLNLEVSKMDESLSVEMVNGPPIKDELPPTEEIK
AKKWKRVIQGDISLATSIQDKQDIQVFSGSKHKAEVELEQLEPSKRRLVTEGNDSNARSVEATRQPRRAQ
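
Consistency check (illustrative)
The genome location, CDS and protein sequence in this record can be cross-checked with simple arithmetic. The sketch below is not part of the database; it assumes the location string uses inclusive 1-based coordinates, and the helper name span_length and the hand-counted protein length of 510 residues are introduced here for illustration only.

```python
def span_length(location: str) -> int:
    """Length in bp of a 'chr:start..end' interval, assuming inclusive 1-based coordinates."""
    _, coords = location.split(":")
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


location = "chr3:9658111..9659643"  # Genome location field of this record
protein_length = 510                # residues in the protein sequence (counted by hand)

genomic_bp = span_length(location)
cds_bp = 3 * (protein_length + 1)   # +1 codon for the terminal stop (TGA)

print(genomic_bp, cds_bp)
```

If the two numbers agree, the coding sequence fills the whole genomic interval, which would be consistent with the CDS and mRNA sequences listed here being identical (a single exon with no annotated UTRs).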