; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025291 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025291
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSWIM-type domain-containing protein
Genome locationchr10:10898025..10901162
RNA-Seq ExpressionLag0025291
SyntenyLag0025291
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0062637.1 uncharacterized protein E6C27_scaffold79G001490 [Cucumis melo var. makuwa]2.0e-6554.78Show/hide
Query:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS
        MRWMESI+P+IR YL  VGFE+W+ AYSRKRRY++MTTN+ ESLNS L   RD PVA+LL+AIR  LQRWFYER   A C+KS L++W E  +RK  D+S
Subjt:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS

Query:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEWSSIGGNMNALPPV
        RSF+V+P+S  + +V+DG   F+V L   SCSC  WDL+EI CAHAL V+R  NL  Y+FVS YY++  L +TY G V    NH++WS +  N N LPP+
Subjt:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEWSSIGGNMNALPPV

Query:  VKRQAGRPRKQRMLSIGSNNKKGNTTQKRR
         +R  GRPRK+R+ SIG  +K    ++ +R
Subjt:  VKRQAGRPRKQRMLSIGSNNKKGNTTQKRR

TYJ96591.1 uncharacterized protein E5676_scaffold1278G00090 [Cucumis melo var. makuwa]2.0e-7057.83Show/hide
Query:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS
        MRWMESI+P+IR YL  VGFE+W+RAYSRKRRY++MTTN+ ESLNS L   RD PVA+LL+AIR  LQRWFYER   A C+KS L++W E  +RK  D+S
Subjt:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS

Query:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEWSSIGGNMNALPPV
        RSF+V+P+S  + +V+DG   F+V L   SCSC  WDL+EIPCAHAL V+R  NL  Y+FVS YY++  LS+TY G V P  NH++WS +G N N LPPV
Subjt:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEWSSIGGNMNALPPV

Query:  VKRQAGRPRKQRMLSIGSNNKKGNTTQKRR
         +R AGRPRK+R+ SIG  +K    ++ +R
Subjt:  VKRQAGRPRKQRMLSIGSNNKKGNTTQKRR

XP_016900337.1 PREDICTED: uncharacterized protein LOC107990862 [Cucumis melo]1.7e-6455.65Show/hide
Query:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS
        MR MESI+P+IR YL  VGFE+W+RAYSRKRRY++MTTN+ ESLNS L   RD PVA+LL+AIR  LQRWFYER   A C+KS L++W E  +RK  D+S
Subjt:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS

Query:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEWSSIGGNMNALPPV
        RSF+V+P+S  + +V+DG   F+V L   SCSC  WDL+EIP AHAL V+   NL  Y+FVS YY++  LS+TY G V P  NH++WS +  N N LPPV
Subjt:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEWSSIGGNMNALPPV

Query:  VKRQAGRPRKQRMLSIGSNNKKGNTTQKRR
         +R A RPRK+R+ SIG  +K    ++ +R
Subjt:  VKRQAGRPRKQRMLSIGSNNKKGNTTQKRR

XP_038880504.1 uncharacterized protein LOC120072166 [Benincasa hispida]1.6e-7863.48Show/hide
Query:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS
        M+WMESI+PTIREYLN VGFEKWA A+SR+R Y +MTTN+SE LN+IL E R+FPVA+LLD IR +LQ WFYE+G     MK  LT+W E ELR++H++S
Subjt:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS

Query:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEWSSIGGNMNALPPV
        RSF VDPI+NE+ KVIDGDN+F+VNL   SCSCRVWDL EIPCAHA AV+ G NL  Y+FV +YYF  TL STYKG V+P  NHS+W SI   +N LPP+
Subjt:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEWSSIGGNMNALPPV

Query:  VKRQAGRPRKQRMLSIGSNNKKGNTTQKRR
        VKR  GRPRKQR+LSIG   +    ++  R
Subjt:  VKRQAGRPRKQRMLSIGSNNKKGNTTQKRR

XP_038907134.1 uncharacterized protein LOC120092945 [Benincasa hispida]2.8e-8065.9Show/hide
Query:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS
        M+WMESI+P+IREYL+ VGFEKWARAYSR++RYRMMTTN+SESL+S+L ESR+FP+A+LLD+IR LLQ WFYER  +A  MK+ LT+W E+ELR +H++S
Subjt:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS

Query:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEWSSIGGNMNALPPV
        R+F VD I+NE+ KV+DGDN+++VN+   SCSC  WDL+EIPCAHA AVL   NL +Y+FVS+YYFS T SSTYK  +HP  NHS+WSS+  + N LPP+
Subjt:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEWSSIGGNMNALPPV

Query:  VKRQAGRPRKQRMLSIG
        VKRQ GRPRKQR+LS+G
Subjt:  VKRQAGRPRKQRMLSIG

TrEMBL top hitse value%identityAlignment
A0A1S4DWH2 uncharacterized protein LOC1079908628.1e-6555.65Show/hide
Query:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS
        MR MESI+P+IR YL  VGFE+W+RAYSRKRRY++MTTN+ ESLNS L   RD PVA+LL+AIR  LQRWFYER   A C+KS L++W E  +RK  D+S
Subjt:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS

Query:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEWSSIGGNMNALPPV
        RSF+V+P+S  + +V+DG   F+V L   SCSC  WDL+EIP AHAL V+   NL  Y+FVS YY++  LS+TY G V P  NH++WS +  N N LPPV
Subjt:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEWSSIGGNMNALPPV

Query:  VKRQAGRPRKQRMLSIGSNNKKGNTTQKRR
         +R A RPRK+R+ SIG  +K    ++ +R
Subjt:  VKRQAGRPRKQRMLSIGSNNKKGNTTQKRR

A0A5A7T3G5 Protein FAR1-RELATED SEQUENCE 4-like8.9e-6457.14Show/hide
Query:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS
        MRWMES++P+IR+YL+ V FEKWARAY  ++RY+MMTTN+SESLN++L ESRD PVA LLD+IR +LQ WFY+R  VA CM++ LT+W E ELR +H  S
Subjt:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS

Query:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEWSSIGGNMNALPPV
        RSF V+ I++ + +VIDG   FIV L+  SC+CRVWDLDEIPCAHALAVLRG                         + P  NH+ W SIG   N LPP 
Subjt:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEWSSIGGNMNALPPV

Query:  VKRQAGRPRKQRMLSIG
         KR+AGRPRKQR+LSIG
Subjt:  VKRQAGRPRKQRMLSIG

A0A5A7V6N4 SWIM-type domain-containing protein9.5e-6654.78Show/hide
Query:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS
        MRWMESI+P+IR YL  VGFE+W+ AYSRKRRY++MTTN+ ESLNS L   RD PVA+LL+AIR  LQRWFYER   A C+KS L++W E  +RK  D+S
Subjt:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS

Query:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEWSSIGGNMNALPPV
        RSF+V+P+S  + +V+DG   F+V L   SCSC  WDL+EI CAHAL V+R  NL  Y+FVS YY++  L +TY G V    NH++WS +  N N LPP+
Subjt:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEWSSIGGNMNALPPV

Query:  VKRQAGRPRKQRMLSIGSNNKKGNTTQKRR
         +R  GRPRK+R+ SIG  +K    ++ +R
Subjt:  VKRQAGRPRKQRMLSIGSNNKKGNTTQKRR

A0A5D3BD68 SWIM-type domain-containing protein9.8e-7157.83Show/hide
Query:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS
        MRWMESI+P+IR YL  VGFE+W+RAYSRKRRY++MTTN+ ESLNS L   RD PVA+LL+AIR  LQRWFYER   A C+KS L++W E  +RK  D+S
Subjt:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS

Query:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEWSSIGGNMNALPPV
        RSF+V+P+S  + +V+DG   F+V L   SCSC  WDL+EIPCAHAL V+R  NL  Y+FVS YY++  LS+TY G V P  NH++WS +G N N LPPV
Subjt:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEWSSIGGNMNALPPV

Query:  VKRQAGRPRKQRMLSIGSNNKKGNTTQKRR
         +R AGRPRK+R+ SIG  +K    ++ +R
Subjt:  VKRQAGRPRKQRMLSIGSNNKKGNTTQKRR

A0A5D3DJR8 MuDR family transposase1.4e-6464.32Show/hide
Query:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS
        MRWMES++P+IR+YL+ V FEKWARAY  ++RY+MMTTN+SESLN++L ESRD PVA LLD+IR +LQ WFY+R  VA CM++ LT+W E ELR +H  S
Subjt:  MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKS

Query:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHS
        RSF V+ I++ + +VIDG   FIV L+  SC+CRVWDLDEIPCAHALAVLRGRN+ TYSFVS Y+ S TL STY G V P  NH+
Subjt:  RSFVVDPISNEQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64260.1 MuDR family transposase1.9e-0524.48Show/hide
Query:  HPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYE-RGNVAFCMKSTLTTWVESELRKE--HDKSRSFV
        +P   ++L+ +   KWA A+    RY ++  +  E+L ++    R FP  T+  A+ G +   F E R +    + S  ++     +  E   DK   F+
Subjt:  HPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYE-RGNVAFCMKSTLTTWVESELRKE--HDKSRSFV

Query:  VDPI-------SNEQSKVIDGD--NYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEW
         D I         +  KV +      +IV L + +C+CR +   + PC HALAV     +    +V   Y       TY     P  + + W
Subjt:  VDPI-------SNEQSKVIDGD--NYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGTGGATGGAGTCGATACATCCTACTATTCGAGAATATCTCAATAATGTCGGGTTTGAAAAGTGGGCTCGTGCATATTCTAGGAAAAGGAGATATCGGATGATGAC
AACCAATGTTTCAGAATCTTTAAATTCAATTTTATTAGAGTCTAGAGATTTTCCTGTTGCAACACTACTTGATGCTATCAGAGGTTTATTGCAAAGATGGTTTTACGAGC
GAGGTAATGTAGCATTTTGTATGAAGTCTACTTTGACCACTTGGGTAGAGAGTGAGTTACGAAAAGAACATGATAAGTCCAGAAGTTTTGTGGTTGATCCTATTAGTAAT
GAACAATCCAAAGTAATTGATGGAGATAACTATTTCATTGTGAACTTGGAATTGGGATCATGTAGTTGTCGTGTTTGGGATCTTGATGAAATTCCATGCGCTCATGCACT
TGCTGTTCTTCGTGGGCGGAATTTGAGAACTTATTCTTTTGTATCAAATTATTACTTTTCAAGAACGTTGTCATCAACTTATAAAGGTTTTGTTCATCCCGCCGATAATC
ATTCTGAATGGAGCTCTATAGGTGGAAACATGAATGCGTTGCCTCCAGTTGTTAAACGTCAGGCAGGAAGACCAAGAAAACAGAGGATGCTATCAATTGGATCCAACAAC
AAAAAGGGTAATACTACCCAGAAGAGACGAAACAGGGAGGTAACCAACCTCATGAGATCTTGGGAGAAAGAAGCAGAGTATCAATCAGATAAAATTGACGAAATCGAGGA
AGAGCAGTTTAGTGAAAACATAGTAGAAAGGGATAGATGTGTCTTTTTCAGGGACAGATTGATGGCTGGATCTCTGGGGTATATGGGCCATGCTCCTGAAGAGGGAAGTC
AGTTCTTTCAAGAACTTTATGACCTTCAGGGTCTTTGTCAAGGGGTCTGGTGTATTGCTGGGGACTTTAACATGATTAGATGGTCCAAGGAAAGATTAGTTCCTAAGAAT
TCCTCTAGAGGCATGAAGAAGTTTAATCGTTTTGCTCGTATATGTGCCCTCATTGATCCGCCCATGTTAAATGGTAATTTTTCGTGGTCTAGAATGGGAGTTAGAGTGGC
TGCCTCGAGAATTGACAGATTCCTAGTGTCGAAGCCTTGGGTTGATTCCTTCGGAGATGGCAGAGTGGAAAGACTTCATAGACCCACCTCTGACCACTTCCCCATTATGA
TGTCCTTAGGGGCCTTAAATTGGGGCCCAACCCCTTTTAGATTTGAGAACGCTTGGTTAGATAATGTTGAATTCAAAGGGAAGGTCGAATCCTGGTGGAAGGAGCTGTAT
CCGACAGGATGGATTGGGTTTAGACTTATGGAGAAGCTGAAAGGCCTGAAGTTTAAGATCAGAGAGTGGAATAGGGAGAGCCAATCCAAAGTTGGTAGTAAAAAAAAGGA
GATTCTGGCAAAAATAGAAGAGATAGATTGTTTAGAAGAGAAAAATAACGTCCAACTTATTCAGATTGAAGAAAGAAAAATGCTTAAGGCAAAGTTGATGGAGATTATCA
TTGACGAGCAAAGATGTCTCAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGGTGGATGGAGTCGATACATCCTACTATTCGAGAATATCTCAATAATGTCGGGTTTGAAAAGTGGGCTCGTGCATATTCTAGGAAAAGGAGATATCGGATGATGAC
AACCAATGTTTCAGAATCTTTAAATTCAATTTTATTAGAGTCTAGAGATTTTCCTGTTGCAACACTACTTGATGCTATCAGAGGTTTATTGCAAAGATGGTTTTACGAGC
GAGGTAATGTAGCATTTTGTATGAAGTCTACTTTGACCACTTGGGTAGAGAGTGAGTTACGAAAAGAACATGATAAGTCCAGAAGTTTTGTGGTTGATCCTATTAGTAAT
GAACAATCCAAAGTAATTGATGGAGATAACTATTTCATTGTGAACTTGGAATTGGGATCATGTAGTTGTCGTGTTTGGGATCTTGATGAAATTCCATGCGCTCATGCACT
TGCTGTTCTTCGTGGGCGGAATTTGAGAACTTATTCTTTTGTATCAAATTATTACTTTTCAAGAACGTTGTCATCAACTTATAAAGGTTTTGTTCATCCCGCCGATAATC
ATTCTGAATGGAGCTCTATAGGTGGAAACATGAATGCGTTGCCTCCAGTTGTTAAACGTCAGGCAGGAAGACCAAGAAAACAGAGGATGCTATCAATTGGATCCAACAAC
AAAAAGGGTAATACTACCCAGAAGAGACGAAACAGGGAGGTAACCAACCTCATGAGATCTTGGGAGAAAGAAGCAGAGTATCAATCAGATAAAATTGACGAAATCGAGGA
AGAGCAGTTTAGTGAAAACATAGTAGAAAGGGATAGATGTGTCTTTTTCAGGGACAGATTGATGGCTGGATCTCTGGGGTATATGGGCCATGCTCCTGAAGAGGGAAGTC
AGTTCTTTCAAGAACTTTATGACCTTCAGGGTCTTTGTCAAGGGGTCTGGTGTATTGCTGGGGACTTTAACATGATTAGATGGTCCAAGGAAAGATTAGTTCCTAAGAAT
TCCTCTAGAGGCATGAAGAAGTTTAATCGTTTTGCTCGTATATGTGCCCTCATTGATCCGCCCATGTTAAATGGTAATTTTTCGTGGTCTAGAATGGGAGTTAGAGTGGC
TGCCTCGAGAATTGACAGATTCCTAGTGTCGAAGCCTTGGGTTGATTCCTTCGGAGATGGCAGAGTGGAAAGACTTCATAGACCCACCTCTGACCACTTCCCCATTATGA
TGTCCTTAGGGGCCTTAAATTGGGGCCCAACCCCTTTTAGATTTGAGAACGCTTGGTTAGATAATGTTGAATTCAAAGGGAAGGTCGAATCCTGGTGGAAGGAGCTGTAT
CCGACAGGATGGATTGGGTTTAGACTTATGGAGAAGCTGAAAGGCCTGAAGTTTAAGATCAGAGAGTGGAATAGGGAGAGCCAATCCAAAGTTGGTAGTAAAAAAAAGGA
GATTCTGGCAAAAATAGAAGAGATAGATTGTTTAGAAGAGAAAAATAACGTCCAACTTATTCAGATTGAAGAAAGAAAAATGCTTAAGGCAAAGTTGATGGAGATTATCA
TTGACGAGCAAAGATGTCTCAACTAA
Protein sequenceShow/hide protein sequence
MRWMESIHPTIREYLNNVGFEKWARAYSRKRRYRMMTTNVSESLNSILLESRDFPVATLLDAIRGLLQRWFYERGNVAFCMKSTLTTWVESELRKEHDKSRSFVVDPISN
EQSKVIDGDNYFIVNLELGSCSCRVWDLDEIPCAHALAVLRGRNLRTYSFVSNYYFSRTLSSTYKGFVHPADNHSEWSSIGGNMNALPPVVKRQAGRPRKQRMLSIGSNN
KKGNTTQKRRNREVTNLMRSWEKEAEYQSDKIDEIEEEQFSENIVERDRCVFFRDRLMAGSLGYMGHAPEEGSQFFQELYDLQGLCQGVWCIAGDFNMIRWSKERLVPKN
SSRGMKKFNRFARICALIDPPMLNGNFSWSRMGVRVAASRIDRFLVSKPWVDSFGDGRVERLHRPTSDHFPIMMSLGALNWGPTPFRFENAWLDNVEFKGKVESWWKELY
PTGWIGFRLMEKLKGLKFKIREWNRESQSKVGSKKKEILAKIEEIDCLEEKNNVQLIQIEERKMLKAKLMEIIIDEQRCLN