; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG09G013593 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG09G013593
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionMuDRA-like transposase
Genome locationCG_Chr09:20671226..20672748
RNA-Seq ExpressionClCG09G013593
SyntenyClCG09G013593
Gene Ontology termsGO:0006313 - transposition, DNA-mediated (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0004803 - transposase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001207 - Transposase, mutator type
IPR006564 - Zinc finger, PMZ-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038876499.1 uncharacterized protein LOC120068931 [Benincasa hispida]2.1e-7964.83Show/hide
Query:  MELSKSFKVTQYPSVHTCSLEIRTGSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEA
        M+ S SFKVT+YPS HTCSLE+R G+HQQ KSW+IGHLIKSKYEQVG+TY PRDIIEDV R+Y VNI+Y RAYR REY LV  RGS KGSY +V AY EA
Subjt:  MELSKSFKVTQYPSVHTCSLEIRTGSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEA

Query:  FKLANPGTVFVVEVEDERFFKYIYMAFGPCIRGFLNCIRSVIVVDSTHLHEKYKDGENDASWLWFMTHLKALIGDIPNLVIISDRHKSIGKAVLSVFLEA
         KL N  T+F VEVEDE++FKY++MA GPCI+GFLNCI  +IVVD THLHEKYK                   G++PNLVI  DRH SI KAV  +F + 
Subjt:  FKLANPGTVFVVEVEDERFFKYIYMAFGPCIRGFLNCIRSVIVVDSTHLHEKYKDGENDASWLWFMTHLKALIGDIPNLVIISDRHKSIGKAVLSVFLEA

Query:  FHALCIYYIRNNLMDRF-NKDIILHFYLAAKAYRIT
        F+ALCIY+I+NNL+D+F NKDII HFYLA KAYR++
Subjt:  FHALCIYYIRNNLMDRF-NKDIILHFYLAAKAYRIT

XP_038880342.1 uncharacterized protein LOC120071984 [Benincasa hispida]3.7e-7665.11Show/hide
Query:  GSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEAFKLANPGTVFVVEVEDERFFKYIY
        G+HQ+ KS VIGHLIKSKYEQVGQTY PRDIIED   D+ VNISY R YR REY LV  RGS KGSY +V AY E FKLANPG +F VEVEDE++FKY++
Subjt:  GSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEAFKLANPGTVFVVEVEDERFFKYIY

Query:  MAFGPCIRGFLNCIRSVIVVDSTHLHEKYK---------DGE--------------NDASWLWFMTHLKALIGDIPNLVIISDRHKSIGKAVLSVFLEAF
        +AFG CIRGFLNCI  V VVD THL+EKYK         DG               NDASW WFMTHLKA + DI NLVIIS  H SI K V ++F  AF
Subjt:  MAFGPCIRGFLNCIRSVIVVDSTHLHEKYK---------DGE--------------NDASWLWFMTHLKALIGDIPNLVIISDRHKSIGKAVLSVFLEAF

Query:  HALCIYYIRNNLMDRF-NKDIILHFYLAAKAYRIT
        HALCIY+I+NNLMDRF NKDII HFYLAAKAY+++
Subjt:  HALCIYYIRNNLMDRF-NKDIILHFYLAAKAYRIT

XP_038880424.1 protein FAR-RED ELONGATED HYPOCOTYL 3-like [Benincasa hispida]3.1e-8365.82Show/hide
Query:  MELSKSFKVTQYPSVHTCSLEIRTGSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEA
        M+ S SFKVT+YP  HTCSLE+RTG+H+Q KSWVIGHLIKSKYEQ+ QTYCP+DII+ V R+Y VNI+YG+ YR REY LV  RG  +GSY +V AYGEA
Subjt:  MELSKSFKVTQYPSVHTCSLEIRTGSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEA

Query:  FKLANPGTVFVVEVEDERFFKYIYMAFGPCIRGFLNCIRSVIVVDSTHLHEKYK-----------------------DGENDASWLWFMTHLKALIGDIP
         KLAN GT+F VEVEDE++FKY++MA G CIRGFLN IR VIVVD THLH KYK                       DGENDASW WFMTHLKA IGD+P
Subjt:  FKLANPGTVFVVEVEDERFFKYIYMAFGPCIRGFLNCIRSVIVVDSTHLHEKYK-----------------------DGENDASWLWFMTHLKALIGDIP

Query:  NLVIISDRHKSIGKAVLSVFLEAFHALCIYYIRNNLM
        NLVIISDRH SI KAV+ +F +AFHALCIY+IRN L+
Subjt:  NLVIISDRHKSIGKAVLSVFLEAFHALCIYYIRNNLM

XP_038884809.1 protein FAR-RED ELONGATED HYPOCOTYL 3-like [Benincasa hispida]2.0e-8565.34Show/hide
Query:  VTQYPSVHTCSLEIRTGSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEAFKLANPGT
        +T+Y S HTCSLE+RTG+H+Q KSWVIGHLIKSKYEQVG+ Y  R IIEDV R+Y VNI+YGRAY  +EY L+ TRG  +GSYA+V AY EA KLAN GT
Subjt:  VTQYPSVHTCSLEIRTGSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEAFKLANPGT

Query:  VFVVEVEDERFFKYIYMAFGPCIRGFLNCIRSVIVVDSTHLHEKYK-----------------------DGENDASWLWFMTHLKALIGDIPNLVIISDR
        +F VEVED ++FKY++MA  PCIRGFLNCIR VIVVD TH HEKYK                       D ENDASW WFMTHLKA IGD+ NLVIIS+R
Subjt:  VFVVEVEDERFFKYIYMAFGPCIRGFLNCIRSVIVVDSTHLHEKYK-----------------------DGENDASWLWFMTHLKALIGDIPNLVIISDR

Query:  HKSIGKAVLSVFLEAFHALCIYYIRNNLMDRF-NKDIILHFYLAAKAYRIT
        H SI KA+  +F  AFHALCIY+IRNNL+D+F N+DII HFYLAAKAYRI+
Subjt:  HKSIGKAVLSVFLEAFHALCIYYIRNNLMDRF-NKDIILHFYLAAKAYRIT

XP_038904212.1 uncharacterized protein LOC120090557 [Benincasa hispida]7.7e-7466.82Show/hide
Query:  IRTGSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEAFKLANPGTVFVVEVEDERFFK
        +R  +H+Q KSWVI HLIKSKYE+VGQTY P+DIIED+ RDY VNISY R                    A V AYGEA KLANPGT+F VEVEDE++FK
Subjt:  IRTGSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEAFKLANPGTVFVVEVEDERFFK

Query:  YIYMAFGPCIRGFLNCIRSVIVVDSTHLHEKYKDGENDASWLWFMTHLKALIGDIPNLVIISDRHKSIGKAVLSVFLEAFHALCIYYIRNNLMDRF-NKD
        Y++MA G CI+GFLNC   VIVVD THLHEKYKDGENDASW WFMTHLKA I D+PNL+IISD H SI KAV S+F +AFHALCIY+IRNNL+D+F NK+
Subjt:  YIYMAFGPCIRGFLNCIRSVIVVDSTHLHEKYKDGENDASWLWFMTHLKALIGDIPNLVIISDRHKSIGKAVLSVFLEAFHALCIYYIRNNLMDRF-NKD

Query:  IILHFYLAAKAYRI
         I HFYLAAK YR+
Subjt:  IILHFYLAAKAYRI

TrEMBL top hitse value%identityAlignment
A0A2I7YUI4 MuDRA-like transposase1.6e-6934.83Show/hide
Query:  SKSFKVTQYPSVHTCSLEIRTGSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEAFKL
        S  FK+ +Y  VH+CSL++    H+Q KSWV+G LIKSK++ VG+ Y PRDIIED+ +DY +N+SY +A+R RE A    RG  + SY L+  YGEA KL
Subjt:  SKSFKVTQYPSVHTCSLEIRTGSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEAFKL

Query:  ANPGTVFVVEVEDERFFKYIYMAFGPCIRGFLNCIRSVIVVDSTHLHEKYK-----------------------DGENDASWLWFMTHLKALIGDIPNLV
        AN GT+F +E+ED RFFKY++MA GPC+RGFLNCIR VIV+D T L  KY+                       D E DAS  WF+  LK  IG++PNL 
Subjt:  ANPGTVFVVEVEDERFFKYIYMAFGPCIRGFLNCIRSVIVVDSTHLHEKYK-----------------------DGENDASWLWFMTHLKALIGDIPNLV

Query:  IISDRHKSIGKAVLSVFLEAFHALCIYYIRNNLMDRFNKDIILH-FYLAAKAYR----------------------------------------------
         ++DR     K + SVF  AFH LC+ ++  NL D++  D I   FY A++ YR                                              
Subjt:  IISDRHKSIGKAVLSVFLEAFHALCIYYIRNNLMDRFNKDIILH-FYLAAKAYR----------------------------------------------

Query:  -------------------------------------------------------------ITYRVSPMDMYVLNVDDGHLGRLVDLCSRTYTCMEFNCM
                                                                     +T +V+P+D Y  +V D     +++L ++  TC EF   
Subjt:  -------------------------------------------------------------ITYRVSPMDMYVLNVDDGHLGRLVDLCSRTYTCMEFNCM

Query:  EIPCSHALAAAAMRNINVQTLCAKWFNEEHVLAAYAKLIYPVDDK
        ++PC+HA+A A  RNINV +LCA ++  E +LAAY++ +YPV ++
Subjt:  EIPCSHALAAAAMRNINVQTLCAKWFNEEHVLAAYAKLIYPVDDK

A0A5A7SXK9 MuDRA-like transposase1.2e-6935.06Show/hide
Query:  SKSFKVTQYPSVHTCSLEIRTGSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEAFKL
        S  FK+ +Y  VH+CSLE     H+Q KSWV+G LIKSK++  G+ Y PRDIIED+ +DY +N+SY +A+R RE A    RGS + SY L+  YGEA K 
Subjt:  SKSFKVTQYPSVHTCSLEIRTGSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEAFKL

Query:  ANPGTVFVVEVEDERFFKYIYMAFGPCIRGFLNCIRSVIVVDSTHLHEKYK-----------------------DGENDASWLWFMTHLKALIGDIPNLV
         NPGT+F +E+ED+RFFKY++MA G C+RGFLNCIR VIV+D T L  KY+                       D E D S  WF+  LK  IG++PNL 
Subjt:  ANPGTVFVVEVEDERFFKYIYMAFGPCIRGFLNCIRSVIVVDSTHLHEKYK-----------------------DGENDASWLWFMTHLKALIGDIPNLV

Query:  IISDRHKSIGKAVLSVFLEAFHALCIYYIRNNLMDRFNKDIILH-FYLAAKAYR----------------------------------------------
         ++DR     K + SVF  AFH LC+ ++  NL D++  D +   FY A++ YR                                              
Subjt:  IISDRHKSIGKAVLSVFLEAFHALCIYYIRNNLMDRFNKDIILH-FYLAAKAYR----------------------------------------------

Query:  -------------------------------------------------------------ITYRVSPMDMYVLNVDDGHLGRLVDLCSRTYTCMEFNCM
                                                                     +T +V+P+D Y  +V D     +V+L ++  TC EF   
Subjt:  -------------------------------------------------------------ITYRVSPMDMYVLNVDDGHLGRLVDLCSRTYTCMEFNCM

Query:  EIPCSHALAAAAMRNINVQTLCAKWFNEEHVLAAYAKLIYPVDDK
        ++PCSH +AAA  RNINV +LCA ++  E +LAAYAK +YPV ++
Subjt:  EIPCSHALAAAAMRNINVQTLCAKWFNEEHVLAAYAKLIYPVDDK

A0A5A7T0Q4 MuDRA-like transposase2.8e-6938.46Show/hide
Query:  KVTQYPSVHTCSLEIRTGSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEAFKLANPG
        K+ +Y  VH+CSLE     H+Q KSWV+G LIKSK++  G+ Y PRDIIED+ +DY +N+SY +A+R RE A    RGS + SY L+  YGEA K  NPG
Subjt:  KVTQYPSVHTCSLEIRTGSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEAFKLANPG

Query:  TVFVVEVEDERFFKYIYMAFGPCIRGFLNCIRSVIVVDSTHLHEKYK-----------------------DGENDASWLWFMTHLKALIGDIPNLVIISD
        T+F +E+ED+RFFKY++MA G C+RGFLNCIR VIV+D T L  KY+                       D E D S  WF+  LK  IG++ NL  ++D
Subjt:  TVFVVEVEDERFFKYIYMAFGPCIRGFLNCIRSVIVVDSTHLHEKYK-----------------------DGENDASWLWFMTHLKALIGDIPNLVIISD

Query:  RHKSIGKAVLSVFLEAFHALCIYYIRNNLMDRFNKDIILH-FYLAAKAY--------------------------------RITYRVSPMDMYVLNVDDG
        +     K + SVF  AFH LC+ ++  NL D++  D +   FY A++ Y                                R+ Y     +M   N+ + 
Subjt:  RHKSIGKAVLSVFLEAFHALCIYYIRNNLMDRFNKDIILH-FYLAAKAY--------------------------------RITYRVSPMDMYVLNVDDG

Query:  -------------------------HLGR--LVDLCSRTYTCMEFNCMEIPCSHALAAAAMRNINVQTLCAKWFNEEHVLAAYAKLIYPV
                                 HL +  +V+L ++  TC EF   ++PCSHA+AAA  RNINV +LCA ++  E +LAAYA+ +YPV
Subjt:  -------------------------HLGR--LVDLCSRTYTCMEFNCMEIPCSHALAAAAMRNINVQTLCAKWFNEEHVLAAYAKLIYPV

A0A5D3BSX9 MuDRA-like transposase3.6e-6934.83Show/hide
Query:  SKSFKVTQYPSVHTCSLEIRTGSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEAFKL
        S  FK+ +Y  VH+CSLE     H+Q KSWV+G LIKSK++  G+ Y PRDIIED+ +DY +N+SY +A+R RE A    RGS + SY L+  YGEA K 
Subjt:  SKSFKVTQYPSVHTCSLEIRTGSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEAFKL

Query:  ANPGTVFVVEVEDERFFKYIYMAFGPCIRGFLNCIRSVIVVDSTHLHEKYK-----------------------DGENDASWLWFMTHLKALIGDIPNLV
         N GT+F +E+ED+RFFKY++MA G C+RGFLNCIR VIV+D T L  KY+                       D E D S  WF+  LK  IG++PNL 
Subjt:  ANPGTVFVVEVEDERFFKYIYMAFGPCIRGFLNCIRSVIVVDSTHLHEKYK-----------------------DGENDASWLWFMTHLKALIGDIPNLV

Query:  IISDRHKSIGKAVLSVFLEAFHALCIYYIRNNLMDRFNKDIILH-FYLAAKAYR----------------------------------------------
         ++DR     K + SVF  AFH LC+ ++  NL D++  D +   FY A++ YR                                              
Subjt:  IISDRHKSIGKAVLSVFLEAFHALCIYYIRNNLMDRFNKDIILH-FYLAAKAYR----------------------------------------------

Query:  -------------------------------------------------------------ITYRVSPMDMYVLNVDDGHLGRLVDLCSRTYTCMEFNCM
                                                                     +T +V+P+D Y  +V D     +V+L ++  TC EF   
Subjt:  -------------------------------------------------------------ITYRVSPMDMYVLNVDDGHLGRLVDLCSRTYTCMEFNCM

Query:  EIPCSHALAAAAMRNINVQTLCAKWFNEEHVLAAYAKLIYPVDDK
        ++PCSHA+AAA +RNINV +LCA ++  E +LAAYA+ +YPV ++
Subjt:  EIPCSHALAAAAMRNINVQTLCAKWFNEEHVLAAYAKLIYPVDDK

Q5GIT1 MuDRA transposase-like3.6e-6934.83Show/hide
Query:  SKSFKVTQYPSVHTCSLEIRTGSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEAFKL
        S  FK+ +Y  VH+CSLE     H+Q KSWV+G LIKSK++  G+ Y PRDIIED+ +DY +N+SY +A+R RE A    RGS + SY L+  YGEA K 
Subjt:  SKSFKVTQYPSVHTCSLEIRTGSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEAFKL

Query:  ANPGTVFVVEVEDERFFKYIYMAFGPCIRGFLNCIRSVIVVDSTHLHEKYK-----------------------DGENDASWLWFMTHLKALIGDIPNLV
         N GT+F +E+ED+RFFKY++MA G C+RGFLNCIR VIV+D T L  KY+                       D E D S  WF+  LK  IG++PNL 
Subjt:  ANPGTVFVVEVEDERFFKYIYMAFGPCIRGFLNCIRSVIVVDSTHLHEKYK-----------------------DGENDASWLWFMTHLKALIGDIPNLV

Query:  IISDRHKSIGKAVLSVFLEAFHALCIYYIRNNLMDRFNKDIILH-FYLAAKAYR----------------------------------------------
         ++DR     K + SVF  AFH LC+ ++  NL D++  D +   FY A++ YR                                              
Subjt:  IISDRHKSIGKAVLSVFLEAFHALCIYYIRNNLMDRFNKDIILH-FYLAAKAYR----------------------------------------------

Query:  -------------------------------------------------------------ITYRVSPMDMYVLNVDDGHLGRLVDLCSRTYTCMEFNCM
                                                                     +T +V+P+D Y  +V D     +V+L ++  TC EF   
Subjt:  -------------------------------------------------------------ITYRVSPMDMYVLNVDDGHLGRLVDLCSRTYTCMEFNCM

Query:  EIPCSHALAAAAMRNINVQTLCAKWFNEEHVLAAYAKLIYPVDDK
        ++PCSHA+AAA +RNINV +LCA ++  E +LAAYA+ +YPV ++
Subjt:  EIPCSHALAAAAMRNINVQTLCAKWFNEEHVLAAYAKLIYPVDDK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATTGAGCAAATCCTTCAAGGTGACTCAGTATCCGAGTGTCCATACATGTTCACTTGAAATAAGGACAGGTAGCCATCAACAAACCAAGAGTTGGGTTATT
GGACACTTAATCAAGTCCAAGTACGAGCAAGTAGGCCAGACATACTGCCCAAGAGATATAATTGAAGATGTCTGTCGAGATTATAGTGTGAATATAAGCTATGGG
CGGGCTTACCGTGAAAGGGAGTATGCATTGGTGTGTACACGAGGGTCGTCAAAAGGATCGTATGCTCTTGTGAATGCATATGGTGAGGCATTTAAACTTGCAAAC
CCAGGCACAGTGTTTGTGGTTGAAGTAGAAGATGAGCGGTTCTTTAAGTACATCTACATGGCTTTTGGTCCGTGCATTAGGGGATTCTTGAACTGCATCCGTTCT
GTAATTGTTGTGGACAGTACCCATTTGCATGAGAAGTATAAAGACGGTGAGAACGATGCATCTTGGCTTTGGTTCATGACCCACTTGAAGGCATTGATCGGAGAC
ATCCCTAACTTGGTGATTATATCAGATCGTCACAAATCCATTGGGAAGGCTGTATTAAGTGTATTCCTAGAGGCATTCCATGCACTGTGTATATACTACATTCGG
AATAACTTGATGGATAGGTTCAATAAGGACATAATCCTGCATTTCTATCTTGCTGCCAAGGCTTACAGGATAACGTATCGGGTATCTCCAATGGACATGTATGTA
CTGAATGTTGATGATGGACATCTTGGTAGGTTGGTTGATCTCTGTTCACGAACATATACGTGCATGGAGTTCAATTGCATGGAGATTCCATGCTCACATGCATTA
GCTGCAGCAGCTATGCGGAACATTAACGTCCAAACATTGTGCGCAAAGTGGTTCAATGAAGAGCATGTGCTTGCCGCGTATGCCAAACTGATATACCCAGTCGAC
GACAAACCGTCAAAATACCGTCTATTGGTGAAGAGCCATAACAAGTACATAAGTGTACACGATGCGATTAGGAAGGACACAACAGAAAAACATGCAAGTAGCCAC
TGA
mRNA sequenceShow/hide mRNA sequence
ATGGAATTGAGCAAATCCTTCAAGGTGACTCAGTATCCGAGTGTCCATACATGTTCACTTGAAATAAGGACAGGTAGCCATCAACAAACCAAGAGTTGGGTTATT
GGACACTTAATCAAGTCCAAGTACGAGCAAGTAGGCCAGACATACTGCCCAAGAGATATAATTGAAGATGTCTGTCGAGATTATAGTGTGAATATAAGCTATGGG
CGGGCTTACCGTGAAAGGGAGTATGCATTGGTGTGTACACGAGGGTCGTCAAAAGGATCGTATGCTCTTGTGAATGCATATGGTGAGGCATTTAAACTTGCAAAC
CCAGGCACAGTGTTTGTGGTTGAAGTAGAAGATGAGCGGTTCTTTAAGTACATCTACATGGCTTTTGGTCCGTGCATTAGGGGATTCTTGAACTGCATCCGTTCT
GTAATTGTTGTGGACAGTACCCATTTGCATGAGAAGTATAAAGACGGTGAGAACGATGCATCTTGGCTTTGGTTCATGACCCACTTGAAGGCATTGATCGGAGAC
ATCCCTAACTTGGTGATTATATCAGATCGTCACAAATCCATTGGGAAGGCTGTATTAAGTGTATTCCTAGAGGCATTCCATGCACTGTGTATATACTACATTCGG
AATAACTTGATGGATAGGTTCAATAAGGACATAATCCTGCATTTCTATCTTGCTGCCAAGGCTTACAGGATAACGTATCGGGTATCTCCAATGGACATGTATGTA
CTGAATGTTGATGATGGACATCTTGGTAGGTTGGTTGATCTCTGTTCACGAACATATACGTGCATGGAGTTCAATTGCATGGAGATTCCATGCTCACATGCATTA
GCTGCAGCAGCTATGCGGAACATTAACGTCCAAACATTGTGCGCAAAGTGGTTCAATGAAGAGCATGTGCTTGCCGCGTATGCCAAACTGATATACCCAGTCGAC
GACAAACCGTCAAAATACCGTCTATTGGTGAAGAGCCATAACAAGTACATAAGTGTACACGATGCGATTAGGAAGGACACAACAGAAAAACATGCAAGTAGCCAC
TGA
Protein sequenceShow/hide protein sequence
MELSKSFKVTQYPSVHTCSLEIRTGSHQQTKSWVIGHLIKSKYEQVGQTYCPRDIIEDVCRDYSVNISYGRAYREREYALVCTRGSSKGSYALVNAYGEAFKLAN
PGTVFVVEVEDERFFKYIYMAFGPCIRGFLNCIRSVIVVDSTHLHEKYKDGENDASWLWFMTHLKALIGDIPNLVIISDRHKSIGKAVLSVFLEAFHALCIYYIR
NNLMDRFNKDIILHFYLAAKAYRITYRVSPMDMYVLNVDDGHLGRLVDLCSRTYTCMEFNCMEIPCSHALAAAAMRNINVQTLCAKWFNEEHVLAAYAKLIYPVD
DKPSKYRLLVKSHNKYISVHDAIRKDTTEKHASSH