; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G21094 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G21094
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionSnoaL-like domain-containing protein
Genome locationctg910:1537419..1539213
RNA-Seq ExpressionCucsat.G21094
SyntenyCucsat.G21094
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR032710 - NTF2-like domain superfamily
IPR037401 - SnoaL-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008437322.1 PREDICTED: uncharacterized protein LOC103482777 isoform X1 [Cucumis melo]3.78e-15684.5Show/hide
Query:  MSLITSPQAVNFGGSQSFRRFSSYTFLNKRTSCIFQQKKNYGNYNKRKTNNTLVLSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLKEMSTYISEDC
        MSLITSPQAVNFGGS SFRRFS   FLNKRTSCIFQQKKNYGNYNKRKTN  LV SCLMDDSFS   SSSNSPGEMIE FYKCINEKNLK+M+TYISEDC
Subjt:  MSLITSPQAVNFGGSQSFRRFSSYTFLNKRTSCIFQQKKNYGNYNKRKTNNTLVLSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLKEMSTYISEDC

Query:  LIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQKIQIINEPQFKAGHLILDIM
        LIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIR VYER PSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQ +QII E Q KAGHL L IM
Subjt:  LIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQKIQIINEPQFKAGHLILDIM

Query:  KLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQLYSCVLKFLYYIVD
        KLVTLLLAK+ AILEWL K SQQRWVK +SKIC+ LF  LLD+F KSYLTFIH G QLYS VL FL Y+++
Subjt:  KLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQLYSCVLKFLYYIVD

XP_008437324.1 PREDICTED: uncharacterized protein LOC103482777 isoform X2 [Cucumis melo]1.12e-11883.57Show/hide
Query:  MDDSFSCPGSSSNSPGEMIERFYKCINEKNLKEMSTYISEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNM
        MDDSFSC GSSSNSPGEMIE FYKCINEKNLKEMSTYISEDCLIED+LF EKFKGKKAAMSFIEKLTESMGPD+KFRIRKVYER PS A AIWHLEWRNM
Subjt:  MDDSFSCPGSSSNSPGEMIERFYKCINEKNLKEMSTYISEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNM

Query:  EIPLTKGCTFIDIRDEERKTIQKIQIINEPQFKAGHLILDIMKLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQL
        EIPLTKGCTFIDIRDEERKTIQ +QII E Q KAGHL L IMKLVTLLLAK+ AILEWL K SQQRWVK +SKIC+ LF  LLD+F KSYLTFIH G QL
Subjt:  EIPLTKGCTFIDIRDEERKTIQKIQIINEPQFKAGHLILDIMKLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQL

Query:  YSCVLKFLYYIVD
        YS VL FL Y+++
Subjt:  YSCVLKFLYYIVD

XP_011654728.2 uncharacterized protein LOC101214565 [Cucumis sativus]1.28e-195100Show/hide
Query:  MSLITSPQAVNFGGSQSFRRFSSYTFLNKRTSCIFQQKKNYGNYNKRKTNNTLVLSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLKEMSTYISEDC
        MSLITSPQAVNFGGSQSFRRFSSYTFLNKRTSCIFQQKKNYGNYNKRKTNNTLVLSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLKEMSTYISEDC
Subjt:  MSLITSPQAVNFGGSQSFRRFSSYTFLNKRTSCIFQQKKNYGNYNKRKTNNTLVLSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLKEMSTYISEDC

Query:  LIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQKIQIINEPQFKAGHLILDIM
        LIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQKIQIINEPQFKAGHLILDIM
Subjt:  LIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQKIQIINEPQFKAGHLILDIM

Query:  KLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQLYSCVLKFLYYIVDFFL
        KLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQLYSCVLKFLYYIVDFFL
Subjt:  KLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQLYSCVLKFLYYIVDFFL

XP_022970084.1 uncharacterized protein LOC111469081 [Cucurbita maxima]2.81e-9758.66Show/hide
Query:  MSLITSP--QAVNFGGSQSFRRFSSYTFLNKRTSCIFQQKKNYGNYNKRKTNNTLV--------LSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLK
        M+LITSP  QA+  G SQ FR F+ YT + K+ S +FQQKK       RKT+ TLV        LSCL D S     S SNSP EM+++FY+CINEK LK
Subjt:  MSLITSP--QAVNFGGSQSFRRFSSYTFLNKRTSCIFQQKKNYGNYNKRKTNNTLV--------LSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLK

Query:  EMSTYISEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQKIQIINEPQF
        E+S+YISEDCLIEDSLF E F GK+AA+ F ++LT+SMGPDVKFR R VYE   S AGA WHL W+N +IP TKGCTFIDI +EER TIQK QII EPQ 
Subjt:  EMSTYISEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQKIQIINEPQF

Query:  KAGHLILDIMKLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQLYSCVLKFLYYIVDFF
        KAGHLIL +MKLVT LLA+  AI +W++K SQQRWV+W++KICV L+   L S  +SYLTFIH G+ ++   +K L +++ FF
Subjt:  KAGHLILDIMKLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQLYSCVLKFLYYIVDFF

XP_038906943.1 uncharacterized protein LOC120092809 [Benincasa hispida]7.08e-13875.8Show/hide
Query:  MSLITSP---QAVNFGGSQSFRRFSSYTFLNKRTSCIFQQKKNYGNYNKRKTNNTL--------VLSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNL
        MSLI SP   QAVN GGSQS RRFS  TFL KR+SCI QQKKNYGN  K+KTNN L        V SCL DDSFS   S SNSP EMIERFYKCINEKNL
Subjt:  MSLITSP---QAVNFGGSQSFRRFSSYTFLNKRTSCIFQQKKNYGNYNKRKTNNTL--------VLSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNL

Query:  KEMSTYISEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQKIQIINEPQ
        KE+S+YISEDC IEDSLF+E F GKKAA+ F E+LT SMGPDVKFRI  +YER  S  GAIWHLEW+NMEIP TKGCTFIDIR+EERKTIQK QII EPQ
Subjt:  KEMSTYISEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQKIQIINEPQ

Query:  FKAGHLILDIMKLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQLYSCVLKFLYYIV
         KAGHLIL IMKLVTLLLAK  AI EWLIK SQQRWVKWMSKIC+ LF LLLDSF KSYLTFIHFGA+LYS VLKFL+Y++
Subjt:  FKAGHLILDIMKLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQLYSCVLKFLYYIV

TrEMBL top hitse value%identityAlignment
A0A0A0KMI4 SnoaL-like domain-containing protein3.50e-187100Show/hide
Query:  MSLITSPQAVNFGGSQSFRRFSSYTFLNKRTSCIFQQKKNYGNYNKRKTNNTLVLSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLKEMSTYISEDC
        MSLITSPQAVNFGGSQSFRRFSSYTFLNKRTSCIFQQKKNYGNYNKRKTNNTLVLSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLKEMSTYISEDC
Subjt:  MSLITSPQAVNFGGSQSFRRFSSYTFLNKRTSCIFQQKKNYGNYNKRKTNNTLVLSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLKEMSTYISEDC

Query:  LIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQKIQIINEPQFKAGHLILDIM
        LIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQKIQIINEPQFKAGHLILDIM
Subjt:  LIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQKIQIINEPQFKAGHLILDIM

Query:  KLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQLYSC
        KLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQLYSC
Subjt:  KLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQLYSC

A0A1S3ATV9 uncharacterized protein LOC103482777 isoform X11.83e-15684.5Show/hide
Query:  MSLITSPQAVNFGGSQSFRRFSSYTFLNKRTSCIFQQKKNYGNYNKRKTNNTLVLSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLKEMSTYISEDC
        MSLITSPQAVNFGGS SFRRFS   FLNKRTSCIFQQKKNYGNYNKRKTN  LV SCLMDDSFS   SSSNSPGEMIE FYKCINEKNLK+M+TYISEDC
Subjt:  MSLITSPQAVNFGGSQSFRRFSSYTFLNKRTSCIFQQKKNYGNYNKRKTNNTLVLSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLKEMSTYISEDC

Query:  LIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQKIQIINEPQFKAGHLILDIM
        LIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIR VYER PSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQ +QII E Q KAGHL L IM
Subjt:  LIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQKIQIINEPQFKAGHLILDIM

Query:  KLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQLYSCVLKFLYYIVD
        KLVTLLLAK+ AILEWL K SQQRWVK +SKIC+ LF  LLD+F KSYLTFIH G QLYS VL FL Y+++
Subjt:  KLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQLYSCVLKFLYYIVD

A0A1S3AUC2 uncharacterized protein LOC103482777 isoform X25.40e-11983.57Show/hide
Query:  MDDSFSCPGSSSNSPGEMIERFYKCINEKNLKEMSTYISEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNM
        MDDSFSC GSSSNSPGEMIE FYKCINEKNLKEMSTYISEDCLIED+LF EKFKGKKAAMSFIEKLTESMGPD+KFRIRKVYER PS A AIWHLEWRNM
Subjt:  MDDSFSCPGSSSNSPGEMIERFYKCINEKNLKEMSTYISEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNM

Query:  EIPLTKGCTFIDIRDEERKTIQKIQIINEPQFKAGHLILDIMKLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQL
        EIPLTKGCTFIDIRDEERKTIQ +QII E Q KAGHL L IMKLVTLLLAK+ AILEWL K SQQRWVK +SKIC+ LF  LLD+F KSYLTFIH G QL
Subjt:  EIPLTKGCTFIDIRDEERKTIQKIQIINEPQFKAGHLILDIMKLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQL

Query:  YSCVLKFLYYIVD
        YS VL FL Y+++
Subjt:  YSCVLKFLYYIVD

A0A6J1ELY0 uncharacterized protein LOC1114357471.38e-9257.24Show/hide
Query:  MSLITSP--QAVNFGGSQSFRRFSSYTFLNKRTSCIFQQKKNYGNYNKRKTNNTL--------VLSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLK
        M+LITSP  QA+  G SQ FR F+ YT + KR S +FQQKK       RKT+ TL        V SCL D S     S SNSP EM+++ Y+CINEK LK
Subjt:  MSLITSP--QAVNFGGSQSFRRFSSYTFLNKRTSCIFQQKKNYGNYNKRKTNNTL--------VLSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLK

Query:  EMSTYISEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQKIQIINEPQF
        E+S+Y+SEDCLIEDSLF E F GK+AA+ F ++LT+SMGPDVKFR R VYE   S AG  WHL W+N +IP TKGCTFI I +E+R TIQK QII EPQ 
Subjt:  EMSTYISEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQKIQIINEPQF

Query:  KAGHLILDIMKLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQLYSCVLKFLYYIVDFF
        KAGHLIL +MKLVT LLA+  AI +W++K SQQRWV+W++KICV L+N  L S  +SYLTFIH  + ++   LK L ++  FF
Subjt:  KAGHLILDIMKLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQLYSCVLKFLYYIVDFF

A0A6J1I4H1 uncharacterized protein LOC1114690811.36e-9758.66Show/hide
Query:  MSLITSP--QAVNFGGSQSFRRFSSYTFLNKRTSCIFQQKKNYGNYNKRKTNNTLV--------LSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLK
        M+LITSP  QA+  G SQ FR F+ YT + K+ S +FQQKK       RKT+ TLV        LSCL D S     S SNSP EM+++FY+CINEK LK
Subjt:  MSLITSP--QAVNFGGSQSFRRFSSYTFLNKRTSCIFQQKKNYGNYNKRKTNNTLV--------LSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLK

Query:  EMSTYISEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQKIQIINEPQF
        E+S+YISEDCLIEDSLF E F GK+AA+ F ++LT+SMGPDVKFR R VYE   S AGA WHL W+N +IP TKGCTFIDI +EER TIQK QII EPQ 
Subjt:  EMSTYISEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQKIQIINEPQF

Query:  KAGHLILDIMKLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQLYSCVLKFLYYIVDFF
        KAGHLIL +MKLVT LLA+  AI +W++K SQQRWV+W++KICV L+   L S  +SYLTFIH G+ ++   +K L +++ FF
Subjt:  KAGHLILDIMKLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQLYSCVLKFLYYIVDFF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G71480.1 Nuclear transport factor 2 (NTF2) family protein4.1e-2031.16Show/hide
Query:  MSLITSPQAVNFGGSQSFRRFSSYTFLNKRTSCIFQQ----KKNYGNYNKRKTNNTLVLSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLKEMSTYI
        +S +TSP  V+         F   T L K TS    Q      +YG   K  T N +V           P ++  S  E++  FY  +N  +L  ++  I
Subjt:  MSLITSPQAVNFGGSQSFRRFSSYTFLNKRTSCIFQQ----KKNYGNYNKRKTNNTLVLSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLKEMSTYI

Query:  SEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTF--IDIRDEERKTIQKIQIINEPQFKAGH
        ++DC+ ED +F   F G+KA + F  K  ES   D++F I  +     S  G  WHLEW+    P +KGC+F  +++ D +R+ +     + EP  K G 
Subjt:  SEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTF--IDIRDEERKTIQKIQIINEPQFKAGH

Query:  LILDIMKLVTLLLAK
         +L  +K VT LL K
Subjt:  LILDIMKLVTLLLAK

AT5G41470.1 Nuclear transport factor 2 (NTF2) family protein7.2e-2533.48Show/hide
Query:  NKRKTNNTLVLSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLKEMSTYISEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHP
        NK  +   +V+SCL D   S P   S    + + +FY  INEKN  ++S+ IS DC I+D  F + F+GK+ AM F E+L +SMG +VKF +  V E   
Subjt:  NKRKTNNTLVLSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLKEMSTYISEDCLIEDSLFIEKFKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHP

Query:  SMAGAIWHLEWRNMEIPLTKGCTFIDIRDE-ERKTIQKIQIINEPQFKAGHLILDIMKLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDS
          A   WHLEW+  +IP T+GC+F +  DE  R  I+  +I+ E   K G + L ++K +T L  +     E L        ++   +I       L++ 
Subjt:  SMAGAIWHLEWRNMEIPLTKGCTFIDIRDE-ERKTIQKIQIINEPQFKAGHLILDIMKLVTLLLAKNSAILEWLIKASQQRWVKWMSKICVTLFNLLLDS

Query:  FSKSYLTFIHFGAQLYSCVLKFLYYIVDFF
           SYL  +   A+ +  V+K +  I + F
Subjt:  FSKSYLTFIHFGAQLYSCVLKFLYYIVDFF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCCTTATTACAAGTCCTCAAGCTGTCAACTTTGGGGGCTCCCAATCATTTAGGAGATTTTCTTCTTATACATTCTTGAATAAAAGAACTTCATGTATATTCCAACA
AAAGAAGAACTATGGAAACTACAACAAGAGGAAAACAAACAACACACTCGTCCTATCATGTTTAATGGATGATTCCTTTTCCTGCCCAGGTTCAAGTTCAAATTCTCCGG
GAGAAATGATTGAGAGATTCTACAAATGCATCAATGAAAAGAACTTAAAGGAAATGAGTACTTACATCTCAGAAGACTGCCTCATTGAAGACTCCTTGTTCATTGAAAAA
TTTAAAGGGAAGAAGGCAGCTATGAGTTTCATTGAAAAACTAACTGAGAGCATGGGTCCAGATGTGAAGTTTAGAATCCGTAAAGTATACGAAAGACACCCTTCCATGGC
AGGAGCAATCTGGCATTTAGAGTGGAGGAACATGGAGATTCCCTTAACCAAGGGTTGCACCTTCATTGACATCAGAGATGAAGAAAGAAAAACTATACAGAAGATACAAA
TTATAAATGAACCACAATTTAAAGCAGGACATCTAATCTTGGATATAATGAAACTTGTGACTTTATTGCTTGCTAAGAATTCAGCAATTTTAGAATGGTTGATAAAAGCT
TCTCAACAACGTTGGGTAAAGTGGATGTCAAAGATCTGTGTAACTCTCTTCAATCTTCTCTTGGACAGCTTTTCGAAGAGCTATCTAACCTTTATACATTTTGGGGCTCA
ACTATATAGTTGTGTACTCAAATTTTTATATTATATTGTAGATTTTTTCTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCCCTTATTACAAGTCCTCAAGCTGTCAACTTTGGGGGCTCCCAATCATTTAGGAGATTTTCTTCTTATACATTCTTGAATAAAAGAACTTCATGTATATTCCAACA
AAAGAAGAACTATGGAAACTACAACAAGAGGAAAACAAACAACACACTCGTCCTATCATGTTTAATGGATGATTCCTTTTCCTGCCCAGGTTCAAGTTCAAATTCTCCGG
GAGAAATGATTGAGAGATTCTACAAATGCATCAATGAAAAGAACTTAAAGGAAATGAGTACTTACATCTCAGAAGACTGCCTCATTGAAGACTCCTTGTTCATTGAAAAA
TTTAAAGGGAAGAAGGCAGCTATGAGTTTCATTGAAAAACTAACTGAGAGCATGGGTCCAGATGTGAAGTTTAGAATCCGTAAAGTATACGAAAGACACCCTTCCATGGC
AGGAGCAATCTGGCATTTAGAGTGGAGGAACATGGAGATTCCCTTAACCAAGGGTTGCACCTTCATTGACATCAGAGATGAAGAAAGAAAAACTATACAGAAGATACAAA
TTATAAATGAACCACAATTTAAAGCAGGACATCTAATCTTGGATATAATGAAACTTGTGACTTTATTGCTTGCTAAGAATTCAGCAATTTTAGAATGGTTGATAAAAGCT
TCTCAACAACGTTGGGTAAAGTGGATGTCAAAGATCTGTGTAACTCTCTTCAATCTTCTCTTGGACAGCTTTTCGAAGAGCTATCTAACCTTTATACATTTTGGGGCTCA
ACTATATAGTTGTGTACTCAAATTTTTATATTATATTGTAGATTTTTTCTTGTAA
Protein sequenceShow/hide protein sequence
MSLITSPQAVNFGGSQSFRRFSSYTFLNKRTSCIFQQKKNYGNYNKRKTNNTLVLSCLMDDSFSCPGSSSNSPGEMIERFYKCINEKNLKEMSTYISEDCLIEDSLFIEK
FKGKKAAMSFIEKLTESMGPDVKFRIRKVYERHPSMAGAIWHLEWRNMEIPLTKGCTFIDIRDEERKTIQKIQIINEPQFKAGHLILDIMKLVTLLLAKNSAILEWLIKA
SQQRWVKWMSKICVTLFNLLLDSFSKSYLTFIHFGAQLYSCVLKFLYYIVDFFL