CuGenDBv2

Clc01G02550 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc01G02550
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: SnoaL-like domain-containing protein
Genome location: ClcChr01:2262163..2264163
RNA-Seq Expression: Clc01G02550
Synteny: Clc01G02550
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR032710 - NTF2-like domain superfamily
                  IPR037401 - SnoaL-like domain
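
The genome location in this record follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be parsed and the span length computed as shown here. `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re

# Pattern for locations written as "seqid:start..end" (assumed format).
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    # Normalise so that start <= end, whatever order was written.
    if start > end:
        start, end = end, start
    return match["seqid"], start, end


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("ClcChr01:2262163..2264163")
print(seqid, start, end, span_length(start, end))
# prints: ClcChr01 2262163 2264163 2001
```

Under that assumption the gene spans 2001 bp on ClcChr01.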


Homology
GenBank top hits (e value, % identity, alignment)
XP_008437322.1 PREDICTED: uncharacterized protein LOC103482777 isoform X1 [Cucumis melo] (e value: 3.0e-100; % identity: 71.48)
Query:  MALIISPPPQVVNLGSSQPLTRLSCT-FLTKRTSCILQQEKNYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEKNLK
        M+LI S  PQ VN G S    R S T FL KRTSCI QQ+KNYGN  KRK N RL        VSSCL +DSFS+  SS+NSP EMIE FYKCINEKNLK
Subjt:  MALIISPPPQVVNLGSSQPLTRLSCT-FLTKRTSCILQQEKNYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEKNLK

Query:  ELSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIEPQV
        ++++YISE CLIEDSLFIE   GKKAA+SF E+L  SMGPDVKFRI  VYER  S AGAIWHLEW+NM+IP TKGCTFIDIR+EERKTIQ  QII E Q+
Subjt:  ELSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIEPQV

Query:  KAGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHLGAQVYTNVLKFLHYVIESFK
        KAGHL LA MKLVTLLLAK+PAI EWL KVSQQRWVK +SKICI LFK LLD+FLKSYLTFIHLG Q+Y++VL FL YVIE FK
Subjt:  KAGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHLGAQVYTNVLKFLHYVIESFK

XP_011654728.2 uncharacterized protein LOC101214565 [Cucumis sativus] (e value: 2.4e-102; % identity: 72.44)
Query:  MALIISPPPQVVNLGSSQPLTRLSC-TFLTKRTSCILQQEKNYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEKNLK
        M+LI S  PQ VN G SQ   R S  TFL KRTSCI QQ+KNYGN  KRK N+ L        V SCL +DSFS  GSS+NSP EMIERFYKCINEKNLK
Subjt:  MALIISPPPQVVNLGSSQPLTRLSC-TFLTKRTSCILQQEKNYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEKNLK

Query:  ELSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIEPQV
        E+S+YISE CLIEDSLFIE   GKKAA+SF E+L  SMGPDVKFRI  VYER  S AGAIWHLEW+NM+IP TKGCTFIDIR+EERKTIQK QII EPQ 
Subjt:  ELSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIEPQV

Query:  KAGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHLGAQVYTNVLKFLHYVIESF
        KAGHLIL  MKLVTLLLAK  AI EWLIK SQQRWVKWMSKIC+ LF LLLDSF KSYLTFIH GAQ+Y+ VLKFL+Y+++ F
Subjt:  KAGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHLGAQVYTNVLKFLHYVIESF

XP_022970084.1 uncharacterized protein LOC111469081 [Cucurbita maxima] (e value: 6.9e-89; % identity: 63.12)
Query:  MALIISPPPQVVNLGSSQPLTRLSCTFLTKRTSCILQQEKNYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEKNLKE
        MALI SPPPQ + LG SQP    + T + K+ S + QQ+      K RK +  L TDVK  FV SCL + S   L S +NSPSEM+++FY+CINEK LKE
Subjt:  MALIISPPPQVVNLGSSQPLTRLSCTFLTKRTSCILQQEKNYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEKNLKE

Query:  LSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIEPQVK
        LSSYISE CLIEDSLF E  IGK+AAL FF+EL  SMGPDVKFR  NVYE G S AGA WHL WKN KIPFTKGCTFIDI NEER TIQKAQII+EPQVK
Subjt:  LSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIEPQVK

Query:  AGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHLGAQVYTNVLKFLHYVIESF
        AGHLIL  MKLVT LLA+YPAI +W++K+SQQRWV+W++KIC++L+K  L S ++SYLTFIH G+ ++   +K L +VI  F
Subjt:  AGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHLGAQVYTNVLKFLHYVIESF

XP_023550969.1 uncharacterized protein LOC111808949 [Cucurbita pepo subsp. pepo] (e value: 3.8e-87; % identity: 62.77)
Query:  MALIISPPPQVVNLGSSQPLTRLSCTFLTKRTSCILQQEKNYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEKNLKE
        MALI SPPPQ + LG SQP    + T + KR S + QQ+      K RK +  L+T+VK  FV SCL + S   L S + SPSEM+ + Y+CINEK LKE
Subjt:  MALIISPPPQVVNLGSSQPLTRLSCTFLTKRTSCILQQEKNYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEKNLKE

Query:  LSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIEPQVK
        LSSY+SE CLIEDSLF EP IGK+AAL FFEEL  SMG DVKFR  NVYE G S AGA WHL WKN KIPFTKGCTFIDI NE+R TIQKAQIIIEPQVK
Subjt:  LSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIEPQVK

Query:  AGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHLGAQVYTNVLKFLHYVIESF
        AGHLIL  MKLVT LLA+YPAI +W++K+S QRWV+W++KIC++L+K  L S L+SYLTFIH G+ ++   LK L +VI  F
Subjt:  AGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHLGAQVYTNVLKFLHYVIESF

XP_038906943.1 uncharacterized protein LOC120092809 [Benincasa hispida] (e value: 2.9e-135; % identity: 86.97)
Query:  MALIIS-PPPQVVNLGSSQPLTRLSCTFLTKRTSCILQQEKNYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEKNLK
        M+LIIS PPPQ VNLG SQ L R SCTFLTKR+SCILQQ+KNYGN KK+K N+RLSTDVKLHFVSSCL +DSFS L S +NSPSEMIERFYKCINEKNLK
Subjt:  MALIIS-PPPQVVNLGSSQPLTRLSCTFLTKRTSCILQQEKNYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEKNLK

Query:  ELSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIEPQV
        ELSSYISE C IEDSLF+EP IGKKAAL FFEEL HSMGPDVKFRIHN+YER VS  GAIWHLEWKNM+IPFTKGCTFIDIRNEERKTIQKAQIIIEPQ+
Subjt:  ELSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIEPQV

Query:  KAGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHLGAQVYTNVLKFLHYVIESFK
        KAGHLILA MKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIH GA++Y+NVLKFLHYVI+SFK
Subjt:  KAGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHLGAQVYTNVLKFLHYVIESFK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KMI4 SnoaL-like domain-containing protein (e value: 3.0e-98; % identity: 73.33)
Query:  MALIISPPPQVVNLGSSQPLTRLSC-TFLTKRTSCILQQEKNYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEKNLK
        M+LI S  PQ VN G SQ   R S  TFL KRTSCI QQ+KNYGN  KRK N+ L        V SCL +DSFS  GSS+NSP EMIERFYKCINEKNLK
Subjt:  MALIISPPPQVVNLGSSQPLTRLSC-TFLTKRTSCILQQEKNYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEKNLK

Query:  ELSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIEPQV
        E+S+YISE CLIEDSLFIE   GKKAA+SF E+L  SMGPDVKFRI  VYER  S AGAIWHLEW+NM+IP TKGCTFIDIR+EERKTIQK QII EPQ 
Subjt:  ELSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIEPQV

Query:  KAGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHLGAQVYT
        KAGHLIL  MKLVTLLLAK  AI EWLIK SQQRWVKWMSKIC+ LF LLLDSF KSYLTFIH GAQ+Y+
Subjt:  KAGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHLGAQVYT

A0A1S3ATV9 uncharacterized protein LOC103482777 isoform X1 (e value: 1.4e-100; % identity: 71.48)
Query:  MALIISPPPQVVNLGSSQPLTRLSCT-FLTKRTSCILQQEKNYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEKNLK
        M+LI S  PQ VN G S    R S T FL KRTSCI QQ+KNYGN  KRK N RL        VSSCL +DSFS+  SS+NSP EMIE FYKCINEKNLK
Subjt:  MALIISPPPQVVNLGSSQPLTRLSCT-FLTKRTSCILQQEKNYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEKNLK

Query:  ELSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIEPQV
        ++++YISE CLIEDSLFIE   GKKAA+SF E+L  SMGPDVKFRI  VYER  S AGAIWHLEW+NM+IP TKGCTFIDIR+EERKTIQ  QII E Q+
Subjt:  ELSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIEPQV

Query:  KAGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHLGAQVYTNVLKFLHYVIESFK
        KAGHL LA MKLVTLLLAK+PAI EWL KVSQQRWVK +SKICI LFK LLD+FLKSYLTFIHLG Q+Y++VL FL YVIE FK
Subjt:  KAGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHLGAQVYTNVLKFLHYVIESFK

A0A6J1DVV2 uncharacterized protein LOC111023569 isoform X1 (e value: 3.6e-83; % identity: 62.68)
Query:  MALIISPPPQVVNLGSSQPLTRLSCTFLTKRTSCILQQEK----NYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEK
        MALI SPPP  ++LG SQ    L  T L K +SC+ QQ+K     YG+   RK N R STDVKL FV SCL +DS S L S++N  SEMIE FY+CINEK
Subjt:  MALIISPPPQVVNLGSSQPLTRLSCTFLTKRTSCILQQEK----NYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEK

Query:  NLKELSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIE
        NL+EL SYISE C+IEDSLFIEP  G+K AL FFEEL  SMG  VKFRI NVYE G SGAGAIW L WK+++IPF+KGCTFI+IRNE+R+ IQKAQII+E
Subjt:  NLKELSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIE

Query:  PQVKAGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHL-GAQVYTNVLKFLHYVI
        PQVKAGH ILA +KLVT LL  +PAIPEWL+K+ Q  WVKW+SKICI LF LL +SFL+S L F +L   + +   L FL Y++
Subjt:  PQVKAGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHL-GAQVYTNVLKFLHYVI

A0A6J1ELY0 uncharacterized protein LOC111435747 (e value: 2.0e-86; % identity: 62.59)
Query:  MALIISPPPQVVNLGSSQPLTRLSCTFLTKRTSCILQQEKNYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEKNLKE
        MALI SPPPQ + LG SQP    + T + KR S + QQ+      K RK +  L+TDVK  FVSSCL + S   L S +NSPSEM+++ Y+CINEK LKE
Subjt:  MALIISPPPQVVNLGSSQPLTRLSCTFLTKRTSCILQQEKNYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEKNLKE

Query:  LSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIEPQVK
        LSSY+SE CLIEDSLF E  IGK+AAL FF+EL  SMGPDVKFR  NVYE G S AG  WHL WKN KIPFTKGCTFI I NE+R TIQKAQIIIEPQVK
Subjt:  LSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIEPQVK

Query:  AGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHLGAQVYTNVLKFLHYV
        AGHLIL  MKLVT LLA+YPAI +W++K+SQQRWV+W++KIC++L+   L S L+SYLTFIH  + ++   LK L +V
Subjt:  AGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHLGAQVYTNVLKFLHYV

A0A6J1I4H1 uncharacterized protein LOC111469081 (e value: 3.3e-89; % identity: 63.12)
Query:  MALIISPPPQVVNLGSSQPLTRLSCTFLTKRTSCILQQEKNYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEKNLKE
        MALI SPPPQ + LG SQP    + T + K+ S + QQ+      K RK +  L TDVK  FV SCL + S   L S +NSPSEM+++FY+CINEK LKE
Subjt:  MALIISPPPQVVNLGSSQPLTRLSCTFLTKRTSCILQQEKNYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEKNLKE

Query:  LSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIEPQVK
        LSSYISE CLIEDSLF E  IGK+AAL FF+EL  SMGPDVKFR  NVYE G S AGA WHL WKN KIPFTKGCTFIDI NEER TIQKAQII+EPQVK
Subjt:  LSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIEPQVK

Query:  AGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHLGAQVYTNVLKFLHYVIESF
        AGHLIL  MKLVT LLA+YPAI +W++K+SQQRWV+W++KIC++L+K  L S ++SYLTFIH G+ ++   +K L +VI  F
Subjt:  AGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHLGAQVYTNVLKFLHYVIESF

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G71480.1 Nuclear transport factor 2 (NTF2) family protein (e value: 1.1e-23; % identity: 34.23)
Query:  SSANSPSEMIERFYKCINEKNLKELSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCT
        ++  S SE++  FY  +N  +L  ++  I++ C+ ED +F  P +G+KA L FF +   S   D++F I ++     S  G  WHLEWK    PF+KGC+
Subjt:  SSANSPSEMIERFYKCINEKNLKELSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCT

Query:  FIDIR-NEERKTIQKAQIIIEPQVKAGHLILATMKLVTLLLAKYPAIPE
        F  +   + ++ I   +  +EP +K G  +LA +K VT LL K+P + +
Subjt:  FIDIR-NEERKTIQKAQIIIEPQVKAGHLILATMKLVTLLLAKYPAIPE

AT5G41470.1 Nuclear transport factor 2 (NTF2) family protein (e value: 1.9e-28; % identity: 36.24)
Query:  HFVSSCLNNDSFSFLGSSANSPS--EMIERFYKCINEKNLKELSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGA
        + V SCL+  +     S  N  S  + + +FY  INEKN  +LSS IS  C I+D  F +P  GK+ A+ FFEEL  SMG +VKF + NV E     A  
Subjt:  HFVSSCLNNDSFSFLGSSANSPS--EMIERFYKCINEKNLKELSSYISEGCLIEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGA

Query:  IWHLEWKNMKIPFTKGCTFIDIRNE-ERKTIQKAQIIIEPQVKAGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSY
         WHLEWK  KIPFT+GC+F +  +E  R  I+ A+I+IE  +K G + L+ +K +T L  ++P   E  ++      ++   +I  +    L++  + SY
Subjt:  IWHLEWKNMKIPFTKGCTFIDIRNE-ERKTIQKAQIIIEPQVKAGHLILATMKLVTLLLAKYPAIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSY

Query:  LTFIHLGAQVYTNVLKFL
        L  +   A+ +  V+K +
Subjt:  LTFIHLGAQVYTNVLKFL
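
The unlabeled middle line of each alignment block above appears to follow the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a minimal sketch under that assumption, identity and positive counts for a segment can be tallied from the middle line alone. `midline_stats` and `percent_identity` are hypothetical helper names, and the example uses only the first ten columns of the first GenBank alignment, so the result is not expected to match the whole-alignment % identity reported for the hit.

```python
def midline_stats(midline, aligned_length=None):
    """Return (identical, positives, length) for a BLAST-style middle line.

    aligned_length may be given explicitly when trailing spaces of the
    middle line were lost; otherwise the length of the string is used.
    """
    length = len(midline) if aligned_length is None else aligned_length
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    return identical, positives, length


def percent_identity(midline, aligned_length=None):
    """Identical columns as a percentage of the aligned length."""
    identical, _, length = midline_stats(midline, aligned_length)
    return 100.0 * identical / length if length else 0.0


# Query "MALIISPPPQ" against the Cucumis melo hit, first ten columns only.
print(midline_stats("M+LI S  PQ"))     # prints: (6, 7, 10)
print(percent_identity("M+LI S  PQ"))  # prints: 60.0
```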


Sequences
CDS sequence
ATGGCCCTTATTATAAGTCCTCCTCCCCAAGTTGTCAACTTGGGGAGCTCCCAACCATTGACAAGATTGTCTTGTACATTCTTGACTAAAAGAACTTCATGTATATTACA
ACAAGAGAAGAACTATGGCAACAGCAAGAAGAGGAAAGCGAACGACAGATTATCCACAGATGTTAAGCTACATTTCGTCTCATCGTGCTTAAACAATGATTCCTTTTCCT
TCCTAGGTTCAAGTGCAAATTCTCCATCAGAAATGATTGAGAGATTCTACAAATGCATCAATGAAAAAAACTTGAAGGAATTGAGCAGTTACATCTCAGAAGGCTGCCTC
ATTGAAGACTCCTTGTTCATTGAACCAATTATAGGGAAGAAGGCAGCTCTGAGTTTCTTTGAAGAACTAGCTCATAGCATGGGTCCAGATGTGAAGTTTAGAATTCATAA
CGTCTACGAAAGAGGCGTTTCCGGGGCAGGAGCAATCTGGCATTTAGAGTGGAAGAACATGAAGATTCCCTTCACTAAGGGTTGCACCTTCATTGACATCAGAAATGAAG
AAAGAAAAACTATACAGAAGGCACAAATTATAATTGAACCACAAGTAAAAGCAGGACATCTAATCTTGGCTACAATGAAGCTTGTGACTTTATTGCTTGCTAAGTATCCA
GCAATTCCAGAATGGCTGATAAAAGTTTCCCAACAACGTTGGGTAAAGTGGATGTCAAAGATCTGTATAATTCTCTTCAAGCTTCTCTTGGACAGCTTTTTGAAGAGCTA
TCTAACTTTTATTCATTTAGGGGCTCAAGTGTATACAAATGTACTCAAATTTTTACATTATGTTATAGAATCTTTCAAGTAA
mRNA sequence
ACTTTCTTGGCCCTTGCATCTTTGTCTTCATAAAAAGCCCCCACCCGTTTCTCCTTTTGCAATGAATTAATAATTGATAATGGCCCTTATTATAAGTCCTCCTCCCCAAG
TTGTCAACTTGGGGAGCTCCCAACCATTGACAAGATTGTCTTGTACATTCTTGACTAAAAGAACTTCATGTATATTACAACAAGAGAAGAACTATGGCAACAGCAAGAAG
AGGAAAGCGAACGACAGATTATCCACAGATGTTAAGCTACATTTCGTCTCATCGTGCTTAAACAATGATTCCTTTTCCTTCCTAGGTTCAAGTGCAAATTCTCCATCAGA
AATGATTGAGAGATTCTACAAATGCATCAATGAAAAAAACTTGAAGGAATTGAGCAGTTACATCTCAGAAGGCTGCCTCATTGAAGACTCCTTGTTCATTGAACCAATTA
TAGGGAAGAAGGCAGCTCTGAGTTTCTTTGAAGAACTAGCTCATAGCATGGGTCCAGATGTGAAGTTTAGAATTCATAACGTCTACGAAAGAGGCGTTTCCGGGGCAGGA
GCAATCTGGCATTTAGAGTGGAAGAACATGAAGATTCCCTTCACTAAGGGTTGCACCTTCATTGACATCAGAAATGAAGAAAGAAAAACTATACAGAAGGCACAAATTAT
AATTGAACCACAAGTAAAAGCAGGACATCTAATCTTGGCTACAATGAAGCTTGTGACTTTATTGCTTGCTAAGTATCCAGCAATTCCAGAATGGCTGATAAAAGTTTCCC
AACAACGTTGGGTAAAGTGGATGTCAAAGATCTGTATAATTCTCTTCAAGCTTCTCTTGGACAGCTTTTTGAAGAGCTATCTAACTTTTATTCATTTAGGGGCTCAAGTG
TATACAAATGTACTCAAATTTTTACATTATGTTATAGAATCTTTCAAGTAAAGGATACATCATATAGACACACAAAGATTTATTCTGTTCGTGGGCATGGAAAAGATTGT
AAACATACAAAACATCCCCTTCTAGTCATCATTTCCATCACGACTCAAATACGTTTCTAACAATGAGAAAACAGCCAATGGACCAAGAAGGGGAGGGAGTGGTGGAGGCC
TACTGAACGCTGCAATTGGGTCTGCTTTCTCTTTGGTAACCACAGACTCTCCCTCAGAATCATTCTTCATTCCTTGCCTTTGTACTGATGAGGCCAGACTACCTAATCAA
ATTGCAGTTCAACAATTGTTTCATGTCATGAAATTGCAAGGCAAGGAAACAAACAAAAATGAAACAGTCAAATTATAGAATATGTTTGGTCCAATAACTTTTCTCTATAT
AATATCTGTACTTTAAAAACTAAAAGATTCAAATAGGTTGCAAAATTTGTATAGTTGAATTATAAGTTTACCTAATGTACCTTAAAATGTTGAAAAAGTCTTTGAACTTT
CAATTTAGTAGCCAATAAATCTCTTCCTTTATTTTCACTAA
Protein sequence
MALIISPPPQVVNLGSSQPLTRLSCTFLTKRTSCILQQEKNYGNSKKRKANDRLSTDVKLHFVSSCLNNDSFSFLGSSANSPSEMIERFYKCINEKNLKELSSYISEGCL
IEDSLFIEPIIGKKAALSFFEELAHSMGPDVKFRIHNVYERGVSGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERKTIQKAQIIIEPQVKAGHLILATMKLVTLLLAKYP
AIPEWLIKVSQQRWVKWMSKICIILFKLLLDSFLKSYLTFIHLGAQVYTNVLKFLHYVIESFK
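
As a rough consistency check between the CDS and protein sequences listed above, the following sketch translates a nucleotide string with the standard genetic code (NCBI translation table 1). That this gene uses the standard nuclear code is an assumption made here, and `translate` is a hypothetical helper rather than anything provided by the database. Only the first ten and the last five codons of the CDS are used as examples.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a coding sequence read in frame from its first base.

    Unknown codons become 'X'. With to_stop=True, translation ends at
    the first stop codon and the stop itself is not emitted.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS: matches the start of the protein sequence.
print(translate("ATGGCCCTTATTATAAGTCCTCCTCCCCAA"))  # prints: MALIISPPPQ
# Last 15 bases of the CDS, including the TAA stop codon.
print(translate("GAATCTTTCAAGTAA"))                 # prints: ESFK
```

Passing the full CDS, joined into a single string, should reproduce the listed protein sequence if the assumption about the genetic code holds.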