CuGenDBv2

Sed0004172 (gene) of Chayote v1 genome

Gene ID: Sed0004172
Organism: Sechium edule (Chayote v1)
Description: SnoaL-like domain-containing protein
Genome location: LG05:5919923..5921964
RNA-Seq Expression: Sed0004172
Synteny: Sed0004172
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR032710 - NTF2-like domain superfamily; IPR037401 - SnoaL-like domain
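The genome location above is written as `sequence:start..end`. The following is a minimal sketch of how such a string might be parsed, assuming the coordinates are 1-based and inclusive (the page does not state the convention, so this is an assumption). The function names `parse_location` and `span_length` are hypothetical and are not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a string such as 'LG05:5919923..5921964' into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

seq_id, start, end = parse_location("LG05:5919923..5921964")
print(seq_id, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus on LG05 spans 2042 bp.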


Homology
GenBank top hits | e value | % identity
XP_008437322.1 PREDICTED: uncharacterized protein LOC103482777 isoform X1 [Cucumis melo] | 9.0e-89 | 65.25
Query:  ISAPQAVNL-RSKSLTKLSHT-SLPKRTSCLFQQKTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSLKEL
        I++PQAVN   S S  + S+T  L KRTSC+FQQK     YG +N RK N        +R VSSCL D S S   SSSN P EMI  FYKCINEK+LK++
Subjt:  ISAPQAVNL-RSKSLTKLSHT-SLPKRTSCLFQQKTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSLKEL

Query:  SSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQVKA
        ++YISEDCLIEDSLFIE F GKKAA+ F E+LT+SMGPDVKFRI TVYER  + AGAIWHLEW+NM+IP TKGCTFIDIR+EER+T +  QII E Q+KA
Subjt:  SSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQVKA

Query:  GHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGAQLYIIVLKFLHFILEYFK
        GH  LA+MKLVTLLLAK+P I EW+ KVSQQRWVK ISKICI LFK +LD+FLKSYL  IH G QLY  VL FL +++E FK
Subjt:  GHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGAQLYIIVLKFLHFILEYFK

XP_011654728.2 uncharacterized protein LOC101214565 [Cucumis sativus] | 2.6e-88 | 65.48
Query:  ISAPQAVNL-RSKSLTKL-SHTSLPKRTSCLFQQKTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSLKEL
        I++PQAVN   S+S  +  S+T L KRTSC+FQQK     YG +N RK N           V SCL D S S   SSSN P EMI +FYKCINEK+LKE+
Subjt:  ISAPQAVNL-RSKSLTKL-SHTSLPKRTSCLFQQKTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSLKEL

Query:  SSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQVKA
        S+YISEDCLIEDSLFIE F GKKAA+ F E+LT+SMGPDVKFRI  VYER  + AGAIWHLEW+NM+IP TKGCTFIDIR+EER+T +K QII EPQ KA
Subjt:  SSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQVKA

Query:  GHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGAQLYIIVLKFLHFILEYF
        GH IL +MKLVTLLLAK   I EW+IK SQQRWVK +SKIC+ LF L+LDSF KSYL  IHFGAQLY  VLKFL++I+++F
Subjt:  GHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGAQLYIIVLKFLHFILEYF

XP_022929026.1 uncharacterized protein LOC111435747 [Cucurbita moschata] | 6.5e-87 | 61.35
Query:  VFTISAPQAVNLRSKSLTKLSHTSLPKRTSCLFQQKTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSLKE
        + T   PQA+ L S+     ++TS+PKR S LFQQK           RK +Y  +TD K RFVSSCLKDGSV  LDS SN P+EM+ K Y+CINEK LKE
Subjt:  VFTISAPQAVNLRSKSLTKLSHTSLPKRTSCLFQQKTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSLKE

Query:  LSSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQVK
        LSSY+SEDCLIEDSLF E FIGK+AALKFF+ELTQSMGPDVKFR   VYE GA+ AG  WHL WKN KIPFTKGCTFI I NE+ RT +KAQIIIEPQVK
Subjt:  LSSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQVK

Query:  AGHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGAQLYIIVLKFLHFILEYF
        AGH IL +MKLVT LLA+YP I +WV+K+SQQRWV+ ++KIC++L+   L S L+SYL  IH  + +++  LK L ++  +F
Subjt:  AGHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGAQLYIIVLKFLHFILEYF

XP_022970084.1 uncharacterized protein LOC111469081 [Cucurbita maxima] | 1.2e-88 | 61.7
Query:  VFTISAPQAVNLRSKSLTKLSHTSLPKRTSCLFQQKTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSLKE
        + T   PQA+ L S+     ++TS+PK+ S LFQQK           RK +Y   TD K RFV SCLKDGSV  LDS SN P+EM+ KFY+CINEK LKE
Subjt:  VFTISAPQAVNLRSKSLTKLSHTSLPKRTSCLFQQKTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSLKE

Query:  LSSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQVK
        LSSYISEDCLIEDSLF E FIGK+AALKFF+ELTQSMGPDVKFR   VYE G + AGA WHL WKN KIPFTKGCTFIDI NEE RT +KAQII+EPQVK
Subjt:  LSSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQVK

Query:  AGHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGAQLYIIVLKFLHFILEYF
        AGH IL +MKLVT LLA+YP I +WV+K+SQQRWV+ ++KIC++L+K  L S ++SYL  IH G+ +++  +K L +++ +F
Subjt:  AGHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGAQLYIIVLKFLHFILEYF

XP_038906943.1 uncharacterized protein LOC120092809 [Benincasa hispida] | 2.3e-108 | 72.98
Query:  MVFTISAPQAVNL-RSKSLTKLSHTSLPKRTSCLFQQKTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSL
        ++ +   PQAVNL  S+SL + S T L KR+SC+ QQK     YG    +K N R STD KL FVSSCLKD S S LDS SN P+EMI +FYKCINEK+L
Subjt:  MVFTISAPQAVNL-RSKSLTKLSHTSLPKRTSCLFQQKTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSL

Query:  KELSSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQ
        KELSSYISEDC IEDSLF+EPFIGKKAAL+FFEELT SMGPDVKFRI  +YER  +  GAIWHLEWKNM+IPFTKGCTFIDIRNEER+T +KAQIIIEPQ
Subjt:  KELSSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQ

Query:  VKAGHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGAQLYIIVLKFLHFILEYFK
        +KAGH ILA+MKLVTLLLAKYP IPEW+IKVSQQRWVK +SKICIILFKL+LDSFLKSYL  IHFGA+LY  VLKFLH++++ FK
Subjt:  VKAGHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGAQLYIIVLKFLHFILEYFK

TrEMBL top hits | e value | % identity
A0A0A0KMI4 SnoaL-like domain-containing protein | 3.8e-85 | 66.29
Query:  ISAPQAVNL-RSKSLTKL-SHTSLPKRTSCLFQQKTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSLKEL
        I++PQAVN   S+S  +  S+T L KRTSC+FQQK     YG +N RK N           V SCL D S S   SSSN P EMI +FYKCINEK+LKE+
Subjt:  ISAPQAVNL-RSKSLTKL-SHTSLPKRTSCLFQQKTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSLKEL

Query:  SSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQVKA
        S+YISEDCLIEDSLFIE F GKKAA+ F E+LT+SMGPDVKFRI  VYER  + AGAIWHLEW+NM+IP TKGCTFIDIR+EER+T +K QII EPQ KA
Subjt:  SSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQVKA

Query:  GHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGAQLY
        GH IL +MKLVTLLLAK   I EW+IK SQQRWVK +SKIC+ LF L+LDSF KSYL  IHFGAQLY
Subjt:  GHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGAQLY

A0A1S3ATV9 uncharacterized protein LOC103482777 isoform X1 | 4.4e-89 | 65.25
Query:  ISAPQAVNL-RSKSLTKLSHT-SLPKRTSCLFQQKTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSLKEL
        I++PQAVN   S S  + S+T  L KRTSC+FQQK     YG +N RK N        +R VSSCL D S S   SSSN P EMI  FYKCINEK+LK++
Subjt:  ISAPQAVNL-RSKSLTKLSHT-SLPKRTSCLFQQKTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSLKEL

Query:  SSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQVKA
        ++YISEDCLIEDSLFIE F GKKAA+ F E+LT+SMGPDVKFRI TVYER  + AGAIWHLEW+NM+IP TKGCTFIDIR+EER+T +  QII E Q+KA
Subjt:  SSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQVKA

Query:  GHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGAQLYIIVLKFLHFILEYFK
        GH  LA+MKLVTLLLAK+P I EW+ KVSQQRWVK ISKICI LFK +LD+FLKSYL  IH G QLY  VL FL +++E FK
Subjt:  GHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGAQLYIIVLKFLHFILEYFK

A0A6J1DVV2 uncharacterized protein LOC111023569 isoform X1 | 9.1e-87 | 62.5
Query:  ISAPQAVNLRSKSLTKLSHTSLPKRTSCLFQQ-KTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSLKELS
        I +P  ++L S+S   L +T+LPK +SC+FQQ KT  L+YG W  RK N RSSTD KLRFV SCL D S S LDS+SNP +EMI  FY+CINEK+L+EL 
Subjt:  ISAPQAVNLRSKSLTKLSHTSLPKRTSCLFQQ-KTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSLKELS

Query:  SYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQVKAG
        SYISEDC+IEDSLFIEPF G+K AL+FFEELTQSMG  VKFRI  VYE G +GAGAIW L WK+++IPF+KGCTFI+IRNE+RR  +KAQII+EPQVKAG
Subjt:  SYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQVKAG

Query:  HFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIH-FGAQLYIIVLKFLHFILEY
        HFILA++KLVT LL  +P IPEW++K+ Q  WVK +SKICI LF L+ +SFL+S LA  + +  + +++ L FL +IL +
Subjt:  HFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIH-FGAQLYIIVLKFLHFILEY

A0A6J1ELY0 uncharacterized protein LOC111435747 | 3.1e-87 | 61.35
Query:  VFTISAPQAVNLRSKSLTKLSHTSLPKRTSCLFQQKTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSLKE
        + T   PQA+ L S+     ++TS+PKR S LFQQK           RK +Y  +TD K RFVSSCLKDGSV  LDS SN P+EM+ K Y+CINEK LKE
Subjt:  VFTISAPQAVNLRSKSLTKLSHTSLPKRTSCLFQQKTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSLKE

Query:  LSSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQVK
        LSSY+SEDCLIEDSLF E FIGK+AALKFF+ELTQSMGPDVKFR   VYE GA+ AG  WHL WKN KIPFTKGCTFI I NE+ RT +KAQIIIEPQVK
Subjt:  LSSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQVK

Query:  AGHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGAQLYIIVLKFLHFILEYF
        AGH IL +MKLVT LLA+YP I +WV+K+SQQRWV+ ++KIC++L+   L S L+SYL  IH  + +++  LK L ++  +F
Subjt:  AGHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGAQLYIIVLKFLHFILEYF

A0A6J1I4H1 uncharacterized protein LOC111469081 | 5.7e-89 | 61.7
Query:  VFTISAPQAVNLRSKSLTKLSHTSLPKRTSCLFQQKTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSLKE
        + T   PQA+ L S+     ++TS+PK+ S LFQQK           RK +Y   TD K RFV SCLKDGSV  LDS SN P+EM+ KFY+CINEK LKE
Subjt:  VFTISAPQAVNLRSKSLTKLSHTSLPKRTSCLFQQKTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSLKE

Query:  LSSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQVK
        LSSYISEDCLIEDSLF E FIGK+AALKFF+ELTQSMGPDVKFR   VYE G + AGA WHL WKN KIPFTKGCTFIDI NEE RT +KAQII+EPQVK
Subjt:  LSSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQVK

Query:  AGHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGAQLYIIVLKFLHFILEYF
        AGH IL +MKLVT LLA+YP I +WV+K+SQQRWV+ ++KIC++L+K  L S ++SYL  IH G+ +++  +K L +++ +F
Subjt:  AGHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGAQLYIIVLKFLHFILEYF

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT1G71480.1 Nuclear transport factor 2 (NTF2) family protein | 6.3e-24 | 33.33
Query:  DSSSNPPTEMIVKFYKCINEKSLKELSSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGC
        +++    +E++  FY  +N   L  ++  I++DC+ ED +F  PF+G+KA L FF +  +S   D++F I  +    ++  G  WHLEWK    PF+KGC
Subjt:  DSSSNPPTEMIVKFYKCINEKSLKELSSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGC

Query:  TFIDIR-NEERRTTRKAQIIIEPQVKAGHFILAVMKLVTLLLAKYPTIPE
        +F  +   + +R     +  +EP +K G  +LA +K VT LL K+P + +
Subjt:  TFIDIR-NEERRTTRKAQIIIEPQVKAGHFILAVMKLVTLLLAKYPTIPE

AT5G41470.1 Nuclear transport factor 2 (NTF2) family protein | 4.1e-31 | 36.41
Query:  VSGLDSSSNPPTEM-----IVKFYKCINEKSLKELSSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKN
        VS LD  ++ P ++     ++KFY  INEK+  +LSS IS DC I+D  F +PF GK+ A++FFEEL +SMG +VKF +  V E     A   WHLEWK 
Subjt:  VSGLDSSSNPPTEM-----IVKFYKCINEKSLKELSSYISEDCLIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKN

Query:  MKIPFTKGCTFIDIRNE-ERRTTRKAQIIIEPQVKAGHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGA
         KIPFT+GC+F +  +E  R   R A+I+IE  +K G   L+++K +T L  ++P   E  ++      ++   +I  +    +++  + SYL  +   A
Subjt:  MKIPFTKGCTFIDIRNE-ERRTTRKAQIIIEPQVKAGHFILAVMKLVTLLLAKYPTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGA

Query:  QLYIIVLKFLHFILEYF
        + +++V+K +  I   F
Subjt:  QLYIIVLKFLHFILEYF
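The alignments above follow a BLAST-style layout in which the unlabeled middle line marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. As a rough sketch, assuming that convention holds and that each midline is padded to the full width of its alignment block, a percent identity can be estimated from the midlines alone. The function name `percent_identity` is hypothetical, and the input below is toy data rather than one of the alignments on this page.

```python
def percent_identity(midlines):
    """Estimate % identity from BLAST-style midline strings.

    Letters mark identical residues; '+' marks a conservative
    substitution and a space marks a mismatch or gap column.
    """
    identical = sum(ch.isalpha() for line in midlines for ch in line)
    aligned = sum(len(line) for line in midlines)
    if aligned == 0:
        raise ValueError("no alignment columns supplied")
    return 100.0 * identical / aligned

# Toy midlines (not taken from the page): 7 identical columns out of 11.
toy_midlines = ["AC+ FG", "KL  M"]
print(round(percent_identity(toy_midlines), 2))
```

Values computed this way may differ from the reported % identity if trailing spaces were lost from the midlines or if the displayed alignment is only part of the full hit.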


Sequences
CDS sequence
ATGGTCTTTACAATAAGTGCTCCTCAAGCTGTCAACTTGAGATCCAAATCTTTGACAAAACTTTCCCATACATCCTTGCCTAAAAGAACCTCATGTTTATTTCAACAAAA
GACCAGAGAACTAAGATATGGTGTCTGGAACAATAGGAAAGCAAACTACAGATCATCCACCGACACAAAACTCCGTTTCGTGTCGTCGTGCCTGAAGGATGGTTCGGTTT
CCGGATTGGATTCGAGTTCAAATCCTCCAACAGAAATGATTGTGAAGTTTTACAAATGCATCAATGAAAAAAGCTTGAAGGAATTGAGCAGTTACATCTCAGAAGATTGC
CTCATTGAGGACTCTTTGTTCATTGAACCATTCATAGGGAAGAAGGCAGCTCTGAAGTTCTTTGAAGAACTAACTCAAAGCATGGGTCCAGATGTTAAGTTTAGAATTAG
TACCGTCTACGAAAGAGGCGCTACCGGGGCAGGAGCAATCTGGCATTTAGAGTGGAAGAACATGAAGATTCCCTTCACCAAGGGTTGCACTTTCATTGACATCAGAAATG
AAGAAAGAAGAACAACAAGAAAGGCACAAATTATAATTGAACCACAAGTCAAAGCAGGACATTTCATCTTGGCTGTAATGAAGCTTGTGACTTTATTGCTTGCCAAGTAT
CCAACAATTCCTGAATGGGTGATTAAAGTTTCTCAACAACGTTGGGTAAAGGGGATATCAAAGATCTGTATAATTCTCTTCAAGCTCATCTTGGACAGCTTTTTGAAGAG
CTATCTAGCCTGTATACATTTCGGAGCTCAACTGTATATTATTGTACTCAAATTTTTACATTTTATTTTAGAGTATTTCAAGTAA
mRNA sequence
TACCCATTTCCCCTTCTGTTGTACTGAATCAACAATAATGGTCTTTACAATAAGTGCTCCTCAAGCTGTCAACTTGAGATCCAAATCTTTGACAAAACTTTCCCATACAT
CCTTGCCTAAAAGAACCTCATGTTTATTTCAACAAAAGACCAGAGAACTAAGATATGGTGTCTGGAACAATAGGAAAGCAAACTACAGATCATCCACCGACACAAAACTC
CGTTTCGTGTCGTCGTGCCTGAAGGATGGTTCGGTTTCCGGATTGGATTCGAGTTCAAATCCTCCAACAGAAATGATTGTGAAGTTTTACAAATGCATCAATGAAAAAAG
CTTGAAGGAATTGAGCAGTTACATCTCAGAAGATTGCCTCATTGAGGACTCTTTGTTCATTGAACCATTCATAGGGAAGAAGGCAGCTCTGAAGTTCTTTGAAGAACTAA
CTCAAAGCATGGGTCCAGATGTTAAGTTTAGAATTAGTACCGTCTACGAAAGAGGCGCTACCGGGGCAGGAGCAATCTGGCATTTAGAGTGGAAGAACATGAAGATTCCC
TTCACCAAGGGTTGCACTTTCATTGACATCAGAAATGAAGAAAGAAGAACAACAAGAAAGGCACAAATTATAATTGAACCACAAGTCAAAGCAGGACATTTCATCTTGGC
TGTAATGAAGCTTGTGACTTTATTGCTTGCCAAGTATCCAACAATTCCTGAATGGGTGATTAAAGTTTCTCAACAACGTTGGGTAAAGGGGATATCAAAGATCTGTATAA
TTCTCTTCAAGCTCATCTTGGACAGCTTTTTGAAGAGCTATCTAGCCTGTATACATTTCGGAGCTCAACTGTATATTATTGTACTCAAATTTTTACATTTTATTTTAGAG
TATTTCAAGTAAAGGATACATCATTATATCTTAACTGCACAAACATCCATTCACTTCATGACATGGAATAAATAAAACTCGCTAACTTGCCGGGAAGTATCTCGTACATC
CTTCCCACCATTTGTTTGTATAATAATG
Protein sequence
MVFTISAPQAVNLRSKSLTKLSHTSLPKRTSCLFQQKTRELRYGVWNNRKANYRSSTDTKLRFVSSCLKDGSVSGLDSSSNPPTEMIVKFYKCINEKSLKELSSYISEDC
LIEDSLFIEPFIGKKAALKFFEELTQSMGPDVKFRISTVYERGATGAGAIWHLEWKNMKIPFTKGCTFIDIRNEERRTTRKAQIIIEPQVKAGHFILAVMKLVTLLLAKY
PTIPEWVIKVSQQRWVKGISKICIILFKLILDSFLKSYLACIHFGAQLYIIVLKFLHFILEYFK
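The three sequences above are related: the CDS should occur within the mRNA, and translating the CDS should give the protein. The following is a minimal sketch of how that could be checked, assuming the standard genetic code applies. The helper names `translate` and `utr_lengths` are hypothetical, and only the first 30 nt of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

def utr_lengths(mrna, cds):
    """Return (5' UTR length, 3' UTR length) by locating the CDS in the mRNA."""
    mrna = "".join(mrna.split()).upper()
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)
    if start < 0:
        raise ValueError("CDS not found in mRNA")
    return start, len(mrna) - start - len(cds)

# First 30 nt of the CDS listed above.
print(translate("ATGGTCTTTACAATAAGTGCTCCTCAAGCT"))  # MVFTISAPQA
```

To check the full record, join the wrapped lines of each sequence into a single string and pass them to `translate` and `utr_lengths`.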