; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G002290 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G002290
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionSnoaL-like domain-containing protein
Genome locationchr09:2386357..2388075
RNA-Seq ExpressionLsi09G002290
SyntenyLsi09G002290
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR032710 - NTF2-like domain superfamily
IPR037401 - SnoaL-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008437322.1 PREDICTED: uncharacterized protein LOC103482777 isoform X1 [Cucumis melo]1.4e-8668.1Show/hide
Query:  MSLTTNSPPQAVNLRGSQSSRRFSCT-SLTKRTSCILQQ-KNYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEKNLK
        MSL T+  PQAVN  GS S RRFS T  L KRTSCI QQ KNYGN   RKTN RL        VSSCL DDSFS   SSSNSP E+IE FYKCINEKNLK
Subjt:  MSLTTNSPPQAVNLRGSQSSRRFSCT-SLTKRTSCILQQ-KNYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEKNLK

Query:  ELSNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIEPQI
        +++ YISEDCLIEDS FIE F GKKAA+SF E+LT+SMGPDVK ++  +YE   S A AIW+LEW+NMEIPL+KGCTFIDIR+EERKTIQ  QII E Q+
Subjt:  ELSNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIEPQI

Query:  KAGHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFWAQLY
        KAGHL L          AIMK+VTLLLA +PAI EWL KVSQQRWV  +SKICI LFK LLD+FLKSYLTFIH   QLY
Subjt:  KAGHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFWAQLY

XP_011654728.2 uncharacterized protein LOC101214565 [Cucumis sativus]5.9e-9070.25Show/hide
Query:  MSLTTNSPPQAVNLRGSQSSRRFSC-TSLTKRTSCILQQ-KNYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEKNLK
        MSL T+  PQAVN  GSQS RRFS  T L KRTSCI QQ KNYGN   RKTNN L        V SCL DDSFS   SSSNSP E+IERFYKCINEKNLK
Subjt:  MSLTTNSPPQAVNLRGSQSSRRFSC-TSLTKRTSCILQQ-KNYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEKNLK

Query:  ELSNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIEPQI
        E+S YISEDCLIEDS FIE F GKKAA+SF E+LT+SMGPDVK ++  +YE   S A AIW+LEW+NMEIPL+KGCTFIDIR+EERKTIQK QII EPQ 
Subjt:  ELSNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIEPQI

Query:  KAGHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFWAQLY
        KAGHLIL           IMK+VTLLLA   AI EWLIK SQQRWV WMSKIC+ LF  LLDSF KSYLTFIHF AQLY
Subjt:  KAGHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFWAQLY

XP_022929026.1 uncharacterized protein LOC111435747 [Cucurbita moschata]3.8e-8161.15Show/hide
Query:  MSLTTNSPPQAVNLRGSQSSRRFSCTSLTKRTSCILQQKNYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEKNLKEL
        M+L T+ PPQA+ L GSQ  R F+ TS+ KR S + QQK     KLRKT+  L+TDVK  FVSSCLKD S  SLDS SNSPSE++++ Y+CINEK LKEL
Subjt:  MSLTTNSPPQAVNLRGSQSSRRFSCTSLTKRTSCILQQKNYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEKNLKEL

Query:  SNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIEPQIKA
        S+Y+SEDCLIEDS F E FIGK+AAL FF+ELTQSMGPDVK +  ++YE GASRA   W+L WKN +IP +KGCTFI I NE+R TIQKAQIIIEPQ+KA
Subjt:  SNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIEPQIKA

Query:  GHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFWAQLYI
        GHLIL           +MK+VT LLA YPAI +W++K+SQQRWV W++KIC++L+   L S L+SYLTFIH  + +++
Subjt:  GHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFWAQLYI

XP_022970084.1 uncharacterized protein LOC111469081 [Cucurbita maxima]2.0e-8261.51Show/hide
Query:  MSLTTNSPPQAVNLRGSQSSRRFSCTSLTKRTSCILQQKNYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEKNLKEL
        M+L T+ PPQA+ L GSQ  R F+ TS+ K+ S + QQK     KLRKT+  L TDVK  FV SCLKD S  SLDS SNSPSE++++FY+CINEK LKEL
Subjt:  MSLTTNSPPQAVNLRGSQSSRRFSCTSLTKRTSCILQQKNYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEKNLKEL

Query:  SNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIEPQIKA
        S+YISEDCLIEDS F E FIGK+AAL FF+ELTQSMGPDVK +  ++YE G SRA A W+L WKN +IP +KGCTFIDI NEER TIQKAQII+EPQ+KA
Subjt:  SNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIEPQIKA

Query:  GHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFWAQLYI
        GHLIL           +MK+VT LLA YPAI +W++K+SQQRWV W++KIC++L+K  L S ++SYLTFIH  + +++
Subjt:  GHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFWAQLYI

XP_038906943.1 uncharacterized protein LOC120092809 [Benincasa hispida]1.5e-11783.15Show/hide
Query:  PPQAVNLRGSQSSRRFSCTSLTKRTSCILQQ-KNYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEKNLKELSNYISE
        PPQAVNL GSQS RRFSCT LTKR+SCILQQ KNYGNCK +KTNNRLSTDVKLHFVSSCLKDDSFS LDS SNSPSE+IERFYKCINEKNLKELS+YISE
Subjt:  PPQAVNLRGSQSSRRFSCTSLTKRTSCILQQ-KNYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEKNLKELSNYISE

Query:  DCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIEPQIKAGHLILV
        DC IEDS F+EPFIGKKAAL FFEELT SMGPDVK ++H+IYE   S   AIW+LEWKNMEIP +KGCTFIDIRNEERKTIQKAQIIIEPQIKAGHLIL 
Subjt:  DCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIEPQIKAGHLILV

Query:  SIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFWAQLYIN
                 AIMK+VTLLLA YPAIPEWLIKVSQQRWV WMSKICIILFK LLDSFLKSYLTFIHF A+LY N
Subjt:  SIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFWAQLYIN

TrEMBL top hitse value%identityAlignment
A0A0A0KMI4 SnoaL-like domain-containing protein2.0e-10771.52Show/hide
Query:  MSLTTNSPPQAVNLRGSQSSRRFSC-TSLTKRTSCILQQ-KNYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEKNLK
        MSL T+  PQAVN  GSQS RRFS  T L KRTSCI QQ KNYGN   RKTNN L        V SCL DDSFS   SSSNSP E+IERFYKCINEKNLK
Subjt:  MSLTTNSPPQAVNLRGSQSSRRFSC-TSLTKRTSCILQQ-KNYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEKNLK

Query:  ELSNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIEPQI
        E+S YISEDCLIEDS FIE F GKKAA+SF E+LT+SMGPDVK ++  +YE   S A AIW+LEW+NMEIPL+KGCTFIDIR+EERKTIQK QII EPQ 
Subjt:  ELSNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIEPQI

Query:  KAGHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFWAQLYINSSFPSRLKYVSNNEKTANG
        KAGHLIL           IMK+VTLLLA   AI EWLIK SQQRWV WMSKIC+ LF  LLDSF KSYLTFIHF AQLY  SSFPSR+KY+SN EKTANG
Subjt:  KAGHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFWAQLYINSSFPSRLKYVSNNEKTANG

Query:  PRRGGSGGGLLNAAIG
        PR GGSGGGLLNA IG
Subjt:  PRRGGSGGGLLNAAIG

A0A1S3ATV9 uncharacterized protein LOC103482777 isoform X16.6e-8768.1Show/hide
Query:  MSLTTNSPPQAVNLRGSQSSRRFSCT-SLTKRTSCILQQ-KNYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEKNLK
        MSL T+  PQAVN  GS S RRFS T  L KRTSCI QQ KNYGN   RKTN RL        VSSCL DDSFS   SSSNSP E+IE FYKCINEKNLK
Subjt:  MSLTTNSPPQAVNLRGSQSSRRFSCT-SLTKRTSCILQQ-KNYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEKNLK

Query:  ELSNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIEPQI
        +++ YISEDCLIEDS FIE F GKKAA+SF E+LT+SMGPDVK ++  +YE   S A AIW+LEW+NMEIPL+KGCTFIDIR+EERKTIQ  QII E Q+
Subjt:  ELSNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIEPQI

Query:  KAGHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFWAQLY
        KAGHL L          AIMK+VTLLLA +PAI EWL KVSQQRWV  +SKICI LFK LLD+FLKSYLTFIH   QLY
Subjt:  KAGHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFWAQLY

A0A6J1DVV2 uncharacterized protein LOC111023569 isoform X14.7e-7759.35Show/hide
Query:  MSLTTNSPPQAVNLRGSQSSRRFSCTSLTKRTSCILQQK-----NYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEK
        M+L  + PP ++   GSQS R    T+L K +SC+ QQK      YG+  +RKTN R STDVKL FV SCL DDS S LDS+SN  SE+IE FY+CINEK
Subjt:  MSLTTNSPPQAVNLRGSQSSRRFSCTSLTKRTSCILQQK-----NYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEK

Query:  NLKELSNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIE
        NL+EL +YISEDC+IEDS FIEPF G+K AL FFEELTQSMG  VK ++ ++YE G S A AIW L WK++EIP SKGCTFI+IRNE+R+ IQKAQII+E
Subjt:  NLKELSNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIE

Query:  PQIKAGHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFW
        PQ+KAGH IL          A++K+VT LL T+PAIPEWL+K+ Q  WV W+SKICI LF  L +SFL+S L F + +
Subjt:  PQIKAGHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFW

A0A6J1ELY0 uncharacterized protein LOC1114357471.8e-8161.15Show/hide
Query:  MSLTTNSPPQAVNLRGSQSSRRFSCTSLTKRTSCILQQKNYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEKNLKEL
        M+L T+ PPQA+ L GSQ  R F+ TS+ KR S + QQK     KLRKT+  L+TDVK  FVSSCLKD S  SLDS SNSPSE++++ Y+CINEK LKEL
Subjt:  MSLTTNSPPQAVNLRGSQSSRRFSCTSLTKRTSCILQQKNYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEKNLKEL

Query:  SNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIEPQIKA
        S+Y+SEDCLIEDS F E FIGK+AAL FF+ELTQSMGPDVK +  ++YE GASRA   W+L WKN +IP +KGCTFI I NE+R TIQKAQIIIEPQ+KA
Subjt:  SNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIEPQIKA

Query:  GHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFWAQLYI
        GHLIL           +MK+VT LLA YPAI +W++K+SQQRWV W++KIC++L+   L S L+SYLTFIH  + +++
Subjt:  GHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFWAQLYI

A0A6J1I4H1 uncharacterized protein LOC1114690819.8e-8361.51Show/hide
Query:  MSLTTNSPPQAVNLRGSQSSRRFSCTSLTKRTSCILQQKNYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEKNLKEL
        M+L T+ PPQA+ L GSQ  R F+ TS+ K+ S + QQK     KLRKT+  L TDVK  FV SCLKD S  SLDS SNSPSE++++FY+CINEK LKEL
Subjt:  MSLTTNSPPQAVNLRGSQSSRRFSCTSLTKRTSCILQQKNYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEKNLKEL

Query:  SNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIEPQIKA
        S+YISEDCLIEDS F E FIGK+AAL FF+ELTQSMGPDVK +  ++YE G SRA A W+L WKN +IP +KGCTFIDI NEER TIQKAQII+EPQ+KA
Subjt:  SNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIEPQIKA

Query:  GHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFWAQLYI
        GHLIL           +MK+VT LLA YPAI +W++K+SQQRWV W++KIC++L+K  L S ++SYLTFIH  + +++
Subjt:  GHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFWAQLYI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G71480.1 Nuclear transport factor 2 (NTF2) family protein1.3e-1831.25Show/hide
Query:  DSSSNSPSEIIERFYKCINEKNLKELSNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGC
        +++  S SE++  FY  +N  +L  +++ I++DC+ ED  F  PF+G+KA L FF +  +S   D++  +  I    +S     W+LEWK    P SKGC
Subjt:  DSSSNSPSEIIERFYKCINEKNLKELSNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGC

Query:  TFIDIR-NEERKTIQKAQIIIEPQIKAGHLILVSIQAIISNNAIMKVVTLLLATYPAIPE
        +F  +   + ++ I   +  +EP IK G  +L +I          K VT LL  +P + +
Subjt:  TFIDIR-NEERKTIQKAQIIIEPQIKAGHLILVSIQAIISNNAIMKVVTLLLATYPAIPE

AT5G41470.1 Nuclear transport factor 2 (NTF2) family protein2.7e-2433.8Show/hide
Query:  SSLDSSSNSPSEI-----IERFYKCINEKNLKELSNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNM
        S LD  ++ P++I     + +FY  INEKN  +LS+ IS DC I+D SF +PF GK+ A+ FFEEL +SMG +VK  V ++ E     A   W+LEWK  
Subjt:  SSLDSSSNSPSEI-----IERFYKCINEKNLKELSNYISEDCLIEDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNM

Query:  EIPLSKGCTFIDIRNE-ERKTIQKAQIIIEPQIKAGHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKS
        +IP ++GC+F +  +E  R  I+ A+I+IE  IK G + L          +++K +T L   +P   E  ++      +    +I  +    L++  + S
Subjt:  EIPLSKGCTFIDIRNE-ERKTIQKAQIIIEPQIKAGHLILVSIQAIISNNAIMKVVTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKS

Query:  YLTFIHFWAQLYI
        YL  +   A+ ++
Subjt:  YLTFIHFWAQLYI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCCTAACTACAAATTCTCCTCCTCAAGCTGTCAACTTGAGGGGCTCCCAATCATCGAGAAGATTTTCCTGTACATCCTTGACTAAAAGAACCTCATGTATATTACA
ACAAAAGAACTATGGCAACTGCAAGTTGAGGAAAACAAACAACAGATTATCCACCGATGTTAAGCTTCATTTCGTCTCGTCGTGCTTAAAGGATGATTCCTTTTCCAGCT
TAGATTCAAGTTCAAATTCTCCATCAGAAATCATTGAGAGATTCTACAAATGCATCAATGAAAAGAACTTGAAGGAATTGAGCAATTATATCTCAGAAGACTGCCTCATT
GAAGACTCCTCGTTCATTGAACCATTTATAGGGAAGAAGGCAGCTCTGAGTTTCTTTGAAGAACTAACTCAAAGCATGGGTCCAGATGTAAAGATTAAAGTTCATCACAT
TTACGAAATAGGGGCTTCCAGGGCAATAGCAATCTGGAATTTAGAGTGGAAGAACATGGAGATTCCCTTATCCAAGGGTTGCACCTTCATTGACATCAGAAATGAAGAAA
GAAAAACAATACAGAAGGCACAAATTATAATTGAACCACAAATAAAAGCAGGACATCTAATCTTGGTAAGTATACAAGCCATCATTTCCAACAATGCTATAATGAAGGTC
GTGACTTTATTGCTTGCTACGTATCCAGCAATTCCAGAATGGCTGATAAAAGTTTCTCAACAACGTTGGGTAATGTGGATGTCAAAGATCTGTATAATTCTCTTCAAGCG
TCTCTTGGACAGCTTTTTGAAGAGCTATCTAACCTTCATACACTTTTGGGCTCAACTGTATATTAATTCATCATTTCCATCACGACTCAAATACGTTTCTAACAATGAGA
AAACAGCCAATGGACCAAGAAGGGGAGGGAGTGGTGGAGGCTTACTGAACGCTGCAATTGGGTCTGCATCCTCTTTGATAACCACCGACTCTCCCTCAGAATCATTCTTC
ATTCCTTGCTTTTGTATTGATGAAGCCAGACTACCTAATTGTCATGAAATTGCAATGTAA
mRNA sequenceShow/hide mRNA sequence
AAAGGACCCCACCCTTTTCTCTTTCTGCAGTGAATTAAAATAATAATGTCCCTAACTACAAATTCTCCTCCTCAAGCTGTCAACTTGAGGGGCTCCCAATCATCGAGAAG
ATTTTCCTGTACATCCTTGACTAAAAGAACCTCATGTATATTACAACAAAAGAACTATGGCAACTGCAAGTTGAGGAAAACAAACAACAGATTATCCACCGATGTTAAGC
TTCATTTCGTCTCGTCGTGCTTAAAGGATGATTCCTTTTCCAGCTTAGATTCAAGTTCAAATTCTCCATCAGAAATCATTGAGAGATTCTACAAATGCATCAATGAAAAG
AACTTGAAGGAATTGAGCAATTATATCTCAGAAGACTGCCTCATTGAAGACTCCTCGTTCATTGAACCATTTATAGGGAAGAAGGCAGCTCTGAGTTTCTTTGAAGAACT
AACTCAAAGCATGGGTCCAGATGTAAAGATTAAAGTTCATCACATTTACGAAATAGGGGCTTCCAGGGCAATAGCAATCTGGAATTTAGAGTGGAAGAACATGGAGATTC
CCTTATCCAAGGGTTGCACCTTCATTGACATCAGAAATGAAGAAAGAAAAACAATACAGAAGGCACAAATTATAATTGAACCACAAATAAAAGCAGGACATCTAATCTTG
GTAAGTATACAAGCCATCATTTCCAACAATGCTATAATGAAGGTCGTGACTTTATTGCTTGCTACGTATCCAGCAATTCCAGAATGGCTGATAAAAGTTTCTCAACAACG
TTGGGTAATGTGGATGTCAAAGATCTGTATAATTCTCTTCAAGCGTCTCTTGGACAGCTTTTTGAAGAGCTATCTAACCTTCATACACTTTTGGGCTCAACTGTATATTA
ATTCATCATTTCCATCACGACTCAAATACGTTTCTAACAATGAGAAAACAGCCAATGGACCAAGAAGGGGAGGGAGTGGTGGAGGCTTACTGAACGCTGCAATTGGGTCT
GCATCCTCTTTGATAACCACCGACTCTCCCTCAGAATCATTCTTCATTCCTTGCTTTTGTATTGATGAAGCCAGACTACCTAATTGTCATGAAATTGCAATGTAA
Protein sequenceShow/hide protein sequence
MSLTTNSPPQAVNLRGSQSSRRFSCTSLTKRTSCILQQKNYGNCKLRKTNNRLSTDVKLHFVSSCLKDDSFSSLDSSSNSPSEIIERFYKCINEKNLKELSNYISEDCLI
EDSSFIEPFIGKKAALSFFEELTQSMGPDVKIKVHHIYEIGASRAIAIWNLEWKNMEIPLSKGCTFIDIRNEERKTIQKAQIIIEPQIKAGHLILVSIQAIISNNAIMKV
VTLLLATYPAIPEWLIKVSQQRWVMWMSKICIILFKRLLDSFLKSYLTFIHFWAQLYINSSFPSRLKYVSNNEKTANGPRRGGSGGGLLNAAIGSASSLITTDSPSESFF
IPCFCIDEARLPNCHEIAM