; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018145 (gene) of Snake gourd v1 genome

Gene IDTan0018145
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSnoaL-like domain-containing protein
Genome locationLG07:67467185..67468747
RNA-Seq ExpressionTan0018145
SyntenyTan0018145
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR032710 - NTF2-like domain superfamily
IPR037401 - SnoaL-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011654728.2 uncharacterized protein LOC101214565 [Cucumis sativus]1.7e-9569.37Show/hide
Query:  TIIPPPQAVSL-GSQSLTTL-AYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKNL
        ++I  PQAV+  GSQS     +YT L K+ SC+FQQKK     YGN+ K KTNN L        V SCL DDS S   SSSN P EMIE+FYKCINEKNL
Subjt:  TIIPPPQAVSL-GSQSLTTL-AYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKNL

Query:  KELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEPQ
        KE+S+YIS+DCLIEDSLF E F GKKAA+ F E+LT+SMGPDVKFRI  VYER  S AGAIWHL W+NMEIP TKGCTFIDIR+EER+TI+K QII EPQ
Subjt:  KELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEPQ

Query:  VKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHCGVQLYINALKFLHYIIEFF
         KAGHLIL IMKLVTLLLAK  AI EWLIK SQQRWV WMSKIC+ LF LLLDSF KSYLTFIH G QLY   LKFL+YI++FF
Subjt:  VKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHCGVQLYINALKFLHYIIEFF

XP_022156726.1 uncharacterized protein LOC111023569 isoform X1 [Momordica charantia]8.5e-9566.67Show/hide
Query:  MACTIIPPPQAVSLGSQSLTTLAYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKN
        MA    PPP  +SLGSQS   L YTALPK +SCVFQQKKT  L+YG+W + KTN R STD KL FV SCL DDS S L+S+SNP +EMIE FY+CINEKN
Subjt:  MACTIIPPPQAVSLGSQSLTTLAYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKN

Query:  LKELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEP
        L+EL SYIS+DC+IEDSLF EPF G+K AL+FFEELTQSMG  VKFRI NVYE G SGAGAIW LAWK++EIP++KGCTFI+IRNE+RR I+KAQII+EP
Subjt:  LKELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEP

Query:  QVKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHC-GVQLYINALKFLHYIIEF
        QVKAGH ILA++KLVT LL  +PAIPEWL+K+ Q  WV W+SKICI LF LL +SFL+S L F +    + ++  L FL YI+ F
Subjt:  QVKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHC-GVQLYINALKFLHYIIEF

XP_022929026.1 uncharacterized protein LOC111435747 [Cucurbita moschata]1.4e-9464.21Show/hide
Query:  MACTIIPPPQAVSLGSQSLTTLAYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKN
        MA    PPPQA++LGSQ   T AYT++PK+ S +FQQKK R          KT+  L+TD K  FVSSCLKD SV  L+S SN P+EM++K Y+CINEK 
Subjt:  MACTIIPPPQAVSLGSQSLTTLAYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKN

Query:  LKELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEP
        LKELSSY+S+DCLIEDSLF E FIGK+AALKFF+ELTQSMGPDVKFR  NVYE GAS AG  WHL WKN +IP+TKGCTFI I NE+ RTI+KAQIIIEP
Subjt:  LKELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEP

Query:  QVKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHCGVQLYINALKFLHYIIEFF
        QVKAGHLIL +MKLVT LLA+YPAI +W++K+SQQRWV W++KIC++L+   L S L+SYLTFIHC   +++  LK L ++  FF
Subjt:  QVKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHCGVQLYINALKFLHYIIEFF

XP_022970084.1 uncharacterized protein LOC111469081 [Cucurbita maxima]1.4e-9765.61Show/hide
Query:  MACTIIPPPQAVSLGSQSLTTLAYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKN
        MA    PPPQA++LGSQ   T AYT++PKK S +FQQKK R          KT+  L TD K  FV SCLKD SV  L+S SN P+EM++KFY+CINEK 
Subjt:  MACTIIPPPQAVSLGSQSLTTLAYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKN

Query:  LKELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEP
        LKELSSYIS+DCLIEDSLF E FIGK+AALKFF+ELTQSMGPDVKFR  NVYE G S AGA WHL WKN +IP+TKGCTFIDI NEE RTI+KAQII+EP
Subjt:  LKELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEP

Query:  QVKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHCGVQLYINALKFLHYIIEFF
        QVKAGHLIL +MKLVT LLA+YPAI +W++K+SQQRWV W++KIC++L+K  L S ++SYLTFIHCG  +++  +K L ++I FF
Subjt:  QVKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHCGVQLYINALKFLHYIIEFF

XP_038906943.1 uncharacterized protein LOC120092809 [Benincasa hispida]1.3e-12281.49Show/hide
Query:  PPPQAVSL-GSQSLTTLAYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKNLKELS
        PPPQAV+L GSQSL   + T L K++SC+ QQKK     YGN +K KTNNRLSTD KLHFVSSCLKDDS SRL+S SN P+EMIE+FYKCINEKNLKELS
Subjt:  PPPQAVSL-GSQSLTTLAYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKNLKELS

Query:  SYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEPQVKAG
        SYIS+DC IEDSLF EPFIGKKAAL+FFEELT SMGPDVKFRIHN+YER  S  GAIWHL WKNMEIP+TKGCTFIDIRNEER+TI+KAQIIIEPQ+KAG
Subjt:  SYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEPQVKAG

Query:  HLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHCGVQLYINALKFLHYIIEFFK
        HLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWV WMSKICIILFKLLLDSFLKSYLTFIH G +LY N LKFLHY+I+ FK
Subjt:  HLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHCGVQLYINALKFLHYIIEFFK

TrEMBL top hitse value%identityAlignment
A0A0A0KMI4 SnoaL-like domain-containing protein2.1e-9170Show/hide
Query:  TIIPPPQAVSL-GSQSLTTL-AYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKNL
        ++I  PQAV+  GSQS     +YT L K+ SC+FQQKK     YGN+ K KTNN L        V SCL DDS S   SSSN P EMIE+FYKCINEKNL
Subjt:  TIIPPPQAVSL-GSQSLTTL-AYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKNL

Query:  KELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEPQ
        KE+S+YIS+DCLIEDSLF E F GKKAA+ F E+LT+SMGPDVKFRI  VYER  S AGAIWHL W+NMEIP TKGCTFIDIR+EER+TI+K QII EPQ
Subjt:  KELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEPQ

Query:  VKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHCGVQLY
         KAGHLIL IMKLVTLLLAK  AI EWLIK SQQRWV WMSKIC+ LF LLLDSF KSYLTFIH G QLY
Subjt:  VKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHCGVQLY

A0A1S3ATV9 uncharacterized protein LOC103482777 isoform X13.8e-9367.72Show/hide
Query:  TIIPPPQAVSL-GSQSLTTLAYTA-LPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKNL
        ++I  PQAV+  GS S    +YT  L K+ SC+FQQKK     YGN+ K KTN RL        VSSCL DDS S   SSSN P EMIE FYKCINEKNL
Subjt:  TIIPPPQAVSL-GSQSLTTLAYTA-LPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKNL

Query:  KELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEPQ
        K++++YIS+DCLIEDSLF E F GKKAA+ F E+LT+SMGPDVKFRI  VYER  S AGAIWHL W+NMEIP TKGCTFIDIR+EER+TI+  QII E Q
Subjt:  KELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEPQ

Query:  VKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHCGVQLYINALKFLHYIIEFFK
        +KAGHL LAIMKLVTLLLAK+PAI EWL KVSQQRWV  +SKICI LFK LLD+FLKSYLTFIH G QLY + L FL Y+IE FK
Subjt:  VKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHCGVQLYINALKFLHYIIEFFK

A0A6J1DVV2 uncharacterized protein LOC111023569 isoform X14.1e-9566.67Show/hide
Query:  MACTIIPPPQAVSLGSQSLTTLAYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKN
        MA    PPP  +SLGSQS   L YTALPK +SCVFQQKKT  L+YG+W + KTN R STD KL FV SCL DDS S L+S+SNP +EMIE FY+CINEKN
Subjt:  MACTIIPPPQAVSLGSQSLTTLAYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKN

Query:  LKELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEP
        L+EL SYIS+DC+IEDSLF EPF G+K AL+FFEELTQSMG  VKFRI NVYE G SGAGAIW LAWK++EIP++KGCTFI+IRNE+RR I+KAQII+EP
Subjt:  LKELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEP

Query:  QVKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHC-GVQLYINALKFLHYIIEF
        QVKAGH ILA++KLVT LL  +PAIPEWL+K+ Q  WV W+SKICI LF LL +SFL+S L F +    + ++  L FL YI+ F
Subjt:  QVKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHC-GVQLYINALKFLHYIIEF

A0A6J1ELY0 uncharacterized protein LOC1114357477.0e-9564.21Show/hide
Query:  MACTIIPPPQAVSLGSQSLTTLAYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKN
        MA    PPPQA++LGSQ   T AYT++PK+ S +FQQKK R          KT+  L+TD K  FVSSCLKD SV  L+S SN P+EM++K Y+CINEK 
Subjt:  MACTIIPPPQAVSLGSQSLTTLAYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKN

Query:  LKELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEP
        LKELSSY+S+DCLIEDSLF E FIGK+AALKFF+ELTQSMGPDVKFR  NVYE GAS AG  WHL WKN +IP+TKGCTFI I NE+ RTI+KAQIIIEP
Subjt:  LKELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEP

Query:  QVKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHCGVQLYINALKFLHYIIEFF
        QVKAGHLIL +MKLVT LLA+YPAI +W++K+SQQRWV W++KIC++L+   L S L+SYLTFIHC   +++  LK L ++  FF
Subjt:  QVKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHCGVQLYINALKFLHYIIEFF

A0A6J1I4H1 uncharacterized protein LOC1114690816.8e-9865.61Show/hide
Query:  MACTIIPPPQAVSLGSQSLTTLAYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKN
        MA    PPPQA++LGSQ   T AYT++PKK S +FQQKK R          KT+  L TD K  FV SCLKD SV  L+S SN P+EM++KFY+CINEK 
Subjt:  MACTIIPPPQAVSLGSQSLTTLAYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKN

Query:  LKELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEP
        LKELSSYIS+DCLIEDSLF E FIGK+AALKFF+ELTQSMGPDVKFR  NVYE G S AGA WHL WKN +IP+TKGCTFIDI NEE RTI+KAQII+EP
Subjt:  LKELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEP

Query:  QVKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHCGVQLYINALKFLHYIIEFF
        QVKAGHLIL +MKLVT LLA+YPAI +W++K+SQQRWV W++KIC++L+K  L S ++SYLTFIHCG  +++  +K L ++I FF
Subjt:  QVKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHCGVQLYINALKFLHYIIEFF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G71480.1 Nuclear transport factor 2 (NTF2) family protein9.9e-2530.45Show/hide
Query:  PQAVSLGSQSLTTLAYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKNLKELSSYI
        P  VSL     +    T L K  S    Q       YG   K  T N +  +T                   +    +E++  FY  +N  +L  ++  I
Subjt:  PQAVSLGSQSLTTLAYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKNLKELSSYI

Query:  SQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIR-NEERRTIRKAQIIIEPQVKAGHL
        +QDC+ ED +FS PF+G+KA L FF +  +S   D++F I ++    +S  G  WHL WK    P++KGC+F  +   + +R I   +  +EP +K G  
Subjt:  SQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIR-NEERRTIRKAQIIIEPQVKAGHL

Query:  ILAIMKLVTLLLAKYPAIPE
        +LA +K VT LL K+P + +
Subjt:  ILAIMKLVTLLLAKYPAIPE

AT5G41470.1 Nuclear transport factor 2 (NTF2) family protein2.3e-2935.84Show/hide
Query:  TKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKNLKELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAG
        ++ + V SCL D   SR    S    + + KFY  INEKN  +LSS IS DC I+D  F +PF GK+ A++FFEEL +SMG +VKF + NV E     A 
Subjt:  TKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKNLKELSSYISQDCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAG

Query:  AIWHLAWKNMEIPYTKGCTFIDIRNE-ERRTIRKAQIIIEPQVKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKS
          WHL WK  +IP+T+GC+F +  +E  R  IR A+I+IE  +K G + L+++K +T L  ++P   E  ++      +    +I  +    L++  + S
Subjt:  AIWHLAWKNMEIPYTKGCTFIDIRNE-ERRTIRKAQIIIEPQVKAGHLILAIMKLVTLLLAKYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKS

Query:  YLTFIHCGVQLYINALKFLHYIIEFF
        YL  +    + ++  +K +  I   F
Subjt:  YLTFIHCGVQLYINALKFLHYIIEFF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTGTACAATTATTCCTCCTCCTCAAGCTGTCAGCTTGGGCTCCCAATCTTTGACAACACTTGCCTATACAGCCTTACCTAAAAAAAACTCATGCGTATTTCAACA
AAAGAAGACCAGAGAACTAAGATATGGCAATTGGGAGAAGATGAAAACAAACAACAGATTATCCACCGACACGAAACTCCATTTCGTGTCATCATGCTTGAAGGATGATT
CGGTTTCCCGCTTGGAATCAAGTTCTAATCCTCCAACAGAAATGATCGAGAAATTCTACAAATGCATCAATGAAAAAAACTTGAAGGAATTGAGCAGTTACATCTCACAA
GATTGCCTCATTGAGGACTCTTTGTTTTCTGAACCATTTATAGGGAAGAAGGCAGCTCTAAAGTTCTTTGAAGAACTAACTCAAAGCATGGGTCCAGATGTGAAGTTTAG
AATTCATAACGTCTACGAAAGAGGCGCTTCCGGGGCAGGAGCAATCTGGCATTTAGCGTGGAAGAACATGGAGATTCCCTACACCAAGGGTTGCACTTTCATTGACATCA
GGAATGAAGAAAGAAGAACTATACGAAAGGCACAAATTATAATCGAACCACAAGTCAAAGCAGGACATCTCATCTTGGCTATAATGAAGCTTGTGACTTTATTACTTGCT
AAGTATCCAGCGATTCCTGAATGGCTGATAAAAGTTTCTCAACAACGTTGGGTAATGTGGATGTCAAAGATCTGTATAATTCTCTTCAAGCTTCTCTTGGATAGCTTTTT
GAAGAGCTATCTAACCTTCATACATTGTGGGGTTCAACTGTATATTAATGCACTCAAATTTTTGCATTATATTATAGAGTTTTTCAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCTGTACAATTATTCCTCCTCCTCAAGCTGTCAGCTTGGGCTCCCAATCTTTGACAACACTTGCCTATACAGCCTTACCTAAAAAAAACTCATGCGTATTTCAACA
AAAGAAGACCAGAGAACTAAGATATGGCAATTGGGAGAAGATGAAAACAAACAACAGATTATCCACCGACACGAAACTCCATTTCGTGTCATCATGCTTGAAGGATGATT
CGGTTTCCCGCTTGGAATCAAGTTCTAATCCTCCAACAGAAATGATCGAGAAATTCTACAAATGCATCAATGAAAAAAACTTGAAGGAATTGAGCAGTTACATCTCACAA
GATTGCCTCATTGAGGACTCTTTGTTTTCTGAACCATTTATAGGGAAGAAGGCAGCTCTAAAGTTCTTTGAAGAACTAACTCAAAGCATGGGTCCAGATGTGAAGTTTAG
AATTCATAACGTCTACGAAAGAGGCGCTTCCGGGGCAGGAGCAATCTGGCATTTAGCGTGGAAGAACATGGAGATTCCCTACACCAAGGGTTGCACTTTCATTGACATCA
GGAATGAAGAAAGAAGAACTATACGAAAGGCACAAATTATAATCGAACCACAAGTCAAAGCAGGACATCTCATCTTGGCTATAATGAAGCTTGTGACTTTATTACTTGCT
AAGTATCCAGCGATTCCTGAATGGCTGATAAAAGTTTCTCAACAACGTTGGGTAATGTGGATGTCAAAGATCTGTATAATTCTCTTCAAGCTTCTCTTGGATAGCTTTTT
GAAGAGCTATCTAACCTTCATACATTGTGGGGTTCAACTGTATATTAATGCACTCAAATTTTTGCATTATATTATAGAGTTTTTCAAGTAA
Protein sequenceShow/hide protein sequence
MACTIIPPPQAVSLGSQSLTTLAYTALPKKNSCVFQQKKTRELRYGNWEKMKTNNRLSTDTKLHFVSSCLKDDSVSRLESSSNPPTEMIEKFYKCINEKNLKELSSYISQ
DCLIEDSLFSEPFIGKKAALKFFEELTQSMGPDVKFRIHNVYERGASGAGAIWHLAWKNMEIPYTKGCTFIDIRNEERRTIRKAQIIIEPQVKAGHLILAIMKLVTLLLA
KYPAIPEWLIKVSQQRWVMWMSKICIILFKLLLDSFLKSYLTFIHCGVQLYINALKFLHYIIEFFK