; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg16618 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg16618
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionSnoaL-like domain-containing protein
Genome locationCarg_Chr15:8870612..8873036
RNA-Seq ExpressionCarg16618
SyntenyCarg16618
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR032710 - NTF2-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579526.1 hypothetical protein SDJN03_23974, partial [Cucurbita argyrosperma subsp. sororia]3.0e-10243.34Show/hide
Query:  MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSE
        M LITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLATDVK RFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSE
Subjt:  MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSE

Query:  DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE---------------------------------------------------------
        DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE                                                         
Subjt:  DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE---------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------MWKNTKIPFTKGCTFIHINNEQRTIQK
                                                                                 +WKNTKIPFTKGCTFIHINNEQRTIQK
Subjt:  -------------------------------------------------------------------------MWKNTKIPFTKGCTFIHINNEQRTIQK

Query:  AQIIIEPQVKAGHLILGMMKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWVKGFFN
        AQIIIEPQVKAGHLILGMMKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICV+LY FFLHSLLRSYLTFIHC SLMFVFTLKLLRWVKGFFN
Subjt:  AQIIIEPQVKAGHLILGMMKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWVKGFFN

KAG7016986.1 hypothetical protein SDJN02_22097, partial [Cucurbita argyrosperma subsp. argyrosperma]4.8e-145100Show/hide
Query:  MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSE
        MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSE
Subjt:  MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSE

Query:  DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYEMWKNTKIPFTKGCTFIHINNEQRTIQKAQIIIEPQVKAGHLILGMMKLVTSLLAQYP
        DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYEMWKNTKIPFTKGCTFIHINNEQRTIQKAQIIIEPQVKAGHLILGMMKLVTSLLAQYP
Subjt:  DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYEMWKNTKIPFTKGCTFIHINNEQRTIQKAQIIIEPQVKAGHLILGMMKLVTSLLAQYP

Query:  AIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWVKGFFN
        AIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWVKGFFN
Subjt:  AIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWVKGFFN

XP_022929026.1 uncharacterized protein LOC111435747 [Cucurbita moschata]6.5e-14295.27Show/hide
Query:  MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSE
        MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSE
Subjt:  MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSE

Query:  DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE------------MWKNTKIPFTKGCTFIHINNEQRTIQKAQIIIEPQVKAGHLILGM
        DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE            +WKNTKIPFTKGCTFIHINNEQRTIQKAQIIIEPQVKAGHLILGM
Subjt:  DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE------------MWKNTKIPFTKGCTFIHINNEQRTIQKAQIIIEPQVKAGHLILGM

Query:  MKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWVKGFFN
        MKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWVKGFFN
Subjt:  MKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWVKGFFN

XP_022970084.1 uncharacterized protein LOC111469081 [Cucurbita maxima]2.5e-13389.45Show/hide
Query:  MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSE
        MALITSPPPQALTLGSQPFRTFAYTSIPK+PS LFQQKKLRKTSYTL TDVKPRFV SCLKDGSVCSLDSCSNSPSEMVKK YECINEKKLKELSSY+SE
Subjt:  MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSE

Query:  DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE------------MWKNTKIPFTKGCTFIHINNEQRTIQKAQIIIEPQVKAGHLILGM
        DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE            +WKNTKIPFTKGCTFI INNE+RTIQKAQII+EPQVKAGHLILGM
Subjt:  DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE------------MWKNTKIPFTKGCTFIHINNEQRTIQKAQIIIEPQVKAGHLILGM

Query:  MKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWVKGFFN
        MKLVTSLLAQYPAIHKWVMKLSQQRWVRW+AKICV+LY FFLHSL+RSYLTFIHC SLMFVFT+KLLRWV GFFN
Subjt:  MKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWVKGFFN

XP_023550969.1 uncharacterized protein LOC111808949 [Cucurbita pepo subsp. pepo]2.7e-13289.82Show/hide
Query:  MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSE
        MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLAT+VKPRFV SCLKDGS CSLDSCS SPSEMV+KLYECINEKKLKELSSYMSE
Subjt:  MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSE

Query:  DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE------------MWKNTKIPFTKGCTFIHINNEQRTIQKAQIIIEPQVKAGHLILGM
        DCLIEDSLFDE FIGKEAALKFF+ELTQSMG DVKFRFRNVYE            +WKNTKIPFTKGCTFI INNEQRTIQKAQIIIEPQVKAGHLILGM
Subjt:  DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE------------MWKNTKIPFTKGCTFIHINNEQRTIQKAQIIIEPQVKAGHLILGM

Query:  MKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWVKGFFN
        MKLVTSLLAQYPAIHKWVMKLS QRWVRW+AKICV+LY FFLHSLLRSYLTFIHC SLMFVFTLKLLRWV GFFN
Subjt:  MKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWVKGFFN

TrEMBL top hitse value%identityAlignment
A0A0A0KMI4 SnoaL-like domain-containing protein1.5e-6456.13Show/hide
Query:  MALITSPPPQALTL-GSQPFRTF-AYTSIPKRPSYLFQQKK------LRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLK
        M+LITS  PQA+   GSQ FR F +YT + KR S +FQQKK       RKT+ TL        V SCL D S     S SNSP EM+++ Y+CINEK LK
Subjt:  MALITSPPPQALTL-GSQPFRTF-AYTSIPKRPSYLFQQKK------LRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLK

Query:  ELSSYMSEDCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYEM------------WKNTKIPFTKGCTFIHINNEQR-TIQKAQIIIEPQV
        E+S+Y+SEDCLIEDSLF E F GK+AA+ F ++LT+SMGPDVKFR R VYE             W+N +IP TKGCTFI I +E+R TIQK QII EPQ 
Subjt:  ELSSYMSEDCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYEM------------WKNTKIPFTKGCTFIHINNEQR-TIQKAQIIIEPQV

Query:  KAGHLILGMMKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMF
        KAGHLIL +MKLVT LLA+  AI +W++K SQQRWV+W++KICV L+N  L S  +SYLTFIH  + ++
Subjt:  KAGHLILGMMKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMF

A0A1S3ATV9 uncharacterized protein LOC103482777 isoform X13.8e-6353.11Show/hide
Query:  MALITSPPPQALTLGSQP-FRTFAYT-SIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYM
        M+LITS  PQA+  G  P FR F+YT  + KR S +FQQKK         T+V  R VSSCL D S     S SNSP EM++  Y+CINEK LK++++Y+
Subjt:  MALITSPPPQALTLGSQP-FRTFAYT-SIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYM

Query:  SEDCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYEM------------WKNTKIPFTKGCTFIHINNEQR-TIQKAQIIIEPQVKAGHLI
        SEDCLIEDSLF E F GK+AA+ F ++LT+SMGPDVKFR R VYE             W+N +IP TKGCTFI I +E+R TIQ  QII E Q+KAGHL 
Subjt:  SEDCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYEM------------WKNTKIPFTKGCTFIHINNEQR-TIQKAQIIIEPQVKAGHLI

Query:  LGMMKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWV
        L +MKLVT LLA++PAI +W+ K+SQQRWV+ ++KIC+ L+   L + L+SYLTFIH  + ++   L  L +V
Subjt:  LGMMKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWV

A0A6J1DVV2 uncharacterized protein LOC111023569 isoform X13.0e-6854.23Show/hide
Query:  MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKK---------LRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKL
        MALI SPPP  ++LGSQ FR   YT++PK  S +FQQKK         +RKT+   +TDVK RFV SCL D S   LDS SN  SEM++  Y CINEK L
Subjt:  MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKK---------LRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKL

Query:  KELSSYMSEDCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE------------MWKNTKIPFTKGCTFIHINNE-QRTIQKAQIIIEPQ
        +EL SY+SEDC+IEDSLF E F G++ AL+FF+ELTQSMG  VKFR  NVYE             WK+ +IPF+KGCTFI+I NE +R IQKAQII+EPQ
Subjt:  KELSSYMSEDCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE------------MWKNTKIPFTKGCTFIHINNE-QRTIQKAQIIIEPQ

Query:  VKAGHLILGMMKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLM-FVFTLKLLRWVKGF
        VKAGH IL ++KLVTSLL  +PAI +W++KL Q  WV+W++KIC+ L+     S L S L F +  +   +V  L  LR++ GF
Subjt:  VKAGHLILGMMKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLM-FVFTLKLLRWVKGF

A0A6J1ELY0 uncharacterized protein LOC1114357473.2e-14295.27Show/hide
Query:  MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSE
        MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSE
Subjt:  MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSE

Query:  DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE------------MWKNTKIPFTKGCTFIHINNEQRTIQKAQIIIEPQVKAGHLILGM
        DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE            +WKNTKIPFTKGCTFIHINNEQRTIQKAQIIIEPQVKAGHLILGM
Subjt:  DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE------------MWKNTKIPFTKGCTFIHINNEQRTIQKAQIIIEPQVKAGHLILGM

Query:  MKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWVKGFFN
        MKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWVKGFFN
Subjt:  MKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWVKGFFN

A0A6J1I4H1 uncharacterized protein LOC1114690811.2e-13389.45Show/hide
Query:  MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSE
        MALITSPPPQALTLGSQPFRTFAYTSIPK+PS LFQQKKLRKTSYTL TDVKPRFV SCLKDGSVCSLDSCSNSPSEMVKK YECINEKKLKELSSY+SE
Subjt:  MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSE

Query:  DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE------------MWKNTKIPFTKGCTFIHINNEQRTIQKAQIIIEPQVKAGHLILGM
        DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE            +WKNTKIPFTKGCTFI INNE+RTIQKAQII+EPQVKAGHLILGM
Subjt:  DCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE------------MWKNTKIPFTKGCTFIHINNEQRTIQKAQIIIEPQVKAGHLILGM

Query:  MKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWVKGFFN
        MKLVTSLLAQYPAIHKWVMKLSQQRWVRW+AKICV+LY FFLHSL+RSYLTFIHC SLMFVFT+KLLRWV GFFN
Subjt:  MKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWVKGFFN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G71480.1 Nuclear transport factor 2 (NTF2) family protein5.0e-1526.17Show/hide
Query:  PQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSY------TLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSEDC
        P  ++L   P   F  T + K  S    Q      SY        A DV P               ++   S SE+V   Y  +N   L  ++  +++DC
Subjt:  PQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSY------TLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSEDC

Query:  LIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNV------------YEMWKNTKIPFTKGCTF--IHINNEQRTIQKAQIIIEPQVKAGHLILGM
        + ED +F   F+G++A L FF +  +S   D++F   ++            +  WK    PF+KGC+F  + + + +R I   +  +EP +K G  +L  
Subjt:  LIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNV------------YEMWKNTKIPFTKGCTF--IHINNEQRTIQKAQIIIEPQVKAGHLILGM

Query:  MKLVTSLLAQYPAI
        +K VT LL ++P +
Subjt:  MKLVTSLLAQYPAI

AT5G41470.1 Nuclear transport factor 2 (NTF2) family protein2.1e-2132.13Show/hide
Query:  VCSLDSCSNSPSEM-----VKKLYECINEKKLKELSSYMSEDCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE------------MWKN
        V  LD  ++ P+++     V K Y  INEK   +LSS +S DC I+D  F + F GK+ A++FF+EL +SMG +VKF   NV E             WK 
Subjt:  VCSLDSCSNSPSEM-----VKKLYECINEKKLKELSSYMSEDCLIEDSLFDEAFIGKEAALKFFKELTQSMGPDVKFRFRNVYE------------MWKN

Query:  TKIPFTKGCTFIHINNE--QRTIQKAQIIIEPQVKAGHLILGMMKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLR----SYLTFI
         KIPFT+GC+F    +E  +  I+ A+I+IE  +K G + L ++K +T L  ++P   K      ++ +   + +  + +Y  FL  L+     SYL  +
Subjt:  TKIPFTKGCTFIHINNE--QRTIQKAQIIIEPQVKAGHLILGMMKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAKICVLLYNFFLHSLLR----SYLTFI

Query:  HCASLMFVFTLKLLRWVKGFF
           +  F+  +K++  ++  F
Subjt:  HCASLMFVFTLKLLRWVKGFF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCTTATTACGAGCCCTCCTCCTCAAGCTCTCACCTTGGGCTCCCAACCATTTAGAACATTTGCATATACATCCATCCCTAAGAGACCCTCATATTTGTTTCAACA
AAAGAAGCTTAGGAAAACAAGCTACACATTAGCCACCGACGTTAAGCCCCGTTTCGTGTCGTCGTGCTTAAAGGACGGCTCGGTTTGTAGCTTAGATTCATGTTCAAATT
CTCCATCAGAAATGGTTAAGAAATTGTACGAGTGCATCAATGAAAAGAAGTTGAAGGAGTTGAGTAGTTACATGTCAGAAGATTGCCTTATTGAGGACTCCTTGTTTGAT
GAAGCATTTATAGGGAAGGAGGCTGCTCTCAAGTTCTTTAAAGAACTAACTCAAAGTATGGGTCCGGATGTGAAGTTTAGGTTTCGTAACGTCTATGAAATGTGGAAGAA
CACAAAGATCCCCTTCACCAAAGGTTGCACCTTCATACACATCAACAACGAACAAAGAACCATACAGAAGGCACAAATCATAATAGAACCACAAGTAAAAGCGGGGCATC
TCATCTTGGGTATGATGAAGCTGGTAACCTCCTTGCTTGCTCAGTATCCAGCAATTCATAAATGGGTGATGAAACTTTCCCAGCAGCGTTGGGTAAGGTGGGTGGCCAAG
ATCTGCGTACTTCTCTACAACTTCTTCTTGCACAGCCTCTTGCGGAGCTATCTAACCTTTATACATTGTGCCTCTCTAATGTTTGTTTTTACACTCAAATTGTTACGTTG
GGTTAAAGGCTTCTTCAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCCTTATTACGAGCCCTCCTCCTCAAGCTCTCACCTTGGGCTCCCAACCATTTAGAACATTTGCATATACATCCATCCCTAAGAGACCCTCATATTTGTTTCAACA
AAAGAAGCTTAGGAAAACAAGCTACACATTAGCCACCGACGTTAAGCCCCGTTTCGTGTCGTCGTGCTTAAAGGACGGCTCGGTTTGTAGCTTAGATTCATGTTCAAATT
CTCCATCAGAAATGGTTAAGAAATTGTACGAGTGCATCAATGAAAAGAAGTTGAAGGAGTTGAGTAGTTACATGTCAGAAGATTGCCTTATTGAGGACTCCTTGTTTGAT
GAAGCATTTATAGGGAAGGAGGCTGCTCTCAAGTTCTTTAAAGAACTAACTCAAAGTATGGGTCCGGATGTGAAGTTTAGGTTTCGTAACGTCTATGAAATGTGGAAGAA
CACAAAGATCCCCTTCACCAAAGGTTGCACCTTCATACACATCAACAACGAACAAAGAACCATACAGAAGGCACAAATCATAATAGAACCACAAGTAAAAGCGGGGCATC
TCATCTTGGGTATGATGAAGCTGGTAACCTCCTTGCTTGCTCAGTATCCAGCAATTCATAAATGGGTGATGAAACTTTCCCAGCAGCGTTGGGTAAGGTGGGTGGCCAAG
ATCTGCGTACTTCTCTACAACTTCTTCTTGCACAGCCTCTTGCGGAGCTATCTAACCTTTATACATTGTGCCTCTCTAATGTTTGTTTTTACACTCAAATTGTTACGTTG
GGTTAAAGGCTTCTTCAATTAA
Protein sequenceShow/hide protein sequence
MALITSPPPQALTLGSQPFRTFAYTSIPKRPSYLFQQKKLRKTSYTLATDVKPRFVSSCLKDGSVCSLDSCSNSPSEMVKKLYECINEKKLKELSSYMSEDCLIEDSLFD
EAFIGKEAALKFFKELTQSMGPDVKFRFRNVYEMWKNTKIPFTKGCTFIHINNEQRTIQKAQIIIEPQVKAGHLILGMMKLVTSLLAQYPAIHKWVMKLSQQRWVRWVAK
ICVLLYNFFLHSLLRSYLTFIHCASLMFVFTLKLLRWVKGFFN