CuGenDBv2

Spg033563 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg033563
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Unknown protein
Genome location: scaffold5:4467302..4469274
RNA-Seq Expression: Spg033563
Synteny: Spg033563
Gene Ontology terms: NA
InterPro domains: NA
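
The genome location above follows a `scaffold:start..end` pattern. As a minimal sketch, the string can be split into its parts and the genomic span computed. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(location: str):
    """Split a 'scaffold:start..end' string into (scaffold, start, end, span)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    scaffold = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    span = end - start + 1
    return scaffold, start, end, span

print(parse_location("scaffold5:4467302..4469274"))
# ('scaffold5', 4467302, 4469274, 1973)
```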


Homology
GenBank top hits | e value | % identity
KAG6580709.1 hypothetical protein SDJN03_20711, partial [Cucurbita argyrosperma subsp. sororia] | 3.6e-119 | 72.27
Query:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGGGDKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKSTRRS
        ML VSLYEVGSGKW+ VFRLHVAACDRTTAVSLLEELL+LM GGG DK  EVELGMEDLVPRKLAKK +L+RGL++I YSVNSLRLTNLKFKD KS RRS
Subjt:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGGGDKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKSTRRS

Query:  QVARLQMNHSETQKILS-----------------------------------------------TKTPLELS--GFYHAAILNSYTVRGGEDLWELAKKI
        QVARLQMN +ET KILS                                                + PL     GFYHAAILNSYT+RGGE+LWELAKKI
Subjt:  QVARLQMNHSETQKILS-----------------------------------------------TKTPLELS--GFYHAAILNSYTVRGGEDLWELAKKI

Query:  STTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCVYPAP
        STTLEASKNSNKHFTDMSDLNFLLCR +ENPSLT SGA+RTSLMTVFEDTVVDNSGAMQ EIG++DYMGCAS HG+GPSVAVFDT+RDG LDC CVYPAP
Subjt:  STTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCVYPAP

Query:  LHSREQMEALVDNMKAILVKG
        LHSREQMEALVDNMKA+LVKG
Subjt:  LHSREQMEALVDNMKAILVKG

KAG7017467.1 hypothetical protein SDJN02_19332, partial [Cucurbita argyrosperma subsp. argyrosperma] | 2.3e-118 | 72.64
Query:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGGGDKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKSTRRS
        ML VSLYEVGSGKW+ VFRLHVAACDRTTAVSLLEELL+LM GGG DK  EVELGMEDLVPRKLAKK +L+RGL++I YSVNSLRLTNLKFKD KS RRS
Subjt:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGGGDKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKSTRRS

Query:  QVARLQMNHSETQKILS-----------------------------------------------TKTPLELS--GFYHAAILNSYTVRGGEDLWELAKKI
        QVARLQMN +ET KILS                                                + PL     GFYHAAILNSYT+RGGE+LWELAKKI
Subjt:  QVARLQMNHSETQKILS-----------------------------------------------TKTPLELS--GFYHAAILNSYTVRGGEDLWELAKKI

Query:  STTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCVYPAP
        STTLEASKNSNKHFTDMSDLNFLLCR +ENPSLT SGA+RTSLMTVFEDTVVDNSGAMQ EIG++DYMGCAS HGIGPSVAVFDT+RDGRLDC CVYPAP
Subjt:  STTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCVYPAP

Query:  LHSREQMEALVDNMKAIL
        LHSREQMEALVDNMKA+L
Subjt:  LHSREQMEALVDNMKAIL

XP_022935262.1 uncharacterized protein LOC111442200 [Cucurbita moschata] | 4.2e-120 | 72.9
Query:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGGGDKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKSTRRS
        ML VSLYEVGSGKW+ VFRLHVAACDRTTAVSLLEELL+LM GGG DK  EVELGMEDLVPRKLAKK +L+RGL++I YSVNSLRLTNLKFKD KS RRS
Subjt:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGGGDKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKSTRRS

Query:  QVARLQMNHSETQKILS-----------------------------------------------TKTPLELS--GFYHAAILNSYTVRGGEDLWELAKKI
        QVARLQMN +ET KILS                                                + PL     GFYHAAILNSYT+RGGE+LWELAKKI
Subjt:  QVARLQMNHSETQKILS-----------------------------------------------TKTPLELS--GFYHAAILNSYTVRGGEDLWELAKKI

Query:  STTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCVYPAP
        STTLEASKNSNKHFTDMSDLNFLLCR +ENPSLT SGA+RTSLMTVFEDTVVDNSGAMQ EIG++DYMGCAS HGIGPSVAVFDT+RDGRLDC CVYPAP
Subjt:  STTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCVYPAP

Query:  LHSREQMEALVDNMKAILVKG
        LHSREQMEALVDNMKA+LVKG
Subjt:  LHSREQMEALVDNMKAILVKG

XP_022982942.1 uncharacterized protein LOC111481636 [Cucurbita maxima] | 2.8e-119 | 72.59
Query:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGGGDKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKSTRRS
        ML VSLYEVGSGKW+ VFRLHVAACDRTTAVSLLEELL+LM GGG DK  EVELGMEDLVPR LAKK +L+RGL++I YSVNSLRLTNLKFKD KS RRS
Subjt:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGGGDKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKSTRRS

Query:  QVARLQMNHSETQKILS-----------------------------------------------TKTPLELS--GFYHAAILNSYTVRGGEDLWELAKKI
        QVARLQMN +ET KILS                                                + PL  +  GFYHAAILNSYT+RGGE+LWELAKKI
Subjt:  QVARLQMNHSETQKILS-----------------------------------------------TKTPLELS--GFYHAAILNSYTVRGGEDLWELAKKI

Query:  STTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCVYPAP
        STTLEASKNSNKHFTDMSDLNFLLCR +ENPSLT SGA+RTSLMTVFEDTVVDNSGAMQ EIGV+DY+GCAS HGIGPSVAVFDT+RDGRLDC CVYPAP
Subjt:  STTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCVYPAP

Query:  LHSREQMEALVDNMKAILVKG
        LHSREQMEALVDNMKA+LVKG
Subjt:  LHSREQMEALVDNMKAILVKG

XP_023527975.1 uncharacterized protein LOC111791031 [Cucurbita pepo subsp. pepo] | 8.0e-119 | 71.65
Query:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGGGDKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKSTRRS
        ML VSLYEVGSGKW+ VFRLHVAACDRTTAVSLLEELL+LM GGG DK  E+ELGMEDLVPRKL KK +L+RGL++I YSVNSLRLTNLKFKD KS RRS
Subjt:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGGGDKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKSTRRS

Query:  QVARLQMNHSETQKILS-----------------------------------------------TKTPLELS--GFYHAAILNSYTVRGGEDLWELAKKI
        QVARLQMN +ET KILS                                                + PL     GFYHAAILNSYT+RGGE+LWELAKKI
Subjt:  QVARLQMNHSETQKILS-----------------------------------------------TKTPLELS--GFYHAAILNSYTVRGGEDLWELAKKI

Query:  STTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCVYPAP
        STTLEASKNSNKHFTDMSDLNFLLCR +ENPSLT SGA+RTSLMTVFEDTVVDNSGAMQ EIGV+DY+GCAS HG+GPS+AVFDT+RDGRLDC CVYPAP
Subjt:  STTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCVYPAP

Query:  LHSREQMEALVDNMKAILVKG
        LHSREQMEALVDNMKA+LVKG
Subjt:  LHSREQMEALVDNMKAILVKG

TrEMBL top hits | e value | % identity
A0A6J1CU67 uncharacterized protein LOC111014265 isoform X2 | 4.3e-110 | 66.77
Query:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGGG----DKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKS
        M   +LYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLM          +K E+ +GMEDLVP KL KKPLLARGLDM+ YS+NSLRLTNLKFKD KS
Subjt:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGGG----DKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKS

Query:  TRRSQVARLQMNHSETQKILSTKTPLELS-------------------------------------------------GFYHAAILNSYTVRGGEDLWEL
        +RRSQVARLQ+N ++T K+LS     E+                                                  GFYH+AILNSYTVRGGEDLWEL
Subjt:  TRRSQVARLQMNHSETQKILSTKTPLELS-------------------------------------------------GFYHAAILNSYTVRGGEDLWEL

Query:  AKKISTTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCV
        AKK STTLEA KNSNKHF+DMSDLNFL+CR I+NPSLTAS A+RTSLMTVFEDTVVDNS  MQEEI VEDYMGCAS+HGIGPS+AVFDTVRDGRLDC+ V
Subjt:  AKKISTTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCV

Query:  YPAPLHSREQMEALVDNMKAILVKG
        YPAPLHSREQMEAL+D+M+A+LVKG
Subjt:  YPAPLHSREQMEALVDNMKAILVKG

A0A6J1F529 uncharacterized protein LOC111442200 | 2.1e-120 | 72.9
Query:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGGGDKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKSTRRS
        ML VSLYEVGSGKW+ VFRLHVAACDRTTAVSLLEELL+LM GGG DK  EVELGMEDLVPRKLAKK +L+RGL++I YSVNSLRLTNLKFKD KS RRS
Subjt:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGGGDKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKSTRRS

Query:  QVARLQMNHSETQKILS-----------------------------------------------TKTPLELS--GFYHAAILNSYTVRGGEDLWELAKKI
        QVARLQMN +ET KILS                                                + PL     GFYHAAILNSYT+RGGE+LWELAKKI
Subjt:  QVARLQMNHSETQKILS-----------------------------------------------TKTPLELS--GFYHAAILNSYTVRGGEDLWELAKKI

Query:  STTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCVYPAP
        STTLEASKNSNKHFTDMSDLNFLLCR +ENPSLT SGA+RTSLMTVFEDTVVDNSGAMQ EIG++DYMGCAS HGIGPSVAVFDT+RDGRLDC CVYPAP
Subjt:  STTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCVYPAP

Query:  LHSREQMEALVDNMKAILVKG
        LHSREQMEALVDNMKA+LVKG
Subjt:  LHSREQMEALVDNMKAILVKG

A0A6J1FZI6 uncharacterized protein LOC111449315 | 3.3e-118 | 72.81
Query:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMK-GGGGDKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKSTRR
        M  V+LYEVGS KWVAVFRLHVAACDRTTAVSLLEELLVLM  GG GDKK+E+ELGME+LVPRKLAKKPLL RGLDMI YS+NSLRLTNLKFKD KS RR
Subjt:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMK-GGGGDKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKSTRR

Query:  SQVARLQMNHSETQKIL-----------------------------------------------STKTPLELS--GFYHAAILNSYTVRGGEDLWELAKK
        SQVARLQMNH++TQKIL                                               S + PL     GFYHAAILNSYTVRGGEDLWELA K
Subjt:  SQVARLQMNHSETQKIL-----------------------------------------------STKTPLELS--GFYHAAILNSYTVRGGEDLWELAKK

Query:  ISTTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCVYPA
        IS+TLEASKN NKHFTDMSDLNFLLCR IENPSLT+SGA+RTSLMTVFEDTV+DNSG MQEEIGV DYMGCASIHGIGPS+AVFDTVRDG+LDCVCVYPA
Subjt:  ISTTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCVYPA

Query:  PLHSREQMEALVDNMKAILV
        PLHSREQMEALV+NMK  L+
Subjt:  PLHSREQMEALVDNMKAILV

A0A6J1IXX9 uncharacterized protein LOC111481636 | 1.3e-119 | 72.59
Query:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGGGDKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKSTRRS
        ML VSLYEVGSGKW+ VFRLHVAACDRTTAVSLLEELL+LM GGG DK  EVELGMEDLVPR LAKK +L+RGL++I YSVNSLRLTNLKFKD KS RRS
Subjt:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGGGDKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKSTRRS

Query:  QVARLQMNHSETQKILS-----------------------------------------------TKTPLELS--GFYHAAILNSYTVRGGEDLWELAKKI
        QVARLQMN +ET KILS                                                + PL  +  GFYHAAILNSYT+RGGE+LWELAKKI
Subjt:  QVARLQMNHSETQKILS-----------------------------------------------TKTPLELS--GFYHAAILNSYTVRGGEDLWELAKKI

Query:  STTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCVYPAP
        STTLEASKNSNKHFTDMSDLNFLLCR +ENPSLT SGA+RTSLMTVFEDTVVDNSGAMQ EIGV+DY+GCAS HGIGPSVAVFDT+RDGRLDC CVYPAP
Subjt:  STTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCVYPAP

Query:  LHSREQMEALVDNMKAILVKG
        LHSREQMEALVDNMKA+LVKG
Subjt:  LHSREQMEALVDNMKAILVKG

A0A6J1KVW7 uncharacterized protein LOC111498664 | 2.1e-117 | 72.19
Query:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGGG-DKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKSTRR
        M  V+LYEVGS KWVAVFRLHVAACDRTTAVSLL+ELL LM  GGG DKKEE+ELGME+LVPRKLAKKPLL RGLDMI YS+NSLRLTNLKFKD KS RR
Subjt:  MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGGG-DKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKSTRR

Query:  SQVARLQMNHSETQKIL--STKTPLELS-----------------------------------------------GFYHAAILNSYTVRGGEDLWELAKK
        SQVARLQMNH++TQKIL    +  ++LS                                               GFYHAAILNSYTVRGGEDLWELA K
Subjt:  SQVARLQMNHSETQKIL--STKTPLELS-----------------------------------------------GFYHAAILNSYTVRGGEDLWELAKK

Query:  ISTTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCVYPA
        IS+TLEASKNSNKHFTDMSDLNFLLCR IENPSLT+SGA+RTSLMTVFEDTV+DNSG MQEEIGV DYMGCASIHGIGPS+AVFDT+RDG+LDCVCVYPA
Subjt:  ISTTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMGCASIHGIGPSVAVFDTVRDGRLDCVCVYPA

Query:  PLHSREQMEALVDNMKAILV
        PLHSREQMEALV+NMK  L+
Subjt:  PLHSREQMEALVDNMKAILV

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT3G52610.1 unknown protein | 6.0e-64 | 42.12
Query:  VSLYEV--GSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGG-------GDKKEEVELG--MEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFK
        VSLY++     + +  FRL+ AA DRT AV+LL E +      G          +  V LG  +E+L+P     KP  ARG+D++GYS+N+ R +NL F 
Subjt:  VSLYEV--GSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGG-------GDKKEEVELG--MEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFK

Query:  DCK-STRRSQVARLQMNHSETQKILS----------------------------------------------TKTPLELS--GFYHAAILNSYTVRGGED
        D + S RRSQ+ RL+++  +T K+++                                               + PL  +  GFYHA IL+++ + G E 
Subjt:  DCK-STRRSQVARLQMNHSETQKILS----------------------------------------------TKTPLELS--GFYHAAILNSYTVRGGED

Query:  LWELAKKISTTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQ-EEIGVEDYMGCASIHGIGPSVAVFDTVRDGRL
        LW+LAK+   +  +SKNSNK FTDMSDLNFL+C+ IENP+LT S +LRT+ +++FED V+D S   +   +GV+DY+GCASIHG+GPSVAVFD +RDG+L
Subjt:  LWELAKKISTTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQ-EEIGVEDYMGCASIHGIGPSVAVFDTVRDGRL

Query:  DCVCVYPAPLHSREQMEALVDNMKAILVKG
        DC  VYP+PLHSREQM+ L+ +MK IL++G
Subjt:  DCVCVYPAPLHSREQMEALVDNMKAILVKG


Sequences
CDS sequence
ATGTTAACAGTTTCGTTGTACGAGGTTGGGTCTGGGAAGTGGGTGGCCGTGTTCCGGCTACATGTGGCGGCGTGTGACCGGACGACGGCGGTGTCGCTGCTGGAGGAGCT
GCTAGTGTTGATGAAGGGCGGCGGCGGAGATAAGAAAGAGGAAGTGGAGTTGGGAATGGAGGATCTCGTTCCGAGAAAGTTGGCGAAGAAGCCATTGTTGGCAAGAGGAC
TGGACATGATTGGCTACTCTGTGAACTCTTTGAGATTGACGAATCTTAAGTTTAAAGATTGCAAATCTACGAGACGATCGCAGGTCGCGAGGCTTCAGATGAACCACAGC
GAAACCCAGAAGATTCTCTCCACTAAAACACCCTTGGAACTTTCAGGATTTTACCATGCTGCCATCCTGAACTCCTACACTGTAAGAGGAGGAGAAGACCTGTGGGAGCT
AGCAAAGAAGATCTCAACGACATTGGAGGCTTCCAAGAACTCAAACAAGCACTTCACTGACATGTCAGACCTTAACTTTCTGCTGTGCCGCACCATCGAGAACCCGAGCC
TCACAGCGTCGGGGGCGTTGAGGACATCCCTGATGACGGTGTTCGAGGACACGGTGGTCGACAACTCGGGTGCAATGCAGGAGGAGATTGGCGTCGAGGACTACATGGGC
TGCGCCTCCATCCACGGCATCGGCCCCTCCGTTGCCGTGTTCGATACCGTTAGAGACGGGCGACTGGACTGTGTGTGTGTTTATCCAGCTCCATTGCATTCCAGGGAGCA
AATGGAGGCTCTAGTTGATAACATGAAGGCTATTCTTGTTAAAGGATGA
mRNA sequence
ATGTTAACAGTTTCGTTGTACGAGGTTGGGTCTGGGAAGTGGGTGGCCGTGTTCCGGCTACATGTGGCGGCGTGTGACCGGACGACGGCGGTGTCGCTGCTGGAGGAGCT
GCTAGTGTTGATGAAGGGCGGCGGCGGAGATAAGAAAGAGGAAGTGGAGTTGGGAATGGAGGATCTCGTTCCGAGAAAGTTGGCGAAGAAGCCATTGTTGGCAAGAGGAC
TGGACATGATTGGCTACTCTGTGAACTCTTTGAGATTGACGAATCTTAAGTTTAAAGATTGCAAATCTACGAGACGATCGCAGGTCGCGAGGCTTCAGATGAACCACAGC
GAAACCCAGAAGATTCTCTCCACTAAAACACCCTTGGAACTTTCAGGATTTTACCATGCTGCCATCCTGAACTCCTACACTGTAAGAGGAGGAGAAGACCTGTGGGAGCT
AGCAAAGAAGATCTCAACGACATTGGAGGCTTCCAAGAACTCAAACAAGCACTTCACTGACATGTCAGACCTTAACTTTCTGCTGTGCCGCACCATCGAGAACCCGAGCC
TCACAGCGTCGGGGGCGTTGAGGACATCCCTGATGACGGTGTTCGAGGACACGGTGGTCGACAACTCGGGTGCAATGCAGGAGGAGATTGGCGTCGAGGACTACATGGGC
TGCGCCTCCATCCACGGCATCGGCCCCTCCGTTGCCGTGTTCGATACCGTTAGAGACGGGCGACTGGACTGTGTGTGTGTTTATCCAGCTCCATTGCATTCCAGGGAGCA
AATGGAGGCTCTAGTTGATAACATGAAGGCTATTCTTGTTAAAGGATGA
Protein sequence
MLTVSLYEVGSGKWVAVFRLHVAACDRTTAVSLLEELLVLMKGGGGDKKEEVELGMEDLVPRKLAKKPLLARGLDMIGYSVNSLRLTNLKFKDCKSTRRSQVARLQMNHS
ETQKILSTKTPLELSGFYHAAILNSYTVRGGEDLWELAKKISTTLEASKNSNKHFTDMSDLNFLLCRTIENPSLTASGALRTSLMTVFEDTVVDNSGAMQEEIGVEDYMG
CASIHGIGPSVAVFDTVRDGRLDCVCVYPAPLHSREQMEALVDNMKAILVKG
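
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and reading frame 0; the names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB. To keep the example short it uses only the first 30 and the last 12 nucleotides of the CDS shown in this record.

```python
# Standard genetic code (NCBI translation table 1), listed in TCAG base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and normalise case
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]  # KeyError on non-ACGT bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGTTAACAGTTTCGTTGTACGAGGTTGGG"  # first 30 nt of the CDS above
cds_end = "GTTAAAGGATGA"                      # last 12 nt of the CDS above

print(translate(cds_start))  # MLTVSLYEVG
print(translate(cds_end))    # VKG
```

Both fragments agree with the start (MLTVSLYEVG) and end (VKG) of the protein sequence in this record. Passing the full CDS, with its line breaks, to `translate` works the same way because whitespace is removed first.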