; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036299 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036299
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionATP-dependent Clp protease proteolytic subunit
Genome locationchr3:43651574..43655903
RNA-Seq ExpressionLag0036299
SyntenyLag0036299
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022145880.1 uncharacterized protein LOC111015232 [Momordica charantia]6.3e-13585.37Show/hide
Query:  SSSTERAQMGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSS
        +SST+RA MGNPSISLRFTIFLSLSLSVTPFALS NY+K+S PPPSQSPIP+ATPSDLLNLLGSK QASAVNPSVA+EL+SC KFLVPF PS      S 
Subjt:  SSSTERAQMGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSS

Query:  RRSLRSNRFSD--WHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARS
        RRSLRSNRFS     RR+EDELVWWPP+SVLELARLAVDSGGDPGAIHR LDPAVIPIP + GSK+HKCELTRTPYGRRFISEELNSYLQFLFE+IVARS
Subjt:  RRSLRSNRFSD--WHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARS

Query:  AAMGLNITLNRFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG
        AAMGLNITL+RFD FHGHLF+AFDNNRLGILFHAKEFPAYDEK FPYNMGYCQIGSNV YDDS+NLRNILWLAPMP+NST  WEAPG
Subjt:  AAMGLNITLNRFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG

XP_022961440.1 uncharacterized protein LOC111462023 [Cucurbita moschata]5.3e-13486.64Show/hide
Query:  MGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSSRRSLRSNR
        MGNPSISLRFT+FLSLSLS+TPFALS NY K        SPIPKA PSDLLNLLGSKSQAS VNPSVAKE+ SCFKFLVPF P+ SI KCSSRRSLRS+R
Subjt:  MGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSSRRSLRSNR

Query:  FSDWHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARSAAMGLNITLN
        F D  RREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPA+IPIP+VHGSKKHKCELTRTPYGRRFISEELNSYL+FLFELI ARSAA+GLNITLN
Subjt:  FSDWHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARSAAMGLNITLN

Query:  RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG
        RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYD SMN+RNILWLAP+P+ ST  WEA G
Subjt:  RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG

XP_022968759.1 uncharacterized protein LOC111467901 [Cucurbita maxima]1.1e-13487Show/hide
Query:  MGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSSRRSLRSNR
        MGNPSISLRFT+FLSLSLS+TPFALS NY K        SPIPKA PSDLLNLLGSKSQAS VNPSVAKE+ SCFKFLVPF P+ SIVK SSRRSLRS+R
Subjt:  MGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSSRRSLRSNR

Query:  FSDWHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARSAAMGLNITLN
        F D  RREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPA+IPIP+VHGSKKHKCELTRTPYGRRFISEELNSYL+FLFELI ARSAAMGLN+TLN
Subjt:  FSDWHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARSAAMGLNITLN

Query:  RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG
        RFDLFHGHLFL+FDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYD SMN+RNILWLAP+P+NST  WEAPG
Subjt:  RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG

XP_023515948.1 uncharacterized protein LOC111779958 [Cucurbita pepo subsp. pepo]5.7e-13687.36Show/hide
Query:  MGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSSRRSLRSNR
        MGNPSISLRFT+FLSLSLS+TPFALS NY K        SPIPKA PSDLLNLLGSKSQAS VNPSVAKE++SCFKFLVPF P+ SI KCSSRRSLRS+R
Subjt:  MGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSSRRSLRSNR

Query:  FSDWHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARSAAMGLNITLN
        F D  RREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPA+IPIP+VHGSKKHKCELTRTPYGRRFISEELNSYL+FLFELI ARSAAMGLN+TLN
Subjt:  FSDWHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARSAAMGLNITLN

Query:  RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG
        RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYD SMN+RNILWLAP+P+NST  WEAPG
Subjt:  RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG

XP_038878944.1 uncharacterized protein LOC120071032 [Benincasa hispida]4.5e-13387Show/hide
Query:  MGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSSRRSLRSNR
        MGNPSIS RFTIFLSLSLSVTPFALS  YNK       +SPIPKATPSDLLNLLGSKSQAS+VNPSVAKEL+SCFKFLVPF  +  I K SSRRSLRS R
Subjt:  MGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSSRRSLRSNR

Query:  FSDWHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARSAAMGLNITLN
        F +  RREEDELVWWPPQSVLELARL VDSGGDPGAIHRTLDPA+IPIPD+HGS++HKCELTRTPYGRRFISEELNSYLQFLFELIVARS+AMGLNITLN
Subjt:  FSDWHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARSAAMGLNITLN

Query:  RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG
        RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFP NMGYCQIGSNV YDDSMNLRNILWLAPMP+NST  WEAPG
Subjt:  RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG

TrEMBL top hitse value%identityAlignment
A0A1S3BQN4 uncharacterized protein LOC1034921595.2e-12783.39Show/hide
Query:  MGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSSRRSLRSNR
        MG  SISL+FTIFLSLSLSVTPFAL  NYNK    PP   PIPKATPSDLLNLLGSKSQAS+VNP VAKEL+SCFKFLVPF P+ S  K S RRSLRS  
Subjt:  MGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSSRRSLRSNR

Query:  FSDWHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARSAAMGLNITLN
        F D  RREEDELVWWPPQSVLELARL VDSGGDPGAIHRTLDPA+IPIPD+HGS+ HKCELTRTPYGRRFISEELNSYLQ LFELI  RS+AMG+NI LN
Subjt:  FSDWHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARSAAMGLNITLN

Query:  RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG
        RFDLFHGHLFLAFDNNRLGILFHAKEFPAYD+K FP NMGYCQIGSNV YDDSMNLRNILWLAPMP++ST  WEAPG
Subjt:  RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG

A0A5D3CG84 T-box protein 415.2e-12783.39Show/hide
Query:  MGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSSRRSLRSNR
        MG  SISL+FTIFLSLSLSVTPFAL  NYNK    PP   PIPKATPSDLLNLLGSKSQAS+VNP VAKEL+SCFKFLVPF P+ S  K S RRSLRS  
Subjt:  MGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSSRRSLRSNR

Query:  FSDWHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARSAAMGLNITLN
        F D  RREEDELVWWPPQSVLELARL VDSGGDPGAIHRTLDPA+IPIPD+HGS+ HKCELTRTPYGRRFISEELNSYLQ LFELI  RS+AMG+NI LN
Subjt:  FSDWHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARSAAMGLNITLN

Query:  RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG
        RFDLFHGHLFLAFDNNRLGILFHAKEFPAYD+K FP NMGYCQIGSNV YDDSMNLRNILWLAPMP++ST  WEAPG
Subjt:  RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG

A0A6J1CVR7 uncharacterized protein LOC1110152323.0e-13585.37Show/hide
Query:  SSSTERAQMGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSS
        +SST+RA MGNPSISLRFTIFLSLSLSVTPFALS NY+K+S PPPSQSPIP+ATPSDLLNLLGSK QASAVNPSVA+EL+SC KFLVPF PS      S 
Subjt:  SSSTERAQMGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSS

Query:  RRSLRSNRFSD--WHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARS
        RRSLRSNRFS     RR+EDELVWWPP+SVLELARLAVDSGGDPGAIHR LDPAVIPIP + GSK+HKCELTRTPYGRRFISEELNSYLQFLFE+IVARS
Subjt:  RRSLRSNRFSD--WHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARS

Query:  AAMGLNITLNRFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG
        AAMGLNITL+RFD FHGHLF+AFDNNRLGILFHAKEFPAYDEK FPYNMGYCQIGSNV YDDS+NLRNILWLAPMP+NST  WEAPG
Subjt:  AAMGLNITLNRFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG

A0A6J1HA77 uncharacterized protein LOC1114620232.6e-13486.64Show/hide
Query:  MGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSSRRSLRSNR
        MGNPSISLRFT+FLSLSLS+TPFALS NY K        SPIPKA PSDLLNLLGSKSQAS VNPSVAKE+ SCFKFLVPF P+ SI KCSSRRSLRS+R
Subjt:  MGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSSRRSLRSNR

Query:  FSDWHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARSAAMGLNITLN
        F D  RREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPA+IPIP+VHGSKKHKCELTRTPYGRRFISEELNSYL+FLFELI ARSAA+GLNITLN
Subjt:  FSDWHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARSAAMGLNITLN

Query:  RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG
        RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYD SMN+RNILWLAP+P+ ST  WEA G
Subjt:  RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG

A0A6J1I0L2 uncharacterized protein LOC1114679015.2e-13587Show/hide
Query:  MGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSSRRSLRSNR
        MGNPSISLRFT+FLSLSLS+TPFALS NY K        SPIPKA PSDLLNLLGSKSQAS VNPSVAKE+ SCFKFLVPF P+ SIVK SSRRSLRS+R
Subjt:  MGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSSRRSLRSNR

Query:  FSDWHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARSAAMGLNITLN
        F D  RREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPA+IPIP+VHGSKKHKCELTRTPYGRRFISEELNSYL+FLFELI ARSAAMGLN+TLN
Subjt:  FSDWHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARSAAMGLNITLN

Query:  RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG
        RFDLFHGHLFL+FDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYD SMN+RNILWLAP+P+NST  WEAPG
Subjt:  RFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G27710.1 unknown protein6.3e-9359.51Show/hide
Query:  MGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPF---SPSRSIVKCSSRRSLR
        M N  + LRFTIF+S S++    A S   +  S  P   S  PKAT  DLL++LG  S AS +NP V++E++SC KFLVPF    P     +CS R  L 
Subjt:  MGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPF---SPSRSIVKCSSRRSLR

Query:  SNRFSDWHRR----EEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARSAAM
        S +     RR    EE+ L+WWPP+SVLELARLAVDSGGDPG+I RTL+P +IP+PDV  S+K KC+LTRTPYGR FI+EE+NSY +FLF LI +R  ++
Subjt:  SNRFSDWHRR----EEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARSAAM

Query:  GLNITLNRFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG
        GLN++L+R+DLFHGHLFLA ++ RLGILFHAKE+PAYD+K FPYNMGYCQ GS+V Y+DSMNLRNILWLAP+P+NS+  W APG
Subjt:  GLNITLNRFDLFHGHLFLAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATCGCCAGCAGCTCCACAGAGAGGGCTCAAATGGGTAACCCTTCAATCTCTCTCAGATTCACCATTTTTCTCTCCCTCTCTCTCTCAGTCACCCCCTTCGCTCTCTCTCT
CAATTACAACAAACAATCTCGCCCTCCGCCATCGCAATCGCCGATTCCCAAAGCCACTCCCTCCGATTTGCTCAATCTTCTCGGTTCTAAATCTCAAGCTTCGGCTGTGA
ACCCTAGCGTAGCGAAGGAGCTAGAATCCTGTTTCAAATTCCTTGTTCCCTTCAGTCCAAGTCGATCCATCGTGAAATGCTCGAGTAGACGGAGCTTGAGGTCGAACAGA
TTCAGTGATTGGCATCGGAGAGAGGAGGATGAGCTCGTCTGGTGGCCGCCTCAGTCGGTTCTCGAACTCGCTCGGCTTGCTGTTGATTCTGGTGGTGACCCTGGGGCTAT
CCATCGCACCCTTGATCCCGCTGTGATCCCTATACCTGACGTTCATGGATCAAAAAAACACAAATGCGAGCTCACAAGAACACCATATGGGAGACGCTTCATAAGTGAGG
AACTAAATTCATATCTTCAGTTCTTGTTTGAGCTCATTGTTGCTCGATCTGCTGCAATGGGGTTGAACATTACATTGAACCGTTTCGATTTATTTCATGGTCATCTGTTT
CTTGCCTTTGACAACAATAGGCTTGGTATTCTGTTTCATGCCAAGGAATTCCCAGCTTATGATGAAAAGGCCTTTCCATACAATATGGGCTATTGTCAAATAGGATCTAA
TGTACCATACGATGATTCAATGAACTTGAGAAATATCCTCTGGCTGGCACCAATGCCCAACAATTCTACCAATGGCTGGGAAGCACCAGGGGGATGGCCAAAGAACTTCC
TCAAAAATAGAAGAAACATCAATGTTATTAGCCGAAGCAACCCCAAAAGCTCTGAAGAACTGACTCCATATCGGCAGGCGAGCAGGCAAACTAACAATCCCAAAGTAAGT
GATGAAGGTCCTCTTCCTGACTCCTACAAAGAACACACCATTGCGGCCGTAACACGAGAGAGGAGAACCTTTGGATCCGATCCTGAGTACTCACCCACCATGTAA
mRNA sequenceShow/hide mRNA sequence
ATCGCCAGCAGCTCCACAGAGAGGGCTCAAATGGGTAACCCTTCAATCTCTCTCAGATTCACCATTTTTCTCTCCCTCTCTCTCTCAGTCACCCCCTTCGCTCTCTCTCT
CAATTACAACAAACAATCTCGCCCTCCGCCATCGCAATCGCCGATTCCCAAAGCCACTCCCTCCGATTTGCTCAATCTTCTCGGTTCTAAATCTCAAGCTTCGGCTGTGA
ACCCTAGCGTAGCGAAGGAGCTAGAATCCTGTTTCAAATTCCTTGTTCCCTTCAGTCCAAGTCGATCCATCGTGAAATGCTCGAGTAGACGGAGCTTGAGGTCGAACAGA
TTCAGTGATTGGCATCGGAGAGAGGAGGATGAGCTCGTCTGGTGGCCGCCTCAGTCGGTTCTCGAACTCGCTCGGCTTGCTGTTGATTCTGGTGGTGACCCTGGGGCTAT
CCATCGCACCCTTGATCCCGCTGTGATCCCTATACCTGACGTTCATGGATCAAAAAAACACAAATGCGAGCTCACAAGAACACCATATGGGAGACGCTTCATAAGTGAGG
AACTAAATTCATATCTTCAGTTCTTGTTTGAGCTCATTGTTGCTCGATCTGCTGCAATGGGGTTGAACATTACATTGAACCGTTTCGATTTATTTCATGGTCATCTGTTT
CTTGCCTTTGACAACAATAGGCTTGGTATTCTGTTTCATGCCAAGGAATTCCCAGCTTATGATGAAAAGGCCTTTCCATACAATATGGGCTATTGTCAAATAGGATCTAA
TGTACCATACGATGATTCAATGAACTTGAGAAATATCCTCTGGCTGGCACCAATGCCCAACAATTCTACCAATGGCTGGGAAGCACCAGGGGGATGGCCAAAGAACTTCC
TCAAAAATAGAAGAAACATCAATGTTATTAGCCGAAGCAACCCCAAAAGCTCTGAAGAACTGACTCCATATCGGCAGGCGAGCAGGCAAACTAACAATCCCAAAGTAAGT
GATGAAGGTCCTCTTCCTGACTCCTACAAAGAACACACCATTGCGGCCGTAACACGAGAGAGGAGAACCTTTGGATCCGATCCTGAGTACTCACCCACCATGTAA
Protein sequenceShow/hide protein sequence
IASSSTERAQMGNPSISLRFTIFLSLSLSVTPFALSLNYNKQSRPPPSQSPIPKATPSDLLNLLGSKSQASAVNPSVAKELESCFKFLVPFSPSRSIVKCSSRRSLRSNR
FSDWHRREEDELVWWPPQSVLELARLAVDSGGDPGAIHRTLDPAVIPIPDVHGSKKHKCELTRTPYGRRFISEELNSYLQFLFELIVARSAAMGLNITLNRFDLFHGHLF
LAFDNNRLGILFHAKEFPAYDEKAFPYNMGYCQIGSNVPYDDSMNLRNILWLAPMPNNSTNGWEAPGGWPKNFLKNRRNINVISRSNPKSSEELTPYRQASRQTNNPKVS
DEGPLPDSYKEHTIAAVTRERRTFGSDPEYSPTM