; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G000160 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G000160
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionUnknown protein
Genome locationchr07:194376..195110
RNA-Seq ExpressionLsi07G000160
SyntenyLsi07G000160
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044567.1 uncharacterized protein E6C27_scaffold46G003170 [Cucumis melo var. makuwa]1.8e-9882.87Show/hide
Query:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSA-FTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHK
        MEIKHK KIHPSP    PSSSSSVFKLLPAAILAL SLLSLDDREVLAYMIARSIQSSA  TSTRVSRKKSTKKPSIN GNSNV   TTTTTTTTTTYHK
Subjt:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSA-FTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHK

Query:  TPMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQAT-----DKTLPVVQRPPPVVDECVDVPLRPEREPAVEV
        TP+FSCDCFYCYTAYW RWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQ +     +KTLPV+  P  V DECVDV L     P VE 
Subjt:  TPMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQAT-----DKTLPVVQRPPPVVDECVDVPLRPEREPAVEV

Query:  EGSPVMEVEESGP-EEEVGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        EGS V EVEE+GP  E+VGGG+HQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  EGSPVMEVEESGP-EEEVGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL

KAG7018455.1 hypothetical protein SDJN02_20323, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-9880.72Show/hide
Query:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHKT
        MEIKHKGK+HPSPSSS PSSSSSVFKLLPAAILALVS+LSLD+REVLAYMIARSIQSSA TST  SRKKS KK SI GGNSNV          +T+YHKT
Subjt:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHKT

Query:  PMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQATDKTLPVVQRPPPVVDECVDVPLRPEREPAV-----EVE
        P+FSCDCFYCYTAYW RWDSSPNRELIHQAIEAFEDHLT+GEKPKKN G+GKRRDRI RQATDK+LPVVQRP PVVDECV VPL PER+ A      E +
Subjt:  PMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQATDKTLPVVQRPPPVVDECVDVPLRPEREPAV-----EVE

Query:  GSPVMEVEESGPEEEVGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        GSPV EV ESGP +EV  GDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GSPVMEVEESGPEEEVGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL

XP_022955886.1 uncharacterized protein LOC111457737 [Cucurbita moschata]1.3e-9981.12Show/hide
Query:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHKT
        MEIKHKGK+HPSPSSS PSSSSSVFKLLPAAILALVS+LSLD+REVLAYMIARSIQSSA TST  SRKKS KK SINGGNSNV          +T+YHKT
Subjt:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHKT

Query:  PMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQATDKTLPVVQRPPPVVDECVDVPLRPEREPAV-----EVE
        P+FSCDCFYCYTAYW RWDSSPNRELIHQAIEAFEDHLT+GEKPKKN G+GKRRDRI RQATDK+LPVVQRP PVVDECV VPL PER+ A      E +
Subjt:  PMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQATDKTLPVVQRPPPVVDECVDVPLRPEREPAV-----EVE

Query:  GSPVMEVEESGPEEEVGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        GSPV EV ESGP +EV  GDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GSPVMEVEESGPEEEVGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL

XP_022979917.1 uncharacterized protein LOC111479467 [Cucurbita maxima]4.4e-10081.53Show/hide
Query:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHKT
        MEIKHKGK+HPSPSSS PSSSSSVFKLLPAAILALVS+LSLD+REVLAYMIARSIQSSA TST  SRKKS KK SINGGNSNV          +T YHKT
Subjt:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHKT

Query:  PMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQATDKTLPVVQRPPPVVDECVDVPLRPEREPAV-----EVE
        PMFSCDCFYCYTAYW RWDSSPNRELIHQAIEAFEDHLT+GEKPKKN G+GKRRDRIGRQATDK+LPVVQRPPPV  ECV VP+ PER+ AV     E +
Subjt:  PMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQATDKTLPVVQRPPPVVDECVDVPLRPEREPAV-----EVE

Query:  GSPVMEVEESGPEEEVGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        GSPV EV ESGP +EV  GDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GSPVMEVEESGPEEEVGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL

XP_038904588.1 uncharacterized protein LOC120090946 [Benincasa hispida]7.2e-11188.98Show/hide
Query:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHKT
        MEIKHKGKIHPSPSSS PSSSSSVFKLLP AILALVSLLSLDDREVLAYMIARSIQSSAFTSTRVSRKKST+K SINGGN NV     TTTTTTTTYHKT
Subjt:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHKT

Query:  PMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQATDKTLPVVQRPPPVVDECVDVPLRPEREPAVEVEGSPVM
        PMFSCDCFYCYTAYW RWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQ T KTLPV+Q PPPV DECVDVPL  ER P VE EGSPV 
Subjt:  PMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQATDKTLPVVQRPPPVVDECVDVPLRPEREPAVEVEGSPVM

Query:  EVEESGPEEEV-GGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        EVEESGP EEV GGGDH+KGL TKVLPDVLGF NSRLWSLWSPNL
Subjt:  EVEESGPEEEV-GGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL

TrEMBL top hitse value%identityAlignment
A0A0A0KTW2 Uncharacterized protein8.9e-9984.27Show/hide
Query:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTS-TRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHK
        MEIKHK KIHPSP    PSSSSSVFKLLPAAILAL SLLSLDDREVLAYMIARSIQSSAFTS TRVSRKKSTKKP IN GNSNV   TTTTTTT TTYHK
Subjt:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTS-TRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHK

Query:  TPMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQAT--DKTLPVVQRPPPVVDECVDVPLRPEREPAVEVEGS
        TP+FSCDCFYCYTAYW RWDSSPNRELIHQAIEAFEDHLTNGEKPKKN GRGKRRDRIGRQ +  +KTLPVV  P  V DECVDVPL     P VE EGS
Subjt:  TPMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQAT--DKTLPVVQRPPPVVDECVDVPLRPEREPAVEVEGS

Query:  PVMEVEESGP-EEEVGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
         V EVEESGP  E+VGGG+HQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  PVMEVEESGP-EEEVGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL

A0A1S3BXS0 uncharacterized protein LOC1034945672.0e-9882.87Show/hide
Query:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSA-FTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHK
        MEIKHK KIHPSP    PSSSSSVFKLLPAAILAL SLLSLDDREVLAYMIARSIQSSA  TSTRVSRKKSTKKPSIN GNSNV   TTTTTTTTTTYHK
Subjt:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSA-FTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHK

Query:  TPMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQAT-----DKTLPVVQRPPPVVDECVDVPLRPEREPAVEV
        TP+FSCDCFYCYTAYW RWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQ +     +KTLPV+  P  V DECVDV L     P VE 
Subjt:  TPMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQAT-----DKTLPVVQRPPPVVDECVDVPLRPEREPAVEV

Query:  EGSPVMEVEESGP-EEEVGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        EGS V EVEE+GP  E+VGGG+HQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  EGSPVMEVEESGP-EEEVGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL

A0A5A7TRT9 Uncharacterized protein8.9e-9982.87Show/hide
Query:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSA-FTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHK
        MEIKHK KIHPSP    PSSSSSVFKLLPAAILAL SLLSLDDREVLAYMIARSIQSSA  TSTRVSRKKSTKKPSIN GNSNV   TTTTTTTTTTYHK
Subjt:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSA-FTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHK

Query:  TPMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQAT-----DKTLPVVQRPPPVVDECVDVPLRPEREPAVEV
        TP+FSCDCFYCYTAYW RWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQ +     +KTLPV+  P  V DECVDV L     P VE 
Subjt:  TPMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQAT-----DKTLPVVQRPPPVVDECVDVPLRPEREPAVEV

Query:  EGSPVMEVEESGP-EEEVGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        EGS V EVEE+GP  E+VGGG+HQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  EGSPVMEVEESGP-EEEVGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL

A0A6J1GV32 uncharacterized protein LOC1114577376.2e-10081.12Show/hide
Query:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHKT
        MEIKHKGK+HPSPSSS PSSSSSVFKLLPAAILALVS+LSLD+REVLAYMIARSIQSSA TST  SRKKS KK SINGGNSNV          +T+YHKT
Subjt:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHKT

Query:  PMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQATDKTLPVVQRPPPVVDECVDVPLRPEREPAV-----EVE
        P+FSCDCFYCYTAYW RWDSSPNRELIHQAIEAFEDHLT+GEKPKKN G+GKRRDRI RQATDK+LPVVQRP PVVDECV VPL PER+ A      E +
Subjt:  PMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQATDKTLPVVQRPPPVVDECVDVPLRPEREPAV-----EVE

Query:  GSPVMEVEESGPEEEVGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        GSPV EV ESGP +EV  GDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GSPVMEVEESGPEEEVGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL

A0A6J1IQ09 uncharacterized protein LOC1114794672.1e-10081.53Show/hide
Query:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHKT
        MEIKHKGK+HPSPSSS PSSSSSVFKLLPAAILALVS+LSLD+REVLAYMIARSIQSSA TST  SRKKS KK SINGGNSNV          +T YHKT
Subjt:  MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHKT

Query:  PMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQATDKTLPVVQRPPPVVDECVDVPLRPEREPAV-----EVE
        PMFSCDCFYCYTAYW RWDSSPNRELIHQAIEAFEDHLT+GEKPKKN G+GKRRDRIGRQATDK+LPVVQRPPPV  ECV VP+ PER+ AV     E +
Subjt:  PMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQATDKTLPVVQRPPPVVDECVDVPLRPEREPAV-----EVE

Query:  GSPVMEVEESGPEEEVGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        GSPV EV ESGP +EV  GDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GSPVMEVEESGPEEEVGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G12020.1 unknown protein3.0e-2239.26Show/hide
Query:  EIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTST-RVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHKT
        ++  KG +HPSP      S+  +  LLP AI +L ++LS +DREVLAY+I+ +  S     T R+++ K+ KK   +                    + +
Subjt:  EIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTST-RVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHKT

Query:  PMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKN-NGRGKRRDRIGRQAT
        P+F CDCF CYT+YW RWDSSP+R+LIH+ I+AFED L   +  KKN  G+  RR R G+ ++
Subjt:  PMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKN-NGRGKRRDRIGRQAT

AT1G24270.1 unknown protein2.0e-2642.61Show/hide
Query:  MEIKHKGKIHPSP-----SSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTT
        M++  KGK+HPSP     SSS    S SVFKLL +AIL LVS+LS +D EVLAY+I RS+ ++   S +  R                            
Subjt:  MEIKHKGKIHPSP-----SSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTT

Query:  TYHKTPMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGE-----KPKKNNGRGKRRDRIGRQATDKTL
          HK P+  C CF CYT+YW +WDSS NRELI+Q IEAFEDHLT  E       KKN  R K+ +    Q  +K++
Subjt:  TYHKTPMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGE-----KPKKNNGRGKRRDRIGRQATDKTL

AT1G62422.1 unknown protein1.1e-2140Show/hide
Query:  KGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHKTPMFSC
        KG +HPSP  +   +      LLP AIL+LV+ LS++DREVLAY+I     S++  S R+SR K  K+ +                      H +P+F C
Subjt:  KGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHKTPMFSC

Query:  DCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGR-GKRRDRIGRQATDK
        DCF CYT+YW RWD+SP R+LIH+ I+A+ED L   +K K    R GK   R+    T +
Subjt:  DCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGR-GKRRDRIGRQATDK

AT5G13090.1 unknown protein1.0e-3840.69Show/hide
Query:  MEIKHKGKIHPSP-----SSSTPSSSS----------SVFKLLPAAILALVSLLSLDDREVLAYMIAR--SIQSSAFTSTRVSRKKSTKKPSINGGNSNV
        M++K KGK++PSP     SSS+ SSS+          SV KLLPA IL LVS+LS ++REVLAY+I R  +I     +S++   KK + K S N      
Subjt:  MEIKHKGKIHPSP-----SSSTPSSSS----------SVFKLLPAAILALVSLLSLDDREVLAYMIAR--SIQSSAFTSTRVSRKKSTKKPSINGGNSNV

Query:  TITTTTTTTTTTTYHKTPMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEK---PKKNNGRGKRRDRIGRQATDKTLPVVQR--------P
                      HK P+F C+CF CYT YWFRWDSSPNRELIH+ IEAFE+H  +GE+    +  + RGK++++ GR+ TD       R         
Subjt:  TITTTTTTTTTTTYHKTPMFSCDCFYCYTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEK---PKKNNGRGKRRDRIGRQATDKTLPVVQR--------P

Query:  PPVVDEC----------VDVPLRPEREPAVEVEGSPVMEVEESGPEEE---------VGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPN
         PVV+            V  P+R       E E    + VEE G EEE                KGLA KVLPDVLG F+S  W LW+PN
Subjt:  PPVVDEC----------VDVPLRPEREPAVEVEGSPVMEVEESGPEEE---------VGGGDHQKGLATKVLPDVLGFFNSRLWSLWSPN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATTAAGCACAAAGGTAAAATACATCCATCGCCGTCGTCCTCAACGCCGTCATCTTCCTCCTCCGTCTTCAAGCTTCTTCCCGCCGCTATTTTAGCCCTTGTTTC
ACTTCTCTCTCTCGACGACCGCGAAGTCCTGGCTTACATGATCGCCAGGTCCATCCAATCCTCCGCATTCACTTCAACTCGCGTTTCCAGGAAGAAATCCACCAAAAAAC
CTTCGATCAATGGCGGAAATAGTAATGTTACTATTACCACTACTACTACTACTACTACCACTACTACTTATCACAAAACTCCTATGTTTAGTTGCGATTGCTTCTACTGC
TACACCGCCTACTGGTTCCGCTGGGACTCATCTCCCAATCGCGAACTAATCCATCAAGCGATCGAGGCGTTTGAAGATCACTTGACGAACGGCGAGAAGCCGAAGAAGAA
CAACGGCAGAGGAAAGAGGAGAGATAGAATCGGCCGCCAAGCTACTGATAAGACTCTGCCTGTTGTTCAACGTCCACCGCCGGTTGTCGATGAGTGCGTCGACGTTCCGT
TACGGCCGGAGCGTGAACCGGCGGTGGAAGTGGAAGGAAGTCCGGTGATGGAAGTTGAAGAGAGTGGTCCAGAGGAGGAGGTAGGCGGCGGGGACCATCAGAAGGGTTTG
GCGACGAAGGTACTGCCGGACGTGTTGGGGTTTTTCAATTCGCGTTTGTGGAGTCTGTGGAGTCCGAATCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGATTAAGCACAAAGGTAAAATACATCCATCGCCGTCGTCCTCAACGCCGTCATCTTCCTCCTCCGTCTTCAAGCTTCTTCCCGCCGCTATTTTAGCCCTTGTTTC
ACTTCTCTCTCTCGACGACCGCGAAGTCCTGGCTTACATGATCGCCAGGTCCATCCAATCCTCCGCATTCACTTCAACTCGCGTTTCCAGGAAGAAATCCACCAAAAAAC
CTTCGATCAATGGCGGAAATAGTAATGTTACTATTACCACTACTACTACTACTACTACCACTACTACTTATCACAAAACTCCTATGTTTAGTTGCGATTGCTTCTACTGC
TACACCGCCTACTGGTTCCGCTGGGACTCATCTCCCAATCGCGAACTAATCCATCAAGCGATCGAGGCGTTTGAAGATCACTTGACGAACGGCGAGAAGCCGAAGAAGAA
CAACGGCAGAGGAAAGAGGAGAGATAGAATCGGCCGCCAAGCTACTGATAAGACTCTGCCTGTTGTTCAACGTCCACCGCCGGTTGTCGATGAGTGCGTCGACGTTCCGT
TACGGCCGGAGCGTGAACCGGCGGTGGAAGTGGAAGGAAGTCCGGTGATGGAAGTTGAAGAGAGTGGTCCAGAGGAGGAGGTAGGCGGCGGGGACCATCAGAAGGGTTTG
GCGACGAAGGTACTGCCGGACGTGTTGGGGTTTTTCAATTCGCGTTTGTGGAGTCTGTGGAGTCCGAATCTTTAA
Protein sequenceShow/hide protein sequence
MEIKHKGKIHPSPSSSTPSSSSSVFKLLPAAILALVSLLSLDDREVLAYMIARSIQSSAFTSTRVSRKKSTKKPSINGGNSNVTITTTTTTTTTTTYHKTPMFSCDCFYC
YTAYWFRWDSSPNRELIHQAIEAFEDHLTNGEKPKKNNGRGKRRDRIGRQATDKTLPVVQRPPPVVDECVDVPLRPEREPAVEVEGSPVMEVEESGPEEEVGGGDHQKGL
ATKVLPDVLGFFNSRLWSLWSPNL