; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi08G004310 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi08G004310
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionAdenine nucleotide alpha hydrolases-like superfamily protein
Genome locationchr08:11826534..11833930
RNA-Seq ExpressionLsi08G004310
SyntenyLsi08G004310
Gene Ontology termsNA
InterPro domainsIPR006016 - UspA
IPR008004 - Protein OCTOPUS-like
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602092.1 hypothetical protein SDJN03_07325, partial [Cucurbita argyrosperma subsp. sororia]1.2e-9390.19Show/hide
Query:  MPSTESFLRQISRRGEGSRSTSRRWGGEFRRSEA-EERVSEGSHWNQKMEGGVNNMYGIDNGGVSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTL
        MP  ESFLRQIS RGEGSRSTS+RWGGEFRRS A EERVSEGSHWN+KMEGGV NMYG+D+GGVSRRKRVMVVVD TSQSNHATMWALTHVANKGDVLTL
Subjt:  MPSTESFLRQISRRGEGSRSTSRRWGGEFRRSEA-EERVSEGSHWNQKMEGGVNNMYGIDNGGVSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTL

Query:  LHVINSSSTDSSSSAADSSSSSS-FCASSLGSLCKASRPEVEVEVLVVEGPKLATVINQVKKLEVSVLVVGQRRPSFLSCFCGSGGAGDLVEQCINNAEC
        LHVI +SSTDSSSS++ SSSSSS FCA+SLGSLCKASRPEVEVEVLV+EGPKL TV+NQVKKLEVSVLVVGQRRPSFLSCFCGSGGAGDLVEQCINNAEC
Subjt:  LHVINSSSTDSSSSAADSSSSSS-FCASSLGSLCKASRPEVEVEVLVVEGPKLATVINQVKKLEVSVLVVGQRRPSFLSCFCGSGGAGDLVEQCINNAEC

Query:  LTIGVRKQSRDMGG
        LTIGVRKQSRDMGG
Subjt:  LTIGVRKQSRDMGG

XP_004150443.1 uncharacterized protein LOC101206721 [Cucumis sativus]2.9e-9288.84Show/hide
Query:  MPSTESFLRQI-SRRGEG-SRSTSRRWGGEFRRSEAEERVSEGSHWNQKME-GGVNNMYGIDNGGVSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVL
        MPSTESFLRQI SRRGEG SRSTSRRWGGEFRR+E EERVSEGS W+QKME GGVN+M+GIDNGG+SRRKRVMVVVD TSQSNHATMWALTH+ANKGDVL
Subjt:  MPSTESFLRQI-SRRGEG-SRSTSRRWGGEFRRSEAEERVSEGSHWNQKME-GGVNNMYGIDNGGVSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVL

Query:  TLLHVINSSSTDSSSSAADSSSSSSFCASSLGSLCKASRPEVEVEVLVVEGPKLATVINQVKKLEVSVLVVGQRRPSFLSCFCGSGGAGDLVEQCINNAE
        TLLHVI +SSTDSSS+A    S+SSFCASSLGSLCKASRPEVEVEVLV+EGPKLATV+NQVKKLEVSVLVVGQRRPS  SCFCGSGGAGDLVEQCINNAE
Subjt:  TLLHVINSSSTDSSSSAADSSSSSSFCASSLGSLCKASRPEVEVEVLVVEGPKLATVINQVKKLEVSVLVVGQRRPSFLSCFCGSGGAGDLVEQCINNAE

Query:  CLTIGVRKQSRDMGG
        CLTIGVRKQSRDMGG
Subjt:  CLTIGVRKQSRDMGG

XP_008466707.1 PREDICTED: uncharacterized protein LOC103504053 [Cucumis melo]1.4e-9491.16Show/hide
Query:  MPSTESFLRQI-SRRGEG-SRSTSRRWGGEFRRSEAEERVSEGSHWNQKME-GGVNNMYGIDNGGVSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVL
        MPSTESFLRQI SRRGEG SRSTSRRWGGEFR SE EERVSEGS WNQKME GGVN+MYGIDNGG+SRRKRVMVVVD TSQS+HATMWALTH+ANKGDVL
Subjt:  MPSTESFLRQI-SRRGEG-SRSTSRRWGGEFRRSEAEERVSEGSHWNQKME-GGVNNMYGIDNGGVSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVL

Query:  TLLHVINSSSTDSSSSAADSSSSSSFCASSLGSLCKASRPEVEVEVLVVEGPKLATVINQVKKLEVSVLVVGQRRPSFLSCFCGSGGAGDLVEQCINNAE
        TLLHVI +SSTDSSSSAAD  SSSSFCASSLGSLCKASRPEVEVEVLV+EGPKLATV+NQVKKLEVSVLVVGQRRPS  SCFCGSGGAGDLVEQCINNAE
Subjt:  TLLHVINSSSTDSSSSAADSSSSSSFCASSLGSLCKASRPEVEVEVLVVEGPKLATVINQVKKLEVSVLVVGQRRPSFLSCFCGSGGAGDLVEQCINNAE

Query:  CLTIGVRKQSRDMGG
        CLTIGVRKQSRDMGG
Subjt:  CLTIGVRKQSRDMGG

XP_038884296.1 uncharacterized protein LOC120075175 [Benincasa hispida]3.6e-9596.74Show/hide
Query:  DPNFCYFHPKEIVVGVCALCLNERLLILASRRGRHHSSARSCRKTPINLSKIFAFSSFISRLEFRHSKPENSDDEASTSQEDSFISINFGKNGVGSWEEN
        DPNFCYFHPKE VVGVCALCLNERLLILASRRGRHHSSARSCRKTPINLSKIFAFSSFISRLEFRH KPENSDDEASTSQEDSFISINFGKNGVGSWEEN
Subjt:  DPNFCYFHPKEIVVGVCALCLNERLLILASRRGRHHSSARSCRKTPINLSKIFAFSSFISRLEFRHSKPENSDDEASTSQEDSFISINFGKNGVGSWEEN

Query:  KVSEVSLENCSLSWNHHLSKDSKETKTVIEHSKTRASLRWRKRIGHLFQLIRRKRSNKGTVCHVEGVKTRKGWIRTLTRSRNTE
        KVSEV LENCSLSWNHHL+KDSKETKTVIEH KTRASLRWRKRIGHLFQLIRRKRSNKGTVCHVEG KTRKGWIRTLTRSRNTE
Subjt:  KVSEVSLENCSLSWNHHLSKDSKETKTVIEHSKTRASLRWRKRIGHLFQLIRRKRSNKGTVCHVEGVKTRKGWIRTLTRSRNTE

XP_038884412.1 uncharacterized protein LOC120075267 [Benincasa hispida]3.2e-9992.45Show/hide
Query:  MPSTESFLRQISRRGEGSRSTSRRWGGEFRRSEAEERVSEGSHWNQKMEGGVNNMYGIDNGGVSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLL
        MPSTESF+RQISRRGEGSRSTSRRWGGEFRRSE EERVSEG+ WNQKMEGGV NMYGIDNGG+SRRKRVMVVVD TSQSNHATMWALTHVANKGDVLTLL
Subjt:  MPSTESFLRQISRRGEGSRSTSRRWGGEFRRSEAEERVSEGSHWNQKMEGGVNNMYGIDNGGVSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLL

Query:  HVINSSSTDSSSSAADSSSSSSFCASSLGSLCKASRPEVEVEVLVVEGPKLATVINQVKKLEVSVLVVGQRRPSFLSCFCGSGGAGDLVEQCINNAECLT
        HVI +SSTDSSS+A  SSSSSSFCA+SLGSLCKASRPEVEVEVLV+EGPKLATVINQVKKLEVSVLVVGQR+PSFLSCFCGSGGAGDLVEQCINN ECLT
Subjt:  HVINSSSTDSSSSAADSSSSSSFCASSLGSLCKASRPEVEVEVLVVEGPKLATVINQVKKLEVSVLVVGQRRPSFLSCFCGSGGAGDLVEQCINNAECLT

Query:  IGVRKQSRDMGG
        IGVRKQSRDMGG
Subjt:  IGVRKQSRDMGG

TrEMBL top hitse value%identityAlignment
A0A0A0KDD8 Usp domain-containing protein1.4e-9288.84Show/hide
Query:  MPSTESFLRQI-SRRGEG-SRSTSRRWGGEFRRSEAEERVSEGSHWNQKME-GGVNNMYGIDNGGVSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVL
        MPSTESFLRQI SRRGEG SRSTSRRWGGEFRR+E EERVSEGS W+QKME GGVN+M+GIDNGG+SRRKRVMVVVD TSQSNHATMWALTH+ANKGDVL
Subjt:  MPSTESFLRQI-SRRGEG-SRSTSRRWGGEFRRSEAEERVSEGSHWNQKME-GGVNNMYGIDNGGVSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVL

Query:  TLLHVINSSSTDSSSSAADSSSSSSFCASSLGSLCKASRPEVEVEVLVVEGPKLATVINQVKKLEVSVLVVGQRRPSFLSCFCGSGGAGDLVEQCINNAE
        TLLHVI +SSTDSSS+A    S+SSFCASSLGSLCKASRPEVEVEVLV+EGPKLATV+NQVKKLEVSVLVVGQRRPS  SCFCGSGGAGDLVEQCINNAE
Subjt:  TLLHVINSSSTDSSSSAADSSSSSSFCASSLGSLCKASRPEVEVEVLVVEGPKLATVINQVKKLEVSVLVVGQRRPSFLSCFCGSGGAGDLVEQCINNAE

Query:  CLTIGVRKQSRDMGG
        CLTIGVRKQSRDMGG
Subjt:  CLTIGVRKQSRDMGG

A0A0A0KFU9 Uncharacterized protein1.4e-9294.59Show/hide
Query:  DPNFCYFHPKEIVVGVCALCLNERLLILASRRGR-HHSSARSCRKTPINLSKIFAFSSFISRLEFRHSKPENSDDEASTSQEDSFISINFGKNGVGSWEE
        DPN CYFHPKE VVGVCALCLNE+LLILASRRGR HHSS R+CRKTPINLSKIFAFSSFISRLEFRH KPENSDDEASTSQEDSFISINFGKNGVGSWEE
Subjt:  DPNFCYFHPKEIVVGVCALCLNERLLILASRRGR-HHSSARSCRKTPINLSKIFAFSSFISRLEFRHSKPENSDDEASTSQEDSFISINFGKNGVGSWEE

Query:  NKVSEVSLENCSLSWNHHLSKDSKETKTVIEHSKTRASLRWRKRIGHLFQLIRRKRSNKGTVCHVEGVKTRKGWIRTLTRSRNTE
        NKVSEVSLENC LSWNHHL+KDSKETKTVIEHSKTRASLRWRKRIGHLFQLIRRKRSNKGTVCHVEGVKTRKGWIR LTRSRNTE
Subjt:  NKVSEVSLENCSLSWNHHLSKDSKETKTVIEHSKTRASLRWRKRIGHLFQLIRRKRSNKGTVCHVEGVKTRKGWIRTLTRSRNTE

A0A1S3CRX0 uncharacterized protein LOC1035040536.6e-9591.16Show/hide
Query:  MPSTESFLRQI-SRRGEG-SRSTSRRWGGEFRRSEAEERVSEGSHWNQKME-GGVNNMYGIDNGGVSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVL
        MPSTESFLRQI SRRGEG SRSTSRRWGGEFR SE EERVSEGS WNQKME GGVN+MYGIDNGG+SRRKRVMVVVD TSQS+HATMWALTH+ANKGDVL
Subjt:  MPSTESFLRQI-SRRGEG-SRSTSRRWGGEFRRSEAEERVSEGSHWNQKME-GGVNNMYGIDNGGVSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVL

Query:  TLLHVINSSSTDSSSSAADSSSSSSFCASSLGSLCKASRPEVEVEVLVVEGPKLATVINQVKKLEVSVLVVGQRRPSFLSCFCGSGGAGDLVEQCINNAE
        TLLHVI +SSTDSSSSAAD  SSSSFCASSLGSLCKASRPEVEVEVLV+EGPKLATV+NQVKKLEVSVLVVGQRRPS  SCFCGSGGAGDLVEQCINNAE
Subjt:  TLLHVINSSSTDSSSSAADSSSSSSFCASSLGSLCKASRPEVEVEVLVVEGPKLATVINQVKKLEVSVLVVGQRRPSFLSCFCGSGGAGDLVEQCINNAE

Query:  CLTIGVRKQSRDMGG
        CLTIGVRKQSRDMGG
Subjt:  CLTIGVRKQSRDMGG

A0A6J1CHY8 uncharacterized protein LOC1110118074.0e-9294.44Show/hide
Query:  CYFHPKEIVVGVCALCLNERLLILASRRGRHHSSARSCRKTPINLSKIFAFSSFISRLEFRHSKPENSDDEASTSQEDSFISINFGKNGVGSWEENKVSE
        CYFHPKEIVVGVCALCLNERLLILAS+RGRHHSSAR+CRKTPINLSKIFAFSSFI+RLEFRH KPENSDDEASTS EDSFISINFGKNGVGSWE+NKVSE
Subjt:  CYFHPKEIVVGVCALCLNERLLILASRRGRHHSSARSCRKTPINLSKIFAFSSFISRLEFRHSKPENSDDEASTSQEDSFISINFGKNGVGSWEENKVSE

Query:  VSLENCSLSWNHHLSKDSKETKTVIEHSKTRASLRWRKRIGHLFQLIRRKRSNKGTVCHVEGVKTRKGWIRTLTRSRNTE
        VSLENCSLSWNHHL+KDSKETKTVIEH KTRASLRWRKRIGHLFQLIRRKRSNKGTVCHVEGVKTRKGW+RTLTRSRN+E
Subjt:  VSLENCSLSWNHHLSKDSKETKTVIEHSKTRASLRWRKRIGHLFQLIRRKRSNKGTVCHVEGVKTRKGWIRTLTRSRNTE

A0A6J1EA58 uncharacterized protein LOC1114307482.4e-9288.73Show/hide
Query:  MPSTESFLRQISRRGEGSRSTSRRWGGEFRRSEA-EERVSEGSHWNQKMEGGVNNMYGIDNGGVSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTL
        MP  ESFLRQIS RGEGSRSTS+RWGGEFRRS A EERVSEGSHWN+KMEGGV NMYG+D+GGVSRRKRVMVVVD TSQSNHATMWALTHVANKGDVLTL
Subjt:  MPSTESFLRQISRRGEGSRSTSRRWGGEFRRSEA-EERVSEGSHWNQKMEGGVNNMYGIDNGGVSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTL

Query:  LHVINSSSTDSSSSAADSSSSSSFCASSLGSLCKASRPEVEVEVLVVEGPKLATVINQVKKLEVSVLVVGQRRPSFLSCFCGSGGAGDLVEQCINNAECL
        LH+I ++STDSSSS   SSSSS FCA+SLGSLCKASRPEVEVEVLV+EGP+L+TV+NQVKKLEVSVLVVGQRRPSFLSCFCGSGGAGDLVEQCINNAECL
Subjt:  LHVINSSSTDSSSSAADSSSSSSFCASSLGSLCKASRPEVEVEVLVVEGPKLATVINQVKKLEVSVLVVGQRRPSFLSCFCGSGGAGDLVEQCINNAECL

Query:  TIGVRKQSRDMGG
        TIGVRKQSRDMGG
Subjt:  TIGVRKQSRDMGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21830.1 unknown protein3.1e-3647.92Show/hide
Query:  FCYFHPKEIVVGVCALCLNERLLILASRRGRHHSSARSCRKTPINLSKIFAFSSFISRLEFRHSK-PENSDDEASTSQEDSFISINFGKNGVGSWEENK-
        +CYFHPKE  VGVC LCLNERLL+LAS++ R   +  S     I+L KIFA SS +SRL+ RH K   +SD + STSQEDSFISI F  +G  SWE+   
Subjt:  FCYFHPKEIVVGVCALCLNERLLILASRRGRHHSSARSCRKTPINLSKIFAFSSFISRLEFRHSK-PENSDDEASTSQEDSFISINFGKNGVGSWEENK-

Query:  VSEVSLENCSLSWNHHLSKDSKETKT----VIEHSKTRASLRWRKRIGHLFQLIRRKRSNKGTVCH-----VEGVKTRK-GW-IRTLTRSRN
        V++V ++N + +  +   K  +   T    ++EH+  ++SLRWRKRIGHLF +I+ K  +  + CH     VEG K RK GW +RTLTR ++
Subjt:  VSEVSLENCSLSWNHHLSKDSKETKT----VIEHSKTRASLRWRKRIGHLFQLIRRKRSNKGTVCH-----VEGVKTRK-GW-IRTLTRSRN

AT1G44608.1 unknown protein4.4e-1434.86Show/hide
Query:  QGNSDPNFCYFHPKEIVVGVCALCLNERLLILAS----RRGRHHSSARSCRKTPINLS---KIFAFSSFISRLEFRHSKPENSDDEASTSQEDSFISINF
        Q  S  + C  HP +++ GVC LCLNERLL++AS    R     SS ++  KT +  S   K     SF S  E RH K ++  + +  S EDSFISINF
Subjt:  QGNSDPNFCYFHPKEIVVGVCALCLNERLLILAS----RRGRHHSSARSCRKTPINLS---KIFAFSSFISRLEFRHSKPENSDDEASTSQEDSFISINF

Query:  GKNGVGSWEENKVSEVSLENCSLSWNHHLSKDSKETKTVIEHSKTRASLRWRKRIGHLFQLIRRKRSNKGTVCHV
          N   SWE+ K ++          + H  K+     T +        L WRKRI  L  +I  K  ++   CH+
Subjt:  GKNGVGSWEENKVSEVSLENCSLSWNHHLSKDSKETKTVIEHSKTRASLRWRKRIGHLFQLIRRKRSNKGTVCHV

AT1G44760.1 Adenine nucleotide alpha hydrolases-like superfamily protein7.4e-4652.11Show/hide
Query:  STESFLRQISRRGEGSRSTSRRWGGEFRRSEAEERVSEGSHWNQKMEGGVNNMYGIDNGG--VSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLL
        S  S LRQ+SR+ EG RS S+RW      +   +  S G +    MEG    +YG+ +GG   +R KRVMVVVD++S+S HA MWALTH+ NKGD++TLL
Subjt:  STESFLRQISRRGEGSRSTSRRWGGEFRRSEAEERVSEGSHWNQKMEGGVNNMYGIDNGG--VSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLL

Query:  HVINSSSTDSSSSAADSSSSSSFCASSLGSLCKASRPEVEVEVLVVEGPKLATVINQVKKLEVSVLVVGQRRPS-FLSCFCGSGGAGDLVEQCINNAECL
        HV+         S  D ++ S   A SLGSLCKA +PEV+VE LV++GPKLATV++QVKKLEVSVLV+GQ++ +  +SC CG   + +LV +CIN A+CL
Subjt:  HVINSSSTDSSSSAADSSSSSSFCASSLGSLCKASRPEVEVEVLVVEGPKLATVINQVKKLEVSVLVVGQRRPS-FLSCFCGSGGAGDLVEQCINNAECL

Query:  TIGVRKQSRDMGG
        TIGVRKQ + +GG
Subjt:  TIGVRKQSRDMGG

AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein2.3e-1531.52Show/hide
Query:  KRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLHVINSSSTDSSSSAADSSSSSSFC-----------ASSLGSLCKASRPEVEVEVLVVEG-PKLATV
        +R++VVVD  S++ +A +W L+H A   D + LLH + + ++ S   A         C            S+L ++C+  RPEV+ EV+ V+G  K  T+
Subjt:  KRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLHVINSSSTDSSSSAADSSSSSSFC-----------ASSLGSLCKASRPEVEVEVLVVEG-PKLATV

Query:  INQVKKLEVSVLVVGQRRP----SFLSCFCGSG---GAGDLVEQCINNAECLTIGVRKQSRDMGG
        + + ++ E S+LV+GQ++       L  +          D VE CINN+ C+ I VRK+ + +GG
Subjt:  INQVKKLEVSVLVVGQRRP----SFLSCFCGSG---GAGDLVEQCINNAECLTIGVRKQSRDMGG

AT2G03720.1 Adenine nucleotide alpha hydrolases-like superfamily protein7.4e-1435.33Show/hide
Query:  MVVVDQTSQSNHATMWALTHVANKGDVLTLLHVINSSSTDS-SSSAADSSSSSSFCASSLGSLCKASRPEVEVEVLVVE--GPKLATVINQVKKLEVSVL
        MVVVD TSQ+ +A  WALTH     D +TLLHV  +    +   +  + +S +      L + C+  +P V+ E++VVE    K  T++ + KK    VL
Subjt:  MVVVDQTSQSNHATMWALTHVANKGDVLTLLHVINSSSTDS-SSSAADSSSSSSFCASSLGSLCKASRPEVEVEVLVVE--GPKLATVINQVKKLEVSVL

Query:  VVGQRRPS-----FLSCFCGSGGAGDLVEQCINNAECLTIGVRKQSRDMG
        V+GQR+ +             G  G +VE CI+N++C+ I VRK+S + G
Subjt:  VVGQRRPS-----FLSCFCGSGGAGDLVEQCINNAECLTIGVRKQSRDMG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAAGTACGGAATCGTTTCTGAGGCAGATAAGTAGAAGAGGGGAGGGTTCTAGATCAACTTCGAGAAGGTGGGGTGGGGAGTTCAGGAGAAGCGAAGCTGAGGAGCG
TGTCAGTGAGGGGAGCCATTGGAATCAGAAGATGGAGGGTGGTGTTAACAACATGTATGGGATCGACAATGGCGGGGTGTCAAGAAGGAAGAGGGTGATGGTGGTGGTGG
ATCAGACTTCTCAATCTAACCATGCAACTATGTGGGCTCTCACTCATGTGGCTAACAAGGGCGATGTTCTCACTCTTCTACATGTCATTAACAGCTCCTCTACAGACTCT
TCTTCTTCTGCTGCAGATTCTTCTTCTTCTTCCTCTTTCTGTGCTAGCTCTCTTGGTTCTCTCTGCAAGGCTTCTAGACCTGAGGTAGAGGTGGAAGTGCTGGTAGTAGA
GGGACCGAAGCTGGCCACAGTGATAAACCAAGTTAAGAAGCTGGAGGTGTCGGTGCTGGTTGTGGGACAGAGAAGGCCATCCTTTCTCAGCTGCTTTTGTGGGAGTGGTG
GGGCGGGAGATTTGGTGGAACAGTGCATAAACAACGCAGAGTGCTTGACCATTGGAGTTAGAAAGCAGAGCAGGGACATGGGTGGACCTGGGCCTCACTACATCTGTTCC
TATTCAGAACTAAGCCTCACTATCATCCACCCATTACAGTTGAGCAGAACTGGGCAACGCCAGAATCCTTATTGGATCCACATGTATCAGGCTAAAGAAGCTAGCTTACT
CGCACCTCGACTAATCTCACAGGACAACCTACCTGACCCTATAATATTTGGGTGTCAAGGAAACTCAGATCCAAACTTCTGTTACTTCCATCCTAAAGAAATAGTGGTGG
GCGTCTGCGCTTTGTGCTTAAACGAGCGGCTTCTTATTTTGGCGTCCAGACGAGGCCGCCACCACTCTTCAGCCCGAAGCTGCCGGAAAACTCCCATCAACCTCTCCAAG
ATCTTTGCTTTTAGCTCTTTCATTAGCCGTCTCGAATTCCGGCATTCGAAGCCGGAAAACTCCGACGATGAAGCTTCCACCAGTCAAGAAGACTCGTTCATCTCAATCAA
CTTTGGGAAAAATGGAGTTGGGTCATGGGAGGAGAACAAGGTATCAGAGGTGTCCCTGGAAAACTGCAGCCTGTCATGGAATCATCACTTGAGCAAAGACTCCAAGGAGA
CCAAGACTGTGATAGAACACAGCAAGACCCGTGCCTCACTTAGGTGGCGGAAGCGGATTGGCCACCTCTTCCAGCTCATCAGAAGGAAAAGGTCCAACAAAGGGACAGTC
TGCCACGTGGAAGGAGTTAAGACAAGGAAAGGCTGGATAAGAACTCTGACAAGGTCAAGAAATACTGAATAG
mRNA sequenceShow/hide mRNA sequence
CATAAATCCAAATTTTTAATACAAATTTTGTCAGATATAATATTGATATTGATCTAACCGTACTTTACTCACTTTAGGTTGGCAGATTCGGGAGAAAAGATCCAAAAGAA
AGGTTGAAAAGAGAAGTAAAATATTTCCCAGAAGAGAGAGAGAGAGAGTTCAGAGTAGAGAGGGAATCCCTTTACCTTTCATCATCATACTGAAACAGCAGTGATTGTGT
TTCACTTCCATGTGTGTGCTGTTCAATAAAGGCACGCCTTCCCCCATCTCTTCCCGTGACGATTGAACCTATGTTTTTATATGCACATCCAAATTATGGCATTCCTTTGC
CACAGTCTTTTCCCTCTCATAGGCTCCTCCTACTCCTTCCTAACTTAACTAGTAATTGAAAGCAAAAGCCTCAGTCCCATCCATCAATCCCTGCCATTTTTCCACAACTT
AATGACCCATTAAACAGATCTAAGCTAAGCCACTAATTTCTGTGTCTGGTTCTGTTGTAGACGAGGTGTTCGAGAAAATGCCAAGTACGGAATCGTTTCTGAGGCAGATA
AGTAGAAGAGGGGAGGGTTCTAGATCAACTTCGAGAAGGTGGGGTGGGGAGTTCAGGAGAAGCGAAGCTGAGGAGCGTGTCAGTGAGGGGAGCCATTGGAATCAGAAGAT
GGAGGGTGGTGTTAACAACATGTATGGGATCGACAATGGCGGGGTGTCAAGAAGGAAGAGGGTGATGGTGGTGGTGGATCAGACTTCTCAATCTAACCATGCAACTATGT
GGGCTCTCACTCATGTGGCTAACAAGGGCGATGTTCTCACTCTTCTACATGTCATTAACAGCTCCTCTACAGACTCTTCTTCTTCTGCTGCAGATTCTTCTTCTTCTTCC
TCTTTCTGTGCTAGCTCTCTTGGTTCTCTCTGCAAGGCTTCTAGACCTGAGGTAGAGGTGGAAGTGCTGGTAGTAGAGGGACCGAAGCTGGCCACAGTGATAAACCAAGT
TAAGAAGCTGGAGGTGTCGGTGCTGGTTGTGGGACAGAGAAGGCCATCCTTTCTCAGCTGCTTTTGTGGGAGTGGTGGGGCGGGAGATTTGGTGGAACAGTGCATAAACA
ACGCAGAGTGCTTGACCATTGGAGTTAGAAAGCAGAGCAGGGACATGGGTGGACCTGGGCCTCACTACATCTGTTCCTATTCAGAACTAAGCCTCACTATCATCCACCCA
TTACAGTTGAGCAGAACTGGGCAACGCCAGAATCCTTATTGGATCCACATGTATCAGGCTAAAGAAGCTAGCTTACTCGCACCTCGACTAATCTCACAGGACAACCTACC
TGACCCTATAATATTTGGGTGTCAAGGAAACTCAGATCCAAACTTCTGTTACTTCCATCCTAAAGAAATAGTGGTGGGCGTCTGCGCTTTGTGCTTAAACGAGCGGCTTC
TTATTTTGGCGTCCAGACGAGGCCGCCACCACTCTTCAGCCCGAAGCTGCCGGAAAACTCCCATCAACCTCTCCAAGATCTTTGCTTTTAGCTCTTTCATTAGCCGTCTC
GAATTCCGGCATTCGAAGCCGGAAAACTCCGACGATGAAGCTTCCACCAGTCAAGAAGACTCGTTCATCTCAATCAACTTTGGGAAAAATGGAGTTGGGTCATGGGAGGA
GAACAAGGTATCAGAGGTGTCCCTGGAAAACTGCAGCCTGTCATGGAATCATCACTTGAGCAAAGACTCCAAGGAGACCAAGACTGTGATAGAACACAGCAAGACCCGTG
CCTCACTTAGGTGGCGGAAGCGGATTGGCCACCTCTTCCAGCTCATCAGAAGGAAAAGGTCCAACAAAGGGACAGTCTGCCACGTGGAAGGAGTTAAGACAAGGAAAGGC
TGGATAAGAACTCTGACAAGGTCAAGAAATACTGAATAGGCAAAAGAGAATATCTATGTGTGGACTGGAATTCAAATGTGATTTGTATAAAGATTTTGAGTCAAATCAAC
TACTCATTTTTTTTTTTCCTTTTGATTTGAGGTCTTGCAGGTTAAAGAATTTATGAGTGTAGATGAACTTCTGTAATGTGCTTGAAATATC
Protein sequenceShow/hide protein sequence
MPSTESFLRQISRRGEGSRSTSRRWGGEFRRSEAEERVSEGSHWNQKMEGGVNNMYGIDNGGVSRRKRVMVVVDQTSQSNHATMWALTHVANKGDVLTLLHVINSSSTDS
SSSAADSSSSSSFCASSLGSLCKASRPEVEVEVLVVEGPKLATVINQVKKLEVSVLVVGQRRPSFLSCFCGSGGAGDLVEQCINNAECLTIGVRKQSRDMGGPGPHYICS
YSELSLTIIHPLQLSRTGQRQNPYWIHMYQAKEASLLAPRLISQDNLPDPIIFGCQGNSDPNFCYFHPKEIVVGVCALCLNERLLILASRRGRHHSSARSCRKTPINLSK
IFAFSSFISRLEFRHSKPENSDDEASTSQEDSFISINFGKNGVGSWEENKVSEVSLENCSLSWNHHLSKDSKETKTVIEHSKTRASLRWRKRIGHLFQLIRRKRSNKGTV
CHVEGVKTRKGWIRTLTRSRNTE