; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026283 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026283
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionLEA_2 domain-containing protein
Genome locationtig00153031:3590169..3593902
RNA-Seq ExpressionSgr026283
SyntenySgr026283
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595030.1 hypothetical protein SDJN03_11583, partial [Cucurbita argyrosperma subsp. sororia]1.8e-8984.33Show/hide
Query:  VLAKTDSEVSSLTPSSPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRFD
        VLAKTDSEVSSLT SSPSSRRPV                       PVLSPMGSPPHSHSNSSLGPHSRDSSSTR+SASVKPGSRKA N KIPKPW+RFD
Subjt:  VLAKTDSEVSSLTPSSPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRFD

Query:  AIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVHVT
        AIEEERLLEDDG SDGFSRRCYF+AFVISFVVLF+LFSLILWGASRPQKPTI+MKSILFDKFVIQAGADFSGVATDLATMNATVK +FRNTATFFGVHVT
Subjt:  AIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVHVT

Query:  STPLQLSYSQLTLASGT
        STPLQLSYSQLTLASGT
Subjt:  STPLQLSYSQLTLASGT

KAG7027054.1 hypothetical protein SDJN02_11063, partial [Cucurbita argyrosperma subsp. argyrosperma]1.8e-8984.33Show/hide
Query:  VLAKTDSEVSSLTPSSPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRFD
        VLAKTDSEVSSLT SSPSSRRPV                       PVLSPMGSPPHSHSNSSLGPHSRDSSSTR+SASVKPGSRKA N KIPKPW+RFD
Subjt:  VLAKTDSEVSSLTPSSPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRFD

Query:  AIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVHVT
        AIEEERLLEDDG SDGFSRRCYF+AFVISFVVLF+LFSLILWGASRPQKPTI+MKSILFDKFVIQAGADFSGVATDLATMNATVK +FRNTATFFGVHVT
Subjt:  AIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVHVT

Query:  STPLQLSYSQLTLASGT
        STPLQLSYSQLTLASGT
Subjt:  STPLQLSYSQLTLASGT

XP_022962796.1 uncharacterized protein LOC111463178 [Cucurbita moschata]1.8e-8984.33Show/hide
Query:  VLAKTDSEVSSLTPSSPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRFD
        VLAKTDSEVSSLT SSPSSRRPV                       PVLSPMGSPPHSHSNSSLGPHSRDSSSTR+SASVKPGSRKA N KIPKPW+RFD
Subjt:  VLAKTDSEVSSLTPSSPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRFD

Query:  AIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVHVT
        AIEEERLLEDDG SDGFSRRCYF+AFVISFVVLF+LFSLILWGASRPQKPTI+MKSILFDKFVIQAGADFSGVATDLATMNATVK +FRNTATFFGVHVT
Subjt:  AIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVHVT

Query:  STPLQLSYSQLTLASGT
        STPLQLSYSQLTLASGT
Subjt:  STPLQLSYSQLTLASGT

XP_023003762.1 uncharacterized protein LOC111497248 [Cucurbita maxima]1.6e-9084.79Show/hide
Query:  VLAKTDSEVSSLTPSSPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRFD
        VLAKTDSEVSSLTPSSPSSRRPV                       PVLSPMGSPPHSHSNSSLGPHSRDSSSTR+SASVKPGSRKA N KIPKPW+RFD
Subjt:  VLAKTDSEVSSLTPSSPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRFD

Query:  AIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVHVT
        AIEEERLLEDDG SDGFSRRCYF+AFVISFVVLF+LFSLILWGASRPQKPTI+MKSILFDKFVIQAGADFSGVATDLATMNATVK +FRNTATFFGVHVT
Subjt:  AIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVHVT

Query:  STPLQLSYSQLTLASGT
        STPLQLSYSQLTLASGT
Subjt:  STPLQLSYSQLTLASGT

XP_023517684.1 uncharacterized protein LOC111781365 [Cucurbita pepo subsp. pepo]5.2e-8983.87Show/hide
Query:  VLAKTDSEVSSLTPSSPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRFD
        VLAKTDSEVSSLT SSPSSRRPV                       PVLSPMGSPPHSHSNSSLGPHSRDSSSTR+SASVKPGSRKA N KIPKPW+RFD
Subjt:  VLAKTDSEVSSLTPSSPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRFD

Query:  AIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVHVT
        AIEEERLLEDDG S+GFSRRCYF+AFVISFVVLF+LFSLILWGASRPQKPTI+MKSILFDKFVIQAGADFSGVATDLATMNATVK +FRNTATFFGVHVT
Subjt:  AIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVHVT

Query:  STPLQLSYSQLTLASGT
        STPLQLSYSQLTLASGT
Subjt:  STPLQLSYSQLTLASGT

TrEMBL top hitse value%identityAlignment
A0A1S3B0C4 uncharacterized protein LOC1034847823.4e-8678.79Show/hide
Query:  VLAKTDSEVSSLTPS----SPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPW
        VLAKTDSEVSSLTPS    SPSSRRPV                       PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRK  N KIPKPW
Subjt:  VLAKTDSEVSSLTPS----SPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPW

Query:  KRFDAIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFG
        KRFDAIEEERLL+DDG SDGF+RRCYF+AFVISFV+LF+LFSLILWGASRPQKPTI+MKSILFDKFVIQAGADFSGVAT L TMNATVK IFRNTATFFG
Subjt:  KRFDAIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFG

Query:  VHVTSTPLQLSYSQLTLASGTNKIEYCKRGS
        V VTSTPLQLSYSQLTLASGT +  + +R S
Subjt:  VHVTSTPLQLSYSQLTLASGTNKIEYCKRGS

A0A5D3CN58 Late embryogenesis abundant protein, LEA-143.4e-8681.45Show/hide
Query:  VLAKTDSEVSSLTPS----SPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPW
        VLAKTDSEVSSLTPS    SPSSRRPV                       PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRK  N KIPKPW
Subjt:  VLAKTDSEVSSLTPS----SPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPW

Query:  KRFDAIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFG
        KRFDAIEEERLL+DDG SDGF+RRCYF+AFVISFV+LF+LFSLILWGASRPQKPTI+MKSILFDKFVIQAGADFSGVAT L TMNATVK IFRNTATFFG
Subjt:  KRFDAIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFG

Query:  VHVTSTPLQLSYSQLTLASGT
        V VTSTPLQLSYSQLTLASGT
Subjt:  VHVTSTPLQLSYSQLTLASGT

A0A6J1BTU7 uncharacterized protein LOC1110056322.0e-8682.06Show/hide
Query:  MSVLAKTDSEVSSLTPS----SPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRK-AGNQKIP
        MSVLAKTDSEVSSLTPS    SPSSRRPV                       PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRK A   KIP
Subjt:  MSVLAKTDSEVSSLTPS----SPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRK-AGNQKIP

Query:  KPWKRFDAIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTAT
        KPWKRFDAIEEERLLEDDGGSDGF+RRCYF+AFVISFVVLF+LFSLILWGASRPQKP I+MKSILFDKFVIQAGADFSGVATDL TMNATVK IFRNTAT
Subjt:  KPWKRFDAIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTAT

Query:  FFGVHVTSTPLQLSYSQLTLASG
        FF VHVTSTPLQ+SYSQLTLASG
Subjt:  FFGVHVTSTPLQLSYSQLTLASG

A0A6J1HE75 uncharacterized protein LOC1114631788.7e-9084.33Show/hide
Query:  VLAKTDSEVSSLTPSSPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRFD
        VLAKTDSEVSSLT SSPSSRRPV                       PVLSPMGSPPHSHSNSSLGPHSRDSSSTR+SASVKPGSRKA N KIPKPW+RFD
Subjt:  VLAKTDSEVSSLTPSSPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRFD

Query:  AIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVHVT
        AIEEERLLEDDG SDGFSRRCYF+AFVISFVVLF+LFSLILWGASRPQKPTI+MKSILFDKFVIQAGADFSGVATDLATMNATVK +FRNTATFFGVHVT
Subjt:  AIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVHVT

Query:  STPLQLSYSQLTLASGT
        STPLQLSYSQLTLASGT
Subjt:  STPLQLSYSQLTLASGT

A0A6J1KNI5 uncharacterized protein LOC1114972487.9e-9184.79Show/hide
Query:  VLAKTDSEVSSLTPSSPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRFD
        VLAKTDSEVSSLTPSSPSSRRPV                       PVLSPMGSPPHSHSNSSLGPHSRDSSSTR+SASVKPGSRKA N KIPKPW+RFD
Subjt:  VLAKTDSEVSSLTPSSPSSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRFD

Query:  AIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVHVT
        AIEEERLLEDDG SDGFSRRCYF+AFVISFVVLF+LFSLILWGASRPQKPTI+MKSILFDKFVIQAGADFSGVATDLATMNATVK +FRNTATFFGVHVT
Subjt:  AIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVHVT

Query:  STPLQLSYSQLTLASGT
        STPLQLSYSQLTLASGT
Subjt:  STPLQLSYSQLTLASGT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G45688.1 unknown protein1.2e-5146Show/hide
Query:  AKTDSEVSSLTPSSP--SSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKA----GNQKI----
        AKTDSEV+SL  SSP  S RRPV                       PVLSPMGSPPHSH  SS+G HSR+SSS+RFS S+KPGSRK     G+++     
Subjt:  AKTDSEVSSLTPSSP--SSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKA----GNQKI----

Query:  PKPWKRFDAIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTA
         K WK    IEEE LL+D     G  RRCY +AF++ F +LF  FSLIL+GA++P KP I +KSI F+   IQAG D  GV TD+ TMNAT+++++RNT 
Subjt:  PKPWKRFDAIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTA

Query:  TFFGVHVTSTPLQLSYSQLTLASGTNKIEYCKRGSLGNADAKVPPSEKEPASDNGNGKRKQHSIVRWGSQPRQLQRSADRTGAPEPSIHGPVQSQRVGQT
        TFFGVHVTSTP+ LS+SQ+ + SG+ K  Y  R S       V   EK P   +G       S +   + P  L +   + GAP P    P     V  T
Subjt:  TFFGVHVTSTPLQLSYSQLTLASGTNKIEYCKRGSLGNADAKVPPSEKEPASDNGNGKRKQHSIVRWGSQPRQLQRSADRTGAPEPSIHGPVQSQRVGQT

AT1G45688.2 unknown protein1.6e-5152.89Show/hide
Query:  AKTDSEVSSLTPSSP--SSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKA----GNQKI----
        AKTDSEV+SL  SSP  S RRPV                       PVLSPMGSPPHSH  SS+G HSR+SSS+RFS S+KPGSRK     G+++     
Subjt:  AKTDSEVSSLTPSSP--SSRRPV-----------------------PVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKA----GNQKI----

Query:  PKPWKRFDAIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTA
         K WK    IEEE LL+D     G  RRCY +AF++ F +LF  FSLIL+GA++P KP I +KSI F+   IQAG D  GV TD+ TMNAT+++++RNT 
Subjt:  PKPWKRFDAIEEERLLEDDGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTA

Query:  TFFGVHVTSTPLQLSYSQLTLASGT
        TFFGVHVTSTP+ LS+SQ+ + SG+
Subjt:  TFFGVHVTSTPLQLSYSQLTLASGT

AT2G41990.1 CONTAINS InterPro DOMAIN/s: Late embryogenesis abundant protein, group 2 (InterPro:IPR004864)6.9e-2337.61Show/hide
Query:  AKTDSEVSS-----LTPSSPSSRRPVPVLSP----------------MGSPPHSH-SNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRFDAIE
        AKTDSE +S     L+P   + R    V SP                MGSP H H  + S   HSR+SS++RFS       R   + K  +  +R+    
Subjt:  AKTDSEVSS-----LTPSSPSSRRPVPVLSP----------------MGSPPHSH-SNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRFDAIE

Query:  EERLLEDDGGSDG--FSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVHVTS
        +++    DGG D   F     ++  ++S + LFT+FSLILWGAS+   P + +K +L     +QAG D SGV TD+ ++N+TV++ +RN +TFF VHVT+
Subjt:  EERLLEDDGGSDG--FSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVHVTS

Query:  TPLQLSYSQLTLASG-TNKIEYCKRG
        +PL L YS L L+SG  NK    + G
Subjt:  TPLQLSYSQLTLASG-TNKIEYCKRG

AT4G35170.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.0e-2138.46Show/hide
Query:  SPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRFDAIEEERLLEDDGGSDGFSRRC--YFIAFVISFVVLFTLFSLILWGASRP
        SP GSP +     S   H   + S+ +  S  P   +  + ++    +R     E+   ++  G D   RR   ++   + + V+ FTLF LILWG S+ 
Subjt:  SPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRFDAIEEERLLEDDGGSDGFSRRC--YFIAFVISFVVLFTLFSLILWGASRP

Query:  QKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVHVTSTPLQLSYSQLTLASG
          P   +K ++ +   +Q+G D SGV TD+ T+N+TV++++RN ATFF VHVTS PLQLSYSQL LASG
Subjt:  QKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVHVTSTPLQLSYSQLTLASG

AT5G42860.1 unknown protein1.3e-4047.16Show/hide
Query:  AKTDSEVSSLTPSSP--SSRRP-----------------------VPVL-SPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRF
        AKTDSEV+SL+ SSP  S RRP                        PVL SPMGSPPHSH           SSS+RFS     GS++ G+       K+F
Subjt:  AKTDSEVSSLTPSSP--SSRRP-----------------------VPVL-SPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRF

Query:  DAIEEERLLED-DGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVH
          IEEE LL+D D   +   RRCY +AF++ F +LF  FSLIL+ A++PQKP I +KSI F++  +QAG D  G+ TD+ TMNAT+++++RNT TFFGVH
Subjt:  DAIEEERLLED-DGGSDGFSRRCYFIAFVISFVVLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVH

Query:  VTSTPLQLSYSQLTLASGTNKIEYCKRGS
        VTS+P+ LS+SQ+T+ SG+ K  Y  R S
Subjt:  VTSTPLQLSYSQLTLASGTNKIEYCKRGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGTGCTGGCGAAGACCGACTCCGAGGTGAGTAGTCTGACGCCGTCGTCGCCGAGCTCTCGTCGGCCGGTCCCAGTCCTGAGTCCGATGGGGTCTCCTCCTCACTC
CCACTCCAACTCTTCCTTGGGCCCCCATTCCCGTGACTCTTCCTCCACGCGATTCTCCGCCTCCGTCAAGCCCGGATCTCGTAAGGCCGGCAACCAGAAGATTCCGAAGC
CGTGGAAGCGTTTCGACGCCATTGAAGAAGAGCGTCTCCTGGAGGACGACGGCGGCTCAGATGGGTTCAGCCGCCGGTGCTACTTCATTGCTTTTGTTATAAGTTTTGTG
GTTCTTTTTACTCTCTTTTCTCTGATTCTGTGGGGTGCGAGCCGGCCCCAGAAACCGACGATTATCATGAAAAGCATTTTGTTCGATAAGTTCGTGATCCAAGCGGGAGC
AGATTTCTCAGGGGTGGCGACGGATTTGGCGACGATGAATGCGACGGTGAAGTTGATATTTCGAAACACGGCGACGTTCTTTGGAGTTCATGTGACTTCCACTCCGCTTC
AGCTTTCGTATTCTCAGCTCACTCTCGCCTCTGGAACTAATAAAATAGAATATTGTAAAAGGGGTAGTCTTGGAAATGCAGATGCAAAAGTTCCACCAAGCGAGAAAGAG
CCAGCGAGCGATAACGGTAACGGTAAAAGGAAGCAGCATTCCATTGTACGGTGGGGGAGCCAGCCTAGGCAGCTTCAAAGGAGCGCCGATCGAACCGGTGCCCCTGAACC
TTCAATTCACGGTCCGGTCCAGAGCCAACGTGTTGGGCAAACTGGTGAAGCCCAAGTTCTACAAGAGCGTCGACTACGGACCAATCAACTGGCCCATTGTCCACCAGCTG
AAGCAGCCGCTCCGTCGGAGAAGACGAATCCAACCAAGAATTTGATTCTTCTTCAAATCTTTGATCGAGCTCCTGCAAGTGAACCGATGCCCAATCTTCGAAATCGAAGG
GGCACCTCGGCGACGTTGATGATTCGTCCATGGCCGTCAATGGAGCAGACTTTCCGGCGTCTGCTACGAACGGGTGATTTAAAAGCATCTCAGCCGTCCACCTCTTTGTA
G
mRNA sequenceShow/hide mRNA sequence
ATGTCGGTGCTGGCGAAGACCGACTCCGAGGTGAGTAGTCTGACGCCGTCGTCGCCGAGCTCTCGTCGGCCGGTCCCAGTCCTGAGTCCGATGGGGTCTCCTCCTCACTC
CCACTCCAACTCTTCCTTGGGCCCCCATTCCCGTGACTCTTCCTCCACGCGATTCTCCGCCTCCGTCAAGCCCGGATCTCGTAAGGCCGGCAACCAGAAGATTCCGAAGC
CGTGGAAGCGTTTCGACGCCATTGAAGAAGAGCGTCTCCTGGAGGACGACGGCGGCTCAGATGGGTTCAGCCGCCGGTGCTACTTCATTGCTTTTGTTATAAGTTTTGTG
GTTCTTTTTACTCTCTTTTCTCTGATTCTGTGGGGTGCGAGCCGGCCCCAGAAACCGACGATTATCATGAAAAGCATTTTGTTCGATAAGTTCGTGATCCAAGCGGGAGC
AGATTTCTCAGGGGTGGCGACGGATTTGGCGACGATGAATGCGACGGTGAAGTTGATATTTCGAAACACGGCGACGTTCTTTGGAGTTCATGTGACTTCCACTCCGCTTC
AGCTTTCGTATTCTCAGCTCACTCTCGCCTCTGGAACTAATAAAATAGAATATTGTAAAAGGGGTAGTCTTGGAAATGCAGATGCAAAAGTTCCACCAAGCGAGAAAGAG
CCAGCGAGCGATAACGGTAACGGTAAAAGGAAGCAGCATTCCATTGTACGGTGGGGGAGCCAGCCTAGGCAGCTTCAAAGGAGCGCCGATCGAACCGGTGCCCCTGAACC
TTCAATTCACGGTCCGGTCCAGAGCCAACGTGTTGGGCAAACTGGTGAAGCCCAAGTTCTACAAGAGCGTCGACTACGGACCAATCAACTGGCCCATTGTCCACCAGCTG
AAGCAGCCGCTCCGTCGGAGAAGACGAATCCAACCAAGAATTTGATTCTTCTTCAAATCTTTGATCGAGCTCCTGCAAGTGAACCGATGCCCAATCTTCGAAATCGAAGG
GGCACCTCGGCGACGTTGATGATTCGTCCATGGCCGTCAATGGAGCAGACTTTCCGGCGTCTGCTACGAACGGGTGATTTAAAAGCATCTCAGCCGTCCACCTCTTTGTA
G
Protein sequenceShow/hide protein sequence
MSVLAKTDSEVSSLTPSSPSSRRPVPVLSPMGSPPHSHSNSSLGPHSRDSSSTRFSASVKPGSRKAGNQKIPKPWKRFDAIEEERLLEDDGGSDGFSRRCYFIAFVISFV
VLFTLFSLILWGASRPQKPTIIMKSILFDKFVIQAGADFSGVATDLATMNATVKLIFRNTATFFGVHVTSTPLQLSYSQLTLASGTNKIEYCKRGSLGNADAKVPPSEKE
PASDNGNGKRKQHSIVRWGSQPRQLQRSADRTGAPEPSIHGPVQSQRVGQTGEAQVLQERRLRTNQLAHCPPAEAAAPSEKTNPTKNLILLQIFDRAPASEPMPNLRNRR
GTSATLMIRPWPSMEQTFRRLLRTGDLKASQPSTSL