; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr024234 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr024234
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTyrosine-specific transport protein, putative
Genome locationtig00001047:4367859..4369850
RNA-Seq ExpressionSgr024234
SyntenySgr024234
Gene Ontology termsGO:0003333 - amino acid transmembrane transport (biological process)
InterPro domainsIPR018227 - Amino acid/polyamine transporter 2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602318.1 hypothetical protein SDJN03_07551, partial [Cucurbita argyrosperma subsp. sororia]1.9e-5641.25Show/hide
Query:  IGPHGIFSNIFGNFHSQLDPFPATTFKLQLTLFASAFGFHELALH--------HHLFHLIVPQKPRTTPANSSAI---AMFKDRSTKGTYNRRFLCYNQK
        +GPHGIFSNIFGN+HS+ D FPA TF+   +   S+     + +         H     I+P+ P + P + + +       +RS +  Y RRFLCY QK
Subjt:  IGPHGIFSNIFGNFHSQLDPFPATTFKLQLTLFASAFGFHELALH--------HHLFHLIVPQKPRTTPANSSAI---AMFKDRSTKGTYNRRFLCYNQK

Query:  EKSLQSREELQPVGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSLSALANLFEICRKQRDFSQFDIYNAML---GFLLVEALVLIEIN
        E+ ++SREELQPV   +K                       + +I+G  +  +  S+   + +  + +K      F    +++   GFLLVEAL+L+EI+
Subjt:  EKSLQSREELQPVGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSLSALANLFEICRKQRDFSQFDIYNAML---GFLLVEALVLIEIN

Query:  VVLWRKKKKKNEEGETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------------SLP-------------WMVCNGRWRRLG------
        VV+   ++KK E GETGM+VISVRTMAQETLGD GGTLATVAYVF G                   +LP              ++  GR R +       
Subjt:  VVLWRKKKKKNEEGETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------------SLP-------------WMVCNGRWRRLG------

Query:  -------------------------------KVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV
                                       KVPTT+PVIIFALVYHDVIPVLCAYL GDLPRLRVSVL+GSFIPLL LLVWDA+A  L AQADQVVDPV
Subjt:  -------------------------------KVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV

KAG7033001.1 Tyrosine-specific transport protein, partial [Cucurbita argyrosperma subsp. argyrosperma]8.7e-5741.25Show/hide
Query:  IGPHGIFSNIFGNFHSQLDPFPATTFKLQLTLFASAFGFHELALH--------HHLFHLIVPQKPRTTPANSSAI---AMFKDRSTKGTYNRRFLCYNQK
        +GPHGIFSNIFGN+HS+ D FPA TF+   +   S+     + +         H     I+P+ P + P + + +       +RS +  Y RRFLCY QK
Subjt:  IGPHGIFSNIFGNFHSQLDPFPATTFKLQLTLFASAFGFHELALH--------HHLFHLIVPQKPRTTPANSSAI---AMFKDRSTKGTYNRRFLCYNQK

Query:  EKSLQSREELQPVGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSLSALANLFEICRKQRDFSQFDIYNAML---GFLLVEALVLIEIN
        E+ ++SREELQPV                        S   + +I+G  +  +  S+   + +  + +K      F    +++   GFLLVEAL+L+EI+
Subjt:  EKSLQSREELQPVGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSLSALANLFEICRKQRDFSQFDIYNAML---GFLLVEALVLIEIN

Query:  VVLWRKKKKKNEEGETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------------SLP-------------WMVCNGRWRRLG------
        VV+   ++KK E GETGM+VISVRTMAQETLGD GGTLATVAYVF G                   +LP              ++  GR R +       
Subjt:  VVLWRKKKKKNEEGETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------------SLP-------------WMVCNGRWRRLG------

Query:  -------------------------------KVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV
                                       KVPTT+PVIIFALVYHDVIPVLCAYL GDLPRLRVSVL+GSFIPLL LLVWDA+A  L AQADQVVDPV
Subjt:  -------------------------------KVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV

XP_022133499.1 uncharacterized protein LOC111006064 isoform X1 [Momordica charantia]1.8e-5443.41Show/hide
Query:  IFGNFHSQLDPFPATTFK---LQLTLFASAFGFHELALHHHLF-----HLIVPQKPRTTPANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQP
        + GNF   +D   +  F+     L   +   G H + +   LF        +   PR+    S  I    DRS K + NRR LC+ QKE+SLQSREELQP
Subjt:  IFGNFHSQLDPFPATTFK---LQLTLFASAFGFHELALHHHLF-----HLIVPQKPRTTPANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQP

Query:  VGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSL-SALANLFEICRKQRDFSQFDIYNAMLGFLLVEALVLIEINVVLW-RKKKKKNEE
        V   +K                       + +++G  +L +  S+ S +  L E       F          GFLL+EAL+LIEINVVLW R+KKKK EE
Subjt:  VGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSL-SALANLFEICRKQRDFSQFDIYNAMLGFLLVEALVLIEINVVLW-RKKKKKNEE

Query:  GETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------------SLP-----------------------------WM-------------
        GETGMEVISVRTM QETLGDCGGTLA+VAYVF G                   +LP                             W+             
Subjt:  GETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------------SLP-----------------------------WM-------------

Query:  --VCNGRWRRL------GKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV
          V +G W  +      GK PTT+PVIIFALVYHDVIPVLCAYL GDL RLRVSVLLGSFIPLLALL+WDA+AL LSAQADQVVDPV
Subjt:  --VCNGRWRRL------GKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV

XP_022133500.1 uncharacterized protein LOC111006064 isoform X2 [Momordica charantia]9.0e-5443.15Show/hide
Query:  IFGNFHSQLDPFPATTFK---LQLTLFASAFGFHELALHHHLF-----HLIVPQKPRTTPANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQP
        + GNF   +D   +  F+     L   +   G H + +   LF        +   PR+    S  I    DRS     NRR LC+ QKE+SLQSREELQP
Subjt:  IFGNFHSQLDPFPATTFK---LQLTLFASAFGFHELALHHHLF-----HLIVPQKPRTTPANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQP

Query:  VGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSL-SALANLFEICRKQRDFSQFDIYNAMLGFLLVEALVLIEINVVLW-RKKKKKNEE
        V   +K                       + +++G  +L +  S+ S +  L E       F          GFLL+EAL+LIEINVVLW R+KKKK EE
Subjt:  VGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSL-SALANLFEICRKQRDFSQFDIYNAMLGFLLVEALVLIEINVVLW-RKKKKKNEE

Query:  GETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------------SLP-----------------------------WM-------------
        GETGMEVISVRTM QETLGDCGGTLA+VAYVF G                   +LP                             W+             
Subjt:  GETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------------SLP-----------------------------WM-------------

Query:  --VCNGRWRRL------GKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV
          V +G W  +      GK PTT+PVIIFALVYHDVIPVLCAYL GDL RLRVSVLLGSFIPLLALL+WDA+AL LSAQADQVVDPV
Subjt:  --VCNGRWRRL------GKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV

XP_022133501.1 uncharacterized protein LOC111006064 isoform X3 [Momordica charantia]1.8e-5443.41Show/hide
Query:  IFGNFHSQLDPFPATTFK---LQLTLFASAFGFHELALHHHLF-----HLIVPQKPRTTPANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQP
        + GNF   +D   +  F+     L   +   G H + +   LF        +   PR+    S  I    DRS K + NRR LC+ QKE+SLQSREELQP
Subjt:  IFGNFHSQLDPFPATTFK---LQLTLFASAFGFHELALHHHLF-----HLIVPQKPRTTPANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQP

Query:  VGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSL-SALANLFEICRKQRDFSQFDIYNAMLGFLLVEALVLIEINVVLW-RKKKKKNEE
        V   +K                       + +++G  +L +  S+ S +  L E       F          GFLL+EAL+LIEINVVLW R+KKKK EE
Subjt:  VGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSL-SALANLFEICRKQRDFSQFDIYNAMLGFLLVEALVLIEINVVLW-RKKKKKNEE

Query:  GETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------------SLP-----------------------------WM-------------
        GETGMEVISVRTM QETLGDCGGTLA+VAYVF G                   +LP                             W+             
Subjt:  GETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------------SLP-----------------------------WM-------------

Query:  --VCNGRWRRL------GKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV
          V +G W  +      GK PTT+PVIIFALVYHDVIPVLCAYL GDL RLRVSVLLGSFIPLLALL+WDA+AL LSAQADQVVDPV
Subjt:  --VCNGRWRRL------GKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV

TrEMBL top hitse value%identityAlignment
A0A1S4E1A6 tyrosine-specific transport protein 1-like isoform X32.0e-5143.15Show/hide
Query:  IVPQKPRTTPANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQPVGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSLSALAN
        ++P+ P  +P   + +   +       YNRR LC+ QKE+ LQS EELQPV   +K                       + +++G  +  +  S+   + 
Subjt:  IVPQKPRTTPANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQPVGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSLSALAN

Query:  LFEICRKQRDFSQFDIYNAML---GFLLVEALVLIEINVVLWR--KKKKKNEEGETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------
        +  I  K      F    +++   GFLLVEALVL+EI+VVLWR  KKKKK EEGETGMEVISVRTMAQETLGD GGTLATV YVF G             
Subjt:  LFEICRKQRDFSQFDIYNAML---GFLLVEALVLIEINVVLWR--KKKKKNEEGETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------

Query:  ------SLP-----------------------------WMVC------------------------NGRWRRLGKVPTTVPVIIFALVYHDVIPVLCAYL
              +LP                             W+                           G WR   KVPTT+PVIIFALVYHDVIPVLCAYL
Subjt:  ------SLP-----------------------------WMVC------------------------NGRWRRLGKVPTTVPVIIFALVYHDVIPVLCAYL

Query:  GGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV
         GDLPRLRVSVLLGS IPLLALLVWD +AL L AQADQ++DPV
Subjt:  GGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV

A0A1S4E201 tyrosine-specific transport protein 1-like isoform X12.0e-5143.15Show/hide
Query:  IVPQKPRTTPANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQPVGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSLSALAN
        ++P+ P  +P   + +   +       YNRR LC+ QKE+ LQS EELQPV   +K                       + +++G  +  +  S+   + 
Subjt:  IVPQKPRTTPANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQPVGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSLSALAN

Query:  LFEICRKQRDFSQFDIYNAML---GFLLVEALVLIEINVVLWR--KKKKKNEEGETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------
        +  I  K      F    +++   GFLLVEALVL+EI+VVLWR  KKKKK EEGETGMEVISVRTMAQETLGD GGTLATV YVF G             
Subjt:  LFEICRKQRDFSQFDIYNAML---GFLLVEALVLIEINVVLWR--KKKKKNEEGETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------

Query:  ------SLP-----------------------------WMVC------------------------NGRWRRLGKVPTTVPVIIFALVYHDVIPVLCAYL
              +LP                             W+                           G WR   KVPTT+PVIIFALVYHDVIPVLCAYL
Subjt:  ------SLP-----------------------------WMVC------------------------NGRWRRLGKVPTTVPVIIFALVYHDVIPVLCAYL

Query:  GGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV
         GDLPRLRVSVLLGS IPLLALLVWD +AL L AQADQ++DPV
Subjt:  GGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV

A0A6J1BVA3 uncharacterized protein LOC111006064 isoform X38.8e-5543.41Show/hide
Query:  IFGNFHSQLDPFPATTFK---LQLTLFASAFGFHELALHHHLF-----HLIVPQKPRTTPANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQP
        + GNF   +D   +  F+     L   +   G H + +   LF        +   PR+    S  I    DRS K + NRR LC+ QKE+SLQSREELQP
Subjt:  IFGNFHSQLDPFPATTFK---LQLTLFASAFGFHELALHHHLF-----HLIVPQKPRTTPANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQP

Query:  VGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSL-SALANLFEICRKQRDFSQFDIYNAMLGFLLVEALVLIEINVVLW-RKKKKKNEE
        V   +K                       + +++G  +L +  S+ S +  L E       F          GFLL+EAL+LIEINVVLW R+KKKK EE
Subjt:  VGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSL-SALANLFEICRKQRDFSQFDIYNAMLGFLLVEALVLIEINVVLW-RKKKKKNEE

Query:  GETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------------SLP-----------------------------WM-------------
        GETGMEVISVRTM QETLGDCGGTLA+VAYVF G                   +LP                             W+             
Subjt:  GETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------------SLP-----------------------------WM-------------

Query:  --VCNGRWRRL------GKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV
          V +G W  +      GK PTT+PVIIFALVYHDVIPVLCAYL GDL RLRVSVLLGSFIPLLALL+WDA+AL LSAQADQVVDPV
Subjt:  --VCNGRWRRL------GKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV

A0A6J1BVF1 uncharacterized protein LOC111006064 isoform X24.4e-5443.15Show/hide
Query:  IFGNFHSQLDPFPATTFK---LQLTLFASAFGFHELALHHHLF-----HLIVPQKPRTTPANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQP
        + GNF   +D   +  F+     L   +   G H + +   LF        +   PR+    S  I    DRS     NRR LC+ QKE+SLQSREELQP
Subjt:  IFGNFHSQLDPFPATTFK---LQLTLFASAFGFHELALHHHLF-----HLIVPQKPRTTPANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQP

Query:  VGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSL-SALANLFEICRKQRDFSQFDIYNAMLGFLLVEALVLIEINVVLW-RKKKKKNEE
        V   +K                       + +++G  +L +  S+ S +  L E       F          GFLL+EAL+LIEINVVLW R+KKKK EE
Subjt:  VGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSL-SALANLFEICRKQRDFSQFDIYNAMLGFLLVEALVLIEINVVLW-RKKKKKNEE

Query:  GETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------------SLP-----------------------------WM-------------
        GETGMEVISVRTM QETLGDCGGTLA+VAYVF G                   +LP                             W+             
Subjt:  GETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------------SLP-----------------------------WM-------------

Query:  --VCNGRWRRL------GKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV
          V +G W  +      GK PTT+PVIIFALVYHDVIPVLCAYL GDL RLRVSVLLGSFIPLLALL+WDA+AL LSAQADQVVDPV
Subjt:  --VCNGRWRRL------GKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV

A0A6J1BW54 uncharacterized protein LOC111006064 isoform X18.8e-5543.41Show/hide
Query:  IFGNFHSQLDPFPATTFK---LQLTLFASAFGFHELALHHHLF-----HLIVPQKPRTTPANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQP
        + GNF   +D   +  F+     L   +   G H + +   LF        +   PR+    S  I    DRS K + NRR LC+ QKE+SLQSREELQP
Subjt:  IFGNFHSQLDPFPATTFK---LQLTLFASAFGFHELALHHHLF-----HLIVPQKPRTTPANSSAIAMFKDRSTKGTYNRRFLCYNQKEKSLQSREELQP

Query:  VGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSL-SALANLFEICRKQRDFSQFDIYNAMLGFLLVEALVLIEINVVLW-RKKKKKNEE
        V   +K                       + +++G  +L +  S+ S +  L E       F          GFLL+EAL+LIEINVVLW R+KKKK EE
Subjt:  VGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSL-SALANLFEICRKQRDFSQFDIYNAMLGFLLVEALVLIEINVVLW-RKKKKKNEE

Query:  GETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------------SLP-----------------------------WM-------------
        GETGMEVISVRTM QETLGDCGGTLA+VAYVF G                   +LP                             W+             
Subjt:  GETGMEVISVRTMAQETLGDCGGTLATVAYVFSG-------------------SLP-----------------------------WM-------------

Query:  --VCNGRWRRL------GKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV
          V +G W  +      GK PTT+PVIIFALVYHDVIPVLCAYL GDL RLRVSVLLGSFIPLLALL+WDA+AL LSAQADQVVDPV
Subjt:  --VCNGRWRRL------GKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALSLSAQADQVVDPV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G33260.1 Tryptophan/tyrosine permease7.0e-0426.92Show/hide
Query:  RLGKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALSLS-----AQADQVVDPVN
        ++  V   VPV++  L +H + P +C   G  +   R ++L+G  +PL  +L W+ + L L+     A     +DP++
Subjt:  RLGKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVALSLS-----AQADQVVDPVN

AT5G19500.1 Tryptophan/tyrosine permease1.5e-1140Show/hide
Query:  TLATVAYVFSGSLPWMVCNGRWRRLGKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVAL-----SLSAQADQVVDPV
        + A +  V SG L W            VP +VP+I  + VY +V+PVLC  L GDLPR+R +++LG+ IPL   LVWDAV L           +++VDP+
Subjt:  TLATVAYVFSGSLPWMVCNGRWRRLGKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLALLVWDAVAL-----SLSAQADQVVDPV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTGTTCTGATCCGGGTACAGGACACTCCAGGCCCAATTGGGGTGTACCTGATATGCTGTGGATCACGGCAAAGGCCCATGGAATCCAATCTTTTGGAATTGGACC
CCATGGAATCTTCTCCAACATATTTGGAAACTTCCATTCACAATTGGATCCATTTCCAGCCACTACTTTCAAGCTTCAACTGACCCTCTTCGCCTCTGCATTTGGCTTCC
ATGAACTTGCACTCCATCATCATCTCTTCCACCTTATTGTGCCCCAAAAGCCACGAACGACGCCTGCGAACTCTTCCGCCATCGCCATGTTCAAGGATCGCTCTACAAAA
GGCACCTACAACCGTCGCTTTCTCTGCTACAACCAGAAAGAGAAAAGCCTTCAATCAAGAGAAGAGCTACAGCCTGTAGGGCGCCTGAAAAAAAGGAAATGTAGCAGGAG
CCATGGCTCTCGTAATTGGCACCAGTATCGGATCAGGGTTTCTTGCACTTCCAGAGAAAGCATCTCCGGCTGTAACTCTCTTTCCCTCTCTTTCTCTCTCTCGGCTTTAG
CAAATTTATTCGAAATATGTAGAAAACAGAGGGACTTTTCCCAGTTCGATATCTATAATGCTATGTTGGGGTTTCTTCTAGTAGAAGCACTCGTGCTCATTGAAATTAAT
GTGGTTCTGTGGAGGAAGAAGAAGAAGAAGAATGAAGAGGGAGAGACGGGGATGGAGGTGATTTCCGTCAGGACTATGGCGCAGGAGACGCTAGGGGACTGCGGTGGAAC
CCTGGCCACTGTTGCCTATGTTTTCTCGGGCTCACTTCCATGGATGGTTTGCAATGGACGGTGGAGGAGACTGGGGAAGGTCCCAACTACAGTACCTGTCATAATCTTCG
CTTTGGTATATCATGATGTAATACCAGTTCTTTGTGCTTATTTGGGTGGTGACCTTCCTCGCCTAAGAGTTTCAGTTTTGCTTGGTAGCTTTATTCCATTGCTAGCATTG
CTTGTTTGGGATGCAGTTGCGCTTAGCTTGTCAGCGCAAGCTGATCAAGTGGTTGATCCTGTGAATTGCTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTTGTTCTGATCCGGGTACAGGACACTCCAGGCCCAATTGGGGTGTACCTGATATGCTGTGGATCACGGCAAAGGCCCATGGAATCCAATCTTTTGGAATTGGACC
CCATGGAATCTTCTCCAACATATTTGGAAACTTCCATTCACAATTGGATCCATTTCCAGCCACTACTTTCAAGCTTCAACTGACCCTCTTCGCCTCTGCATTTGGCTTCC
ATGAACTTGCACTCCATCATCATCTCTTCCACCTTATTGTGCCCCAAAAGCCACGAACGACGCCTGCGAACTCTTCCGCCATCGCCATGTTCAAGGATCGCTCTACAAAA
GGCACCTACAACCGTCGCTTTCTCTGCTACAACCAGAAAGAGAAAAGCCTTCAATCAAGAGAAGAGCTACAGCCTGTAGGGCGCCTGAAAAAAAGGAAATGTAGCAGGAG
CCATGGCTCTCGTAATTGGCACCAGTATCGGATCAGGGTTTCTTGCACTTCCAGAGAAAGCATCTCCGGCTGTAACTCTCTTTCCCTCTCTTTCTCTCTCTCGGCTTTAG
CAAATTTATTCGAAATATGTAGAAAACAGAGGGACTTTTCCCAGTTCGATATCTATAATGCTATGTTGGGGTTTCTTCTAGTAGAAGCACTCGTGCTCATTGAAATTAAT
GTGGTTCTGTGGAGGAAGAAGAAGAAGAAGAATGAAGAGGGAGAGACGGGGATGGAGGTGATTTCCGTCAGGACTATGGCGCAGGAGACGCTAGGGGACTGCGGTGGAAC
CCTGGCCACTGTTGCCTATGTTTTCTCGGGCTCACTTCCATGGATGGTTTGCAATGGACGGTGGAGGAGACTGGGGAAGGTCCCAACTACAGTACCTGTCATAATCTTCG
CTTTGGTATATCATGATGTAATACCAGTTCTTTGTGCTTATTTGGGTGGTGACCTTCCTCGCCTAAGAGTTTCAGTTTTGCTTGGTAGCTTTATTCCATTGCTAGCATTG
CTTGTTTGGGATGCAGTTGCGCTTAGCTTGTCAGCGCAAGCTGATCAAGTGGTTGATCCTGTGAATTGCTCTTGA
Protein sequenceShow/hide protein sequence
MVCSDPGTGHSRPNWGVPDMLWITAKAHGIQSFGIGPHGIFSNIFGNFHSQLDPFPATTFKLQLTLFASAFGFHELALHHHLFHLIVPQKPRTTPANSSAIAMFKDRSTK
GTYNRRFLCYNQKEKSLQSREELQPVGRLKKRKCSRSHGSRNWHQYRIRVSCTSRESISGCNSLSLSFSLSALANLFEICRKQRDFSQFDIYNAMLGFLLVEALVLIEIN
VVLWRKKKKKNEEGETGMEVISVRTMAQETLGDCGGTLATVAYVFSGSLPWMVCNGRWRRLGKVPTTVPVIIFALVYHDVIPVLCAYLGGDLPRLRVSVLLGSFIPLLAL
LVWDAVALSLSAQADQVVDPVNCS