; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004894 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004894
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionintracellular protein transport protein USO1-like
Genome locationchr6:8175557..8176879
RNA-Seq ExpressionLag0004894
SyntenyLag0004894
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585310.1 hypothetical protein SDJN03_18043, partial [Cucurbita argyrosperma subsp. sororia]1.6e-8263.85Show/hide
Query:  LPQEIDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKKMKAVA
        +PQE+D YIKESID TLGLPVSTQTL LKL +SEAT++RLEAQCLSLQSKLNQ I STK A+ EA +SAQGLKK IE+NE+L +ENS L +Q K+MK +A
Subjt:  LPQEIDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKKMKAVA

Query:  NRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFASMEEVLVESALGDLI-GETESSVGEFLLTNIEHESCRRLLHLSQSLRPSTLRAL
        +RGK A++ AKEAE R R+L DE   LTEDLNFFK+QY+MRKF FASMEEVLVE+ L DL+ G+T  +VG+FL+ N+ ++SCRRLLHLSQSLRPST RAL
Subjt:  NRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFASMEEVLVESALGDLI-GETESSVGEFLLTNIEHESCRRLLHLSQSLRPSTLRAL

Query:  SLAAKIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEDEETEKHAGEKGKGKPPASSSSVK------------------LRENKRPWEPSDSED
        SLAA+IK LEE+  +L+I LS AQGEV+LLSD+IG L+ED+E      +  + KPPASSSSVK                  LR+NKRP E S SED
Subjt:  SLAAKIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEDEETEKHAGEKGKGKPPASSSSVK------------------LRENKRPWEPSDSED

KAG7020228.1 hypothetical protein SDJN02_16911, partial [Cucurbita argyrosperma subsp. argyrosperma]1.4e-8164.38Show/hide
Query:  IDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKKMKAVANRGK
        +D YIKESID TLGLPVSTQTL LKL +SEAT++RLEAQCLSLQSKLNQ I STK A+ EA +SAQGLKK IE+NE+L +ENS L +Q K+MK +A+RGK
Subjt:  IDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKKMKAVANRGK

Query:  SAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFASMEEVLVESALGDLI-GETESSVGEFLLTNIEHESCRRLLHLSQSLRPSTLRALSLAA
         A++ AKEAE R R+L DE   LTEDLNFFK+QY+MRKF FASMEEVLVE+AL DL+ G+T S+VG+FL+ N+ ++SCRRLLHLSQSLRPST RALSLAA
Subjt:  SAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFASMEEVLVESALGDLI-GETESSVGEFLLTNIEHESCRRLLHLSQSLRPSTLRALSLAA

Query:  KIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEDEETEKHAGEKGKGKPPASSSSVK------------------LRENKRPWEPSDSED
        +IK LEE+  +L+I LS AQGEV+LLSD+IG L+ED+E      +  + KPPASSSSVK                  LR+NKRP E S SED
Subjt:  KIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEDEETEKHAGEKGKGKPPASSSSVK------------------LRENKRPWEPSDSED

XP_022131622.1 uncharacterized protein LOC111004753 [Momordica charantia]9.9e-7260.56Show/hide
Query:  LPQEIDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKKMKAVA
        +PQEIDDYIKESI  +LGLPVS  TLELKL++SEA +Q L  QCLSLQSKLN QI++ +++K EAKLSAQ LKK IEKN  L EENS L  QCKK+KA A
Subjt:  LPQEIDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKKMKAVA

Query:  NRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFASMEEVLVESALGDLIGETESSVGEFLLTNIEHESCRRLLHLSQSLRPSTLRALS
        N+ K AV+RA E  ARVRELE E+  LT+DL FF++QY+ RKF+ +SMEE LV+S L   IGETE+   EFLL N  + SCRRLL+LS SL P+T +ALS
Subjt:  NRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFASMEEVLVESALGDLIGETESSVGEFLLTNIEHESCRRLLHLSQSLRPSTLRALS

Query:  LAAKIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEDEETEKHAGEKGK-----GKPPASS--SSVKLRENKRPWEPSDSED
        LAAKIK LE++KE+L++LL TAQ EV  L++QI YL+ED+ +  H  E+G      GKPPASS  +SVKL       E SDS+D
Subjt:  LAAKIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEDEETEKHAGEKGK-----GKPPASS--SSVKLRENKRPWEPSDSED

XP_022951817.1 uncharacterized protein LOC111454550 [Cucurbita moschata]1.3e-5864.15Show/hide
Query:  LSAQGLKKVIEKNEQLREENSGLVEQCKKMKAVANRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFASMEEVLVESALGDLI-GETE
        +SAQGLKK IE+NE+L +ENS L +Q KKMK +A+RGK A++  KEAE R R+L +E   LTEDLNFFK+Q++MRKF FASMEEVLVE+AL DL+  +T 
Subjt:  LSAQGLKKVIEKNEQLREENSGLVEQCKKMKAVANRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFASMEEVLVESALGDLI-GETE

Query:  SSVGEFLLTNIEHESCRRLLHLSQSLRPSTLRALSLAAKIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEDEETEKHAGEKGKGKPPASSSSVKLRE
        S+VG+FL+ N+ ++SCRRLLHLSQSLRPST RAL LAA+IK LEE+  +L+I LS AQGEV+LLSD+IG L+ED+E      +  + KPPASSSSVKLR+
Subjt:  SSVGEFLLTNIEHESCRRLLHLSQSLRPSTLRALSLAAKIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEDEETEKHAGEKGKGKPPASSSSVKLRE

Query:  NKRPWEPSDSED
        NKRPWE   SED
Subjt:  NKRPWEPSDSED

XP_038885825.1 uncharacterized protein LOC120076121 [Benincasa hispida]6.4e-6367.46Show/hide
Query:  EEAKLSAQGLKKVIEKNEQLREENSGLVEQCKKMKAVANRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFASMEEVLVESALGDLIG
        EEAKL+AQGLKK IEKN+QL EENS LVEQCKKMKA  NRGK A++RAKE + RV ELEDEV+ L  DL FFKDQ++M KF F SMEE+LVESAL  LIG
Subjt:  EEAKLSAQGLKKVIEKNEQLREENSGLVEQCKKMKAVANRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFASMEEVLVESALGDLIG

Query:  ETESSVGEFLLTNIEHESCRRLLHLSQSLRPSTLRALSLAAKIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEDEETEKHAGEKGKGKPPASSSSVK
        E ESS  EFLL NIE++SCRRLLHLS+SL P + R L+LAAKIK+LE+D E+L++LL TAQGE  LLSDQIGYL+E++ T       G GKPPA SSS K
Subjt:  ETESSVGEFLLTNIEHESCRRLLHLSQSLRPSTLRALSLAAKIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEDEETEKHAGEKGKGKPPASSSSVK

Query:  LRENKRPWE
        + E KR  E
Subjt:  LRENKRPWE

TrEMBL top hitse value%identityAlignment
A0A1Q3CT80 Uncharacterized protein4.4e-4142.81Show/hide
Query:  LPLPQEIDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKKMK-
        + LPQEIDDYIKESID++LGLP+STQTL+LKL ++E T +RL  Q   L SKL Q+  +   A+ EA ++A  ++K +E+N++L  E + L+ QC K + 
Subjt:  LPLPQEIDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKKMK-

Query:  ----------AVANRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMR-----KFYFASMEEVLVESALGDLIGETESSVGE-FLLTNIEHESC
                  A+ + G  A  RAKEAEARV++LE+E+  L EDL F+K  Y  R          + EE L+ES L  LIG+ E + G  FL  N  +ESC
Subjt:  ----------AVANRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMR-----KFYFASMEEVLVESALGDLIGETESSVGE-FLLTNIEHESC

Query:  RRLLHLSQSLRPSTLRALSLAAKIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEDEETEK-----HAGEKGKGKPPASSSSVKLRENKRPWEPSDS
        ++LL +  SLRPST +ALSLAA++K LE+DKE L + L  A+ EV LL ++   L  DEE ++     H      G     ++S   + NKR   P  S
Subjt:  RRLLHLSQSLRPSTLRALSLAAKIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEDEETEK-----HAGEKGKGKPPASSSSVKLRENKRPWEPSDS

A0A1R3JNQ4 Uncharacterized protein4.8e-4041.64Show/hide
Query:  LPLPQEIDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKK---
        + LPQEIDDYI+E+I+ +LGLP+ST++L+LKL SSE  ++RL  QCL L SKL ++  + + +K EA ++A  LK+ +E+N++L +E + L+ QCKK   
Subjt:  LPLPQEIDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKK---

Query:  --------MKAVANRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMR----KFYFASMEEVLVESALGDLIGETESSVGE-FLLTNIEHESCR
                 +A+ + G  A  RAK AE RV ELE+E+  LT +L F+K  Y  R     +   ++E+ L+ES L  LI E E+  G  FL  N   ESC+
Subjt:  --------MKAVANRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMR----KFYFASMEEVLVESALGDLIGETESSVGE-FLLTNIEHESCR

Query:  RLLHLSQSLRPSTLRALSLAAKIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEDEE--TEKHAGEKG-KGKPPASSSSVKLRENKRPWEP
        RLL +   LRPS+   LSL AK+K LEEDKE L + L+ A+ EV +L ++   L E+ +    +H  EK   G     S+S   + NKR   P
Subjt:  RLLHLSQSLRPSTLRALSLAAKIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEDEE--TEKHAGEKG-KGKPPASSSSVKLRENKRPWEP

A0A2N9IYL3 Uncharacterized protein1.5e-4141.72Show/hide
Query:  LPLPQEIDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKK---
        + LPQEIDDYIK+SID +LGLPVSTQTLELKL  S+  ++RL ++CLSLQ KL ++       + E+ ++AQ LKK +E+N++L  E + L+ QC K   
Subjt:  LPLPQEIDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKK---

Query:  --------MKAVANRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFAS----MEEVLVESALGDLIGETE-SSVGEFLLTNIEHESCR
                 +A+ + G  A  RAKEAE RV ELE+EV+ L+E+L F+K +  MR    ++    +E  L+ES L  LI + + +S   FL  N  HESC+
Subjt:  --------MKAVANRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFAS----MEEVLVESALGDLIGETE-SSVGEFLLTNIEHESCR

Query:  RLLHLSQSLRPSTLRALSLAAKIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVED--------EETEKHAGEKGKGKPPASSSSVKLRENKRPWEPSD
        RLL +   LRPST + LSLAA++K LE+DK+ L I L+ A+ EV LL ++   L E+        ++   H G  GK    AS+ S K + + +   P +
Subjt:  RLLHLSQSLRPSTLRALSLAAKIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVED--------EETEKHAGEKGKGKPPASSSSVKLRENKRPWEPSD

Query:  SE
         +
Subjt:  SE

A0A6J1BRI9 uncharacterized protein LOC1110047534.8e-7260.56Show/hide
Query:  LPQEIDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKKMKAVA
        +PQEIDDYIKESI  +LGLPVS  TLELKL++SEA +Q L  QCLSLQSKLN QI++ +++K EAKLSAQ LKK IEKN  L EENS L  QCKK+KA A
Subjt:  LPQEIDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKKMKAVA

Query:  NRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFASMEEVLVESALGDLIGETESSVGEFLLTNIEHESCRRLLHLSQSLRPSTLRALS
        N+ K AV+RA E  ARVRELE E+  LT+DL FF++QY+ RKF+ +SMEE LV+S L   IGETE+   EFLL N  + SCRRLL+LS SL P+T +ALS
Subjt:  NRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFASMEEVLVESALGDLIGETESSVGEFLLTNIEHESCRRLLHLSQSLRPSTLRALS

Query:  LAAKIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEDEETEKHAGEKGK-----GKPPASS--SSVKLRENKRPWEPSDSED
        LAAKIK LE++KE+L++LL TAQ EV  L++QI YL+ED+ +  H  E+G      GKPPASS  +SVKL       E SDS+D
Subjt:  LAAKIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEDEETEKHAGEKGK-----GKPPASS--SSVKLRENKRPWEPSDSED

A0A6J1GIQ2 uncharacterized protein LOC1114545506.1e-5964.15Show/hide
Query:  LSAQGLKKVIEKNEQLREENSGLVEQCKKMKAVANRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFASMEEVLVESALGDLI-GETE
        +SAQGLKK IE+NE+L +ENS L +Q KKMK +A+RGK A++  KEAE R R+L +E   LTEDLNFFK+Q++MRKF FASMEEVLVE+AL DL+  +T 
Subjt:  LSAQGLKKVIEKNEQLREENSGLVEQCKKMKAVANRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFASMEEVLVESALGDLI-GETE

Query:  SSVGEFLLTNIEHESCRRLLHLSQSLRPSTLRALSLAAKIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEDEETEKHAGEKGKGKPPASSSSVKLRE
        S+VG+FL+ N+ ++SCRRLLHLSQSLRPST RAL LAA+IK LEE+  +L+I LS AQGEV+LLSD+IG L+ED+E      +  + KPPASSSSVKLR+
Subjt:  SSVGEFLLTNIEHESCRRLLHLSQSLRPSTLRALSLAAKIKALEEDKEKLEILLSTAQGEVMLLSDQIGYLVEDEETEKHAGEKGKGKPPASSSSVKLRE

Query:  NKRPWEPSDSED
        NKRPWE   SED
Subjt:  NKRPWEPSDSED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G14680.1 unknown protein2.6e-3037.1Show/hide
Query:  LPLPQEIDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKK---
        + L +E+DD+IKE+ID +LGLP+S   L+ KL ++E +++RL  Q LSL S+L ++       + EA ++AQ LKK +E+N++L  E   LV QCKK   
Subjt:  LPLPQEIDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKK---

Query:  --------MKAVANRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFASMEEVLVESALGDLIGETES-SVGE-FLLTNIEHES-CRRL
                 +++   G     RA+EAE+RVRELE+EV+ +++++          K    S E+ LV+S L   + + ES S+G  FL  N E +  C  L
Subjt:  --------MKAVANRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFASMEEVLVESALGDLIGETES-SVGE-FLLTNIEHES-CRRL

Query:  LHLSQSLRPSTLRALSLAAKIKALEEDKEKLEILLSTAQGEVMLLSDQ
        L     L+P+T + +SL + +K +E++KE L + L+ A+ EV L+S+Q
Subjt:  LHLSQSLRPSTLRALSLAAKIKALEEDKEKLEILLSTAQGEVMLLSDQ

AT4G09060.1 unknown protein1.1e-3338.93Show/hide
Query:  LPQEIDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKKMK---
        L QE+++YIK++ID +LGLP+S ++L+ KL ++E +++RL  Q   L S+L ++       + EA ++AQ LKK +E+N++L  E   L+ QCKK++   
Subjt:  LPQEIDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKKMK---

Query:  --------AVANRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFAS--MEEVLVESALGDLIGETESSVGE-FLLTNIEHESCRRLLH
                A+   G  +  RA+EAEARVRELEDE+  ++E++  FK Q    +    +  +EE L++S LG LI + E+ +G  FL  NI  +SC+ LL 
Subjt:  --------AVANRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFAS--MEEVLVESALGDLIGETESSVGE-FLLTNIEHESCRRLLH

Query:  LSQSLRPSTLRALSLAAKIKALEEDKEKLEILLSTAQGEVMLLS
            L+PST + LSL +K K  E++KE + + L+ A+ EV L+S
Subjt:  LSQSLRPSTLRALSLAAKIKALEEDKEKLEILLSTAQGEVMLLS

AT4G09060.2 unknown protein6.3e-3235.93Show/hide
Query:  LPQEIDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKKMK---
        L QE+++YIK++ID +LGLP+S ++L+ KL ++E +++RL  Q   L S+L ++       + EA ++AQ LKK +E+N++L  E   L+ QCKK++   
Subjt:  LPQEIDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKKMK---

Query:  --------AVANRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYR-------------MRKFYFAS---------------MEEVLVESALGDLI
                A+   G  +  RA+EAEARVRELEDE+  ++E++  FK Q                +K Y  +               +EE L++S LG LI
Subjt:  --------AVANRGKSAVNRAKEAEARVRELEDEVKGLTEDLNFFKDQYR-------------MRKFYFAS---------------MEEVLVESALGDLI

Query:  GETESSVGE-FLLTNIEHESCRRLLHLSQSLRPSTLRALSLAAKIKALEEDKEKLEILLSTAQGEVMLLS
         + E+ +G  FL  NI  +SC+ LL     L+PST + LSL +K K  E++KE + + L+ A+ EV L+S
Subjt:  GETESSVGE-FLLTNIEHESCRRLLHLSQSLRPSTLRALSLAAKIKALEEDKEKLEILLSTAQGEVMLLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCTTCCCCTTCCCCAAGAAATCGATGATTACATCAAAGAATCAATAGACCAGACTCTGGGCCTCCCCGTCTCCACGCAGACTCTGGAATTGAAACTCCTCTCCTC
CGAGGCCACGCGCCAACGCCTCGAAGCCCAATGCCTCTCTTTGCAGTCTAAATTAAACCAACAAATTGTCTCCACTAAAACTGCCAAAGAGGAAGCCAAGTTGAGTGCGC
AGGGCCTGAAGAAAGTCATCGAGAAAAACGAACAATTGAGGGAAGAGAATTCGGGTTTGGTGGAGCAATGCAAGAAAATGAAAGCCGTCGCCAACAGAGGGAAATCCGCC
GTCAATCGCGCCAAGGAAGCGGAAGCTCGCGTGCGTGAGTTGGAAGACGAGGTCAAGGGATTGACAGAGGATTTGAATTTCTTCAAGGATCAATACCGAATGCGAAAGTT
TTATTTTGCTTCCATGGAGGAGGTTCTGGTCGAGTCGGCCTTAGGGGATTTAATTGGAGAAACAGAGTCGTCCGTTGGCGAGTTCTTGTTGACAAATATCGAGCACGAAT
CGTGTCGAAGGCTTCTACATTTGTCGCAAAGTTTGAGACCTTCAACTCTGAGAGCTCTGTCTCTGGCGGCTAAAATCAAGGCTCTCGAGGAAGATAAGGAAAAACTCGAA
ATCCTCCTCTCCACAGCCCAAGGGGAGGTCATGCTGTTGTCGGATCAAATCGGTTATCTCGTGGAGGATGAAGAAACAGAGAAACATGCTGGAGAGAAAGGGAAAGGGAA
ACCACCGGCAAGCAGTTCCTCTGTGAAGTTACGTGAAAATAAAAGACCTTGGGAACCGTCTGATTCGGAAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTCTTCCCCTTCCCCAAGAAATCGATGATTACATCAAAGAATCAATAGACCAGACTCTGGGCCTCCCCGTCTCCACGCAGACTCTGGAATTGAAACTCCTCTCCTC
CGAGGCCACGCGCCAACGCCTCGAAGCCCAATGCCTCTCTTTGCAGTCTAAATTAAACCAACAAATTGTCTCCACTAAAACTGCCAAAGAGGAAGCCAAGTTGAGTGCGC
AGGGCCTGAAGAAAGTCATCGAGAAAAACGAACAATTGAGGGAAGAGAATTCGGGTTTGGTGGAGCAATGCAAGAAAATGAAAGCCGTCGCCAACAGAGGGAAATCCGCC
GTCAATCGCGCCAAGGAAGCGGAAGCTCGCGTGCGTGAGTTGGAAGACGAGGTCAAGGGATTGACAGAGGATTTGAATTTCTTCAAGGATCAATACCGAATGCGAAAGTT
TTATTTTGCTTCCATGGAGGAGGTTCTGGTCGAGTCGGCCTTAGGGGATTTAATTGGAGAAACAGAGTCGTCCGTTGGCGAGTTCTTGTTGACAAATATCGAGCACGAAT
CGTGTCGAAGGCTTCTACATTTGTCGCAAAGTTTGAGACCTTCAACTCTGAGAGCTCTGTCTCTGGCGGCTAAAATCAAGGCTCTCGAGGAAGATAAGGAAAAACTCGAA
ATCCTCCTCTCCACAGCCCAAGGGGAGGTCATGCTGTTGTCGGATCAAATCGGTTATCTCGTGGAGGATGAAGAAACAGAGAAACATGCTGGAGAGAAAGGGAAAGGGAA
ACCACCGGCAAGCAGTTCCTCTGTGAAGTTACGTGAAAATAAAAGACCTTGGGAACCGTCTGATTCGGAAGATTGA
Protein sequenceShow/hide protein sequence
MSLPLPQEIDDYIKESIDQTLGLPVSTQTLELKLLSSEATRQRLEAQCLSLQSKLNQQIVSTKTAKEEAKLSAQGLKKVIEKNEQLREENSGLVEQCKKMKAVANRGKSA
VNRAKEAEARVRELEDEVKGLTEDLNFFKDQYRMRKFYFASMEEVLVESALGDLIGETESSVGEFLLTNIEHESCRRLLHLSQSLRPSTLRALSLAAKIKALEEDKEKLE
ILLSTAQGEVMLLSDQIGYLVEDEETEKHAGEKGKGKPPASSSSVKLRENKRPWEPSDSED