; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016839 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016839
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionAnkyrin repeat-containing protein
Genome locationchr12:41759717..41760547
RNA-Seq ExpressionLag0016839
SyntenyLag0016839
Gene Ontology termsNA
InterPro domainsIPR026961 - PGG domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581828.1 hypothetical protein SDJN03_21830, partial [Cucurbita argyrosperma subsp. sororia]1.9e-6758.08Show/hide
Query:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ
        LLS+ EVKT  N  NI  SN  K SL +Q     N  ++RRESAS       G WK W+KKLKYKGDWV+EVQGTMMLVATVIATVTFQA +NP GGVWQ
Subjt:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ

Query:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV
        QDT YNSN+Y ++ Y  +     L NGT+ PAGTAI  +++ + Y IY MAN +SF+AS  VILLIIS+ PLKNRVCSWLL LAMC AV+FL L +L G 
Subjt:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV

Query:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKSCSF
         MVN     D  A IG+ +A Y  +G++ +VG+ ++IRFLVWVV SLL  F SK+K+ SF
Subjt:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKSCSF

XP_022955455.1 uncharacterized protein LOC111457477 isoform X1 [Cucurbita moschata]8.8e-6556.15Show/hide
Query:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ
        LLS+ EVKT  +  NI ASN  K SL      S+N K++R+ESAS       G WK W+KKLKYKGDWV+EV+GTMMLVATVIATVTFQA +NP GGVWQ
Subjt:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ

Query:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV
        QDT YNSN+Y ++ Y  +     L NGT+ PAG+AI  +++ + Y IY MAN +SF+AS  VILLIIS+ PLKNRVCSW+L LAMC AV+FL L +L G 
Subjt:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV

Query:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKSCSF
         MVN     D  A IG+ +A Y  +G++ +VG+ ++IRFLVWVV  LL  F S +K+ SF
Subjt:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKSCSF

XP_022955457.1 uncharacterized protein LOC111457477 isoform X2 [Cucurbita moschata]3.1e-6255.38Show/hide
Query:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ
        LLS+ EVKT      I+ SN  K SL      S+N K++R+ESAS       G WK W+KKLKYKGDWV+EV+GTMMLVATVIATVTFQA +NP GGVWQ
Subjt:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ

Query:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV
        QDT YNSN+Y ++ Y  +     L NGT+ PAG+AI  +++ + Y IY MAN +SF+AS  VILLIIS+ PLKNRVCSW+L LAMC AV+FL L +L G 
Subjt:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV

Query:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKSCSF
         MVN     D  A IG+ +A Y  +G++ +VG+ ++IRFLVWVV  LL  F S +K+ SF
Subjt:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKSCSF

XP_022955459.1 ankyrin repeat-containing protein At5g02620-like isoform X4 [Cucurbita moschata]3.1e-6255.38Show/hide
Query:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ
        LLS+ EVKT      I+ SN  K SL      S+N K++R+ESAS       G WK W+KKLKYKGDWV+EV+GTMMLVATVIATVTFQA +NP GGVWQ
Subjt:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ

Query:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV
        QDT YNSN+Y ++ Y  +     L NGT+ PAG+AI  +++ + Y IY MAN +SF+AS  VILLIIS+ PLKNRVCSW+L LAMC AV+FL L +L G 
Subjt:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV

Query:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKSCSF
         MVN     D  A IG+ +A Y  +G++ +VG+ ++IRFLVWVV  LL  F S +K+ SF
Subjt:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKSCSF

XP_022980639.1 ankyrin repeat-containing protein NPR4-like isoform X1 [Cucurbita maxima]7.7e-6155.38Show/hide
Query:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ
        LLS+ EVKT  N  N+      K SL +Q     N  ++RRESAS       G WK W+KKLKYKGDWV+EVQG MMLVATVIATVTFQA +NP GGVWQ
Subjt:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ

Query:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV
        QDT YNSN+YI + Y +      + NGT+ PAG+AI  +++ + Y IY +AN +SF+AS SVILLIIS+ PLKNRVCSW+L LAMC AV+FL L +L G 
Subjt:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV

Query:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKSCSF
         MVN        AV+G    YY  +G++ +VG+ ++IRFLVW VKSLL  F SK+K+ SF
Subjt:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKSCSF

TrEMBL top hitse value%identityAlignment
A0A6J1GTP7 uncharacterized protein LOC111457477 isoform X21.5e-6255.38Show/hide
Query:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ
        LLS+ EVKT      I+ SN  K SL      S+N K++R+ESAS       G WK W+KKLKYKGDWV+EV+GTMMLVATVIATVTFQA +NP GGVWQ
Subjt:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ

Query:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV
        QDT YNSN+Y ++ Y  +     L NGT+ PAG+AI  +++ + Y IY MAN +SF+AS  VILLIIS+ PLKNRVCSW+L LAMC AV+FL L +L G 
Subjt:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV

Query:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKSCSF
         MVN     D  A IG+ +A Y  +G++ +VG+ ++IRFLVWVV  LL  F S +K+ SF
Subjt:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKSCSF

A0A6J1GV64 uncharacterized protein LOC111457477 isoform X14.3e-6556.15Show/hide
Query:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ
        LLS+ EVKT  +  NI ASN  K SL      S+N K++R+ESAS       G WK W+KKLKYKGDWV+EV+GTMMLVATVIATVTFQA +NP GGVWQ
Subjt:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ

Query:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV
        QDT YNSN+Y ++ Y  +     L NGT+ PAG+AI  +++ + Y IY MAN +SF+AS  VILLIIS+ PLKNRVCSW+L LAMC AV+FL L +L G 
Subjt:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV

Query:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKSCSF
         MVN     D  A IG+ +A Y  +G++ +VG+ ++IRFLVWVV  LL  F S +K+ SF
Subjt:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKSCSF

A0A6J1GWC0 ankyrin repeat-containing protein At5g02620-like isoform X41.5e-6255.38Show/hide
Query:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ
        LLS+ EVKT      I+ SN  K SL      S+N K++R+ESAS       G WK W+KKLKYKGDWV+EV+GTMMLVATVIATVTFQA +NP GGVWQ
Subjt:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ

Query:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV
        QDT YNSN+Y ++ Y  +     L NGT+ PAG+AI  +++ + Y IY MAN +SF+AS  VILLIIS+ PLKNRVCSW+L LAMC AV+FL L +L G 
Subjt:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV

Query:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKSCSF
         MVN     D  A IG+ +A Y  +G++ +VG+ ++IRFLVWVV  LL  F S +K+ SF
Subjt:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKSCSF

A0A6J1IX06 ankyrin repeat-containing protein NPR4-like isoform X28.3e-6155.25Show/hide
Query:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ
        LLS+ EVKT  N  N+      K SL +Q     N  ++RRESAS       G WK W+KKLKYKGDWV+EVQG MMLVATVIATVTFQA +NP GGVWQ
Subjt:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ

Query:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV
        QDT YNSN+YI + Y +      + NGT+ PAG+AI  +++ + Y IY +AN +SF+AS SVILLIIS+ PLKNRVCSW+L LAMC AV+FL L +L G 
Subjt:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV

Query:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKS
         MVN        AVIG+ +A Y  +G++ +VG+ ++IRFLVWVVKSL   F SK ++
Subjt:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKS

A0A6J1IZU6 ankyrin repeat-containing protein NPR4-like isoform X13.7e-6155.38Show/hide
Query:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ
        LLS+ EVKT  N  N+      K SL +Q     N  ++RRESAS       G WK W+KKLKYKGDWV+EVQG MMLVATVIATVTFQA +NP GGVWQ
Subjt:  LLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ

Query:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV
        QDT YNSN+YI + Y +      + NGT+ PAG+AI  +++ + Y IY +AN +SF+AS SVILLIIS+ PLKNRVCSW+L LAMC AV+FL L +L G 
Subjt:  QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGV

Query:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKSCSF
         MVN        AV+G    YY  +G++ +VG+ ++IRFLVW VKSLL  F SK+K+ SF
Subjt:  SMVNCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKSCSF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G13950.1 unknown protein8.1e-1632.11Show/hide
Query:  KKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQQDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQ--QNIIYKIYIMANTISFL
        K LK +GDW+E+ +G +M+ ATVIA ++FQ  +NPPGGVWQ D     N     +    P  +G        AGTA+  ++  + I Y   I+++T+SF 
Subjt:  KKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQQDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQ--QNIIYKIYIMANTISFL

Query:  ASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGVSMVNC---TIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVV
         S S+ILL+IS   L+NR+   +L   M +AV+ ++  +   + +V      I Y  +  +GF      W     ++ +  ++RF+ W++
Subjt:  ASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGVSMVNC---TIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVV

AT4G13266.1 unknown protein2.9e-1328.88Show/hide
Query:  LKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQQDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIH--QQNIIYKIYIMANTISFLAS
        L ++GDW+E+ +G +++ ATVIA ++F   +NPPGGVWQ +   +      Q  +     EG         GT+I  H   + I Y   +++N +SF AS
Subjt:  LKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQQDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIH--QQNIIYKIYIMANTISFLAS

Query:  ASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGVSMV-NCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVK
          +I L+I  F  +NR+   ++ + M +AV+ ++  +     +V +   F +++  I  G     W  +  +V +  ++RFL WV++
Subjt:  ASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGVSMV-NCTIFYDQVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVK

AT5G50140.1 Ankyrin repeat family protein5.4e-0423.08Show/hide
Query:  KKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQQDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLAS
        K+ + + + ++  + T+ +VA +IA+VTF   +NPPGGV+Q                     +G   G     GT        + +K++ ++N+I+   S
Subjt:  KKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQQDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLAS

Query:  ASVILLIISQFPLKNRVCSWLLTLA---MCMAVIFLTLGYLLG
          +++L++S  P + +     L +    + +AVI +   Y+ G
Subjt:  ASVILLIISQFPLKNRVCSWLLTLA---MCMAVIFLTLGYLLG

AT5G51160.1 Ankyrin repeat family protein2.6e-0626.77Show/hide
Query:  DWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQQDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLI
        D   E +  +++VA+++AT TFQA++ PPGG WQ     +S   + Q  + +       N     AG +I      + + +++  NTI F  S S++ ++
Subjt:  DWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQQDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLI

Query:  ISQFPLKNRVCSWLLTLAMCMAVIFLT
           FPL+         L +CM  ++ +
Subjt:  ISQFPLKNRVCSWLLTLAMCMAVIFLT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCTGCTGGACTTGTTTCCTTAATAACTTACTCTCGATTCCAGAAGTAAAAACTGGAACAAACATTGCGAACATTGCAGCATCAAATGCCATGAAAGAAAGTTTAGG
AACCCAAAAGAAAAGGTCAAAAAACCACAAAAGAGAAAGACGTGAATCTGCATCTTTGTCCGCCAAAAAGACGAAAGGGTGTTGGAAGATATGGAAGAAGAAATTGAAAT
ACAAAGGGGATTGGGTTGAAGAAGTGCAAGGCACAATGATGCTAGTGGCTACCGTGATAGCAACTGTGACTTTTCAAGCTGCAATCAATCCTCCCGGCGGCGTTTGGCAA
CAAGACACCCAATACAATTCCAACGACTATATCAAACAACGTTATTCATTGTTGCCCTTATATGAAGGCTTGGACAACGGGACAGTTTTCCCAGCTGGAACTGCAATAAA
GATTCATCAGCAAAACATAATTTACAAGATTTACATAATGGCAAACACAATATCGTTTTTGGCATCGGCGAGCGTGATTCTGCTAATCATCAGTCAGTTTCCACTCAAAA
ATAGGGTTTGTAGTTGGTTGTTGACACTGGCCATGTGTATGGCGGTGATCTTCTTAACACTTGGATATTTGCTGGGAGTTTCAATGGTTAACTGCACAATTTTTTATGAT
CAAGTTGCAGTCATTGGATTTGGAGTAGCATATTACTCTTGGTACGGGATGATTTTTGTGGTTGGTATTTGCTACATGATTCGTTTTCTGGTTTGGGTGGTCAAGAGCCT
ATTGCGCACGTTCAAATCCAAGTATAAATCCTGCAGCTTCAACAACGACGCCACACCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTCTGCTGGACTTGTTTCCTTAATAACTTACTCTCGATTCCAGAAGTAAAAACTGGAACAAACATTGCGAACATTGCAGCATCAAATGCCATGAAAGAAAGTTTAGG
AACCCAAAAGAAAAGGTCAAAAAACCACAAAAGAGAAAGACGTGAATCTGCATCTTTGTCCGCCAAAAAGACGAAAGGGTGTTGGAAGATATGGAAGAAGAAATTGAAAT
ACAAAGGGGATTGGGTTGAAGAAGTGCAAGGCACAATGATGCTAGTGGCTACCGTGATAGCAACTGTGACTTTTCAAGCTGCAATCAATCCTCCCGGCGGCGTTTGGCAA
CAAGACACCCAATACAATTCCAACGACTATATCAAACAACGTTATTCATTGTTGCCCTTATATGAAGGCTTGGACAACGGGACAGTTTTCCCAGCTGGAACTGCAATAAA
GATTCATCAGCAAAACATAATTTACAAGATTTACATAATGGCAAACACAATATCGTTTTTGGCATCGGCGAGCGTGATTCTGCTAATCATCAGTCAGTTTCCACTCAAAA
ATAGGGTTTGTAGTTGGTTGTTGACACTGGCCATGTGTATGGCGGTGATCTTCTTAACACTTGGATATTTGCTGGGAGTTTCAATGGTTAACTGCACAATTTTTTATGAT
CAAGTTGCAGTCATTGGATTTGGAGTAGCATATTACTCTTGGTACGGGATGATTTTTGTGGTTGGTATTTGCTACATGATTCGTTTTCTGGTTTGGGTGGTCAAGAGCCT
ATTGCGCACGTTCAAATCCAAGTATAAATCCTGCAGCTTCAACAACGACGCCACACCTTAA
Protein sequenceShow/hide protein sequence
MLCWTCFLNNLLSIPEVKTGTNIANIAASNAMKESLGTQKKRSKNHKRERRESASLSAKKTKGCWKIWKKKLKYKGDWVEEVQGTMMLVATVIATVTFQAAINPPGGVWQ
QDTQYNSNDYIKQRYSLLPLYEGLDNGTVFPAGTAIKIHQQNIIYKIYIMANTISFLASASVILLIISQFPLKNRVCSWLLTLAMCMAVIFLTLGYLLGVSMVNCTIFYD
QVAVIGFGVAYYSWYGMIFVVGICYMIRFLVWVVKSLLRTFKSKYKSCSFNNDATP