; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011104 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011104
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr1:14595801..14597411
RNA-Seq ExpressionLag0011104
SyntenyLag0011104
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]1.5e-5242.11Show/hide
Query:  MAFEDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIP-NGLTVEKLGVNLFLFSLKSEEEQTRILKQEP
        MA  +LLE W+ F LT EE+   VD+D  A   TGK L  SLI KL+  R IS  V++   K AW +     +V+ +G N+FLF+     ++ RIL+  P
Subjt:  MAFEDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIP-NGLTVEKLGVNLFLFSLKSEEEQTRILKQEP

Query:  WLFDKFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLDDPLGSCW
        W FD+ ++++  P+ + K   MDF++V+ WVHFF+L +   N  MA RLGNA+G F D ++      W   +RVRV+ D+ KPL RGIK+ LD P+G CW
Subjt:  WLFDKFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLDDPLGSCW

Query:  SSIRYEKLPELCSFCGIIGHTAHNCSSFYMDSGSSSQRHQYGMWLQY
          I+YE+LP+    CG + H   +CS   +D  S S+  QYG WL++
Subjt:  SSIRYEKLPELCSFCGIIGHTAHNCSSFYMDSGSSSQRHQYGMWLQY

XP_022149484.1 uncharacterized protein LOC111017902 [Momordica charantia]2.4e-3434.69Show/hide
Query:  EDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIPNGLTVEKLGVNLFLFSLKSEEEQTRILKQEPWLFD
        +++ + WE F  T +E  T V +DR    +T  ++   ++ KL   + IS + +R   KS W + N    E LG+N+++   KS  E++R+L   PW F+
Subjt:  EDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIPNGLTVEKLGVNLFLFSLKSEEEQTRILKQEPWLFD

Query:  KFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWK-ESIRVRVQIDITKPLRRGIKVRLDDPLGSCWSSI
        K +LVL+ P    +   M+F    FW+    +P +  ++ MA  LG  +G   + +  G   GW    IRVRV+ID++KPLRRGIK++  D     W  +
Subjt:  KFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWK-ESIRVRVQIDITKPLRRGIKVRLDDPLGSCWSSI

Query:  RYEKLPELCSFCGIIGHTAHNCSSFYMDSGSSSQRHQYGMWLQYT
        RYEKLP+ C  CG IGH+   C        +++   QYG WL+ T
Subjt:  RYEKLPELCSFCGIIGHTAHNCSSFYMDSGSSSQRHQYGMWLQYT

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]9.4e-6346.15Show/hide
Query:  MAFEDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIPNGLTVEKLGVNLFLFSLKSEEEQTRILKQEPW
        M  E+LL +W+KF LT EE+   +DVD  A  +  + L +SL+GKL+  R+IS DV+ +    AW + + LTVE +G NLFLF    E +  R++K  PW
Subjt:  MAFEDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIPNGLTVEKLGVNLFLFSLKSEEEQTRILKQEPW

Query:  LFDKFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLDDPLGSCWS
         FDK ++VL KP        ++F  V FW+H F+LPM   N  MA RLGNA+G FVD D   +   W  S+R+RV IDITKPLRRGIK+ +D P+G CW 
Subjt:  LFDKFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLDDPLGSCWS

Query:  SIRYEKLPELCSFCGIIGHTAHNCSSFYMDSGSSSQ-RHQYGMWLQY
         I+YE+LP+ C FCG+IGH++H+C + Y+ +   S+   +YG WL++
Subjt:  SIRYEKLPELCSFCGIIGHTAHNCSSFYMDSGSSSQ-RHQYGMWLQY

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]8.3e-5136.47Show/hide
Query:  MAFEDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIP-NGLTVEKLGVNLFLFSLKSEEEQTRILKQEP
        MA  DLLE W+ F LT EEE T +DVD  A A TG  L   L+GKL   R I+  VM+   ++AW +  N   V+ LG NLFLFS     ++ +I K  P
Subjt:  MAFEDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIP-NGLTVEKLGVNLFLFSLKSEEEQTRILKQEP

Query:  WLFDKFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLDDPLGSCW
        W FD+ +++++KP+ ++    +DF  +  WV FF+LP+      MA RLGNA+G F + D       W  ++RVRV +DI+KPLRRGIK+ LD P+G  W
Subjt:  WLFDKFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLDDPLGSCW

Query:  SSIRYEKLPELCSFCGIIGHTAHNCSSFYMDSGSSSQRHQYGMWLQY--TWRTT------------------SFFRSPSTSPLGQNKIMTDIVHGGTCSA
          I+YE+LP+ C  CG+                SS ++HQYG WL+Y  T + T                  SF  S STSP+G           G  SA
Subjt:  SSIRYEKLPELCSFCGIIGHTAHNCSSFYMDSGSSSQRHQYGMWLQY--TWRTT------------------SFFRSPSTSPLGQNKIMTDIVHGGTCSA

Query:  AATG--SIPRKKVGSDDGVLDGQEQSSGTVPMEISPAMENAVTFPVTATNP
         ATG  +IP +   ++      +    G  P+ I    +      ++  NP
Subjt:  AATG--SIPRKKVGSDDGVLDGQEQSSGTVPMEISPAMENAVTFPVTATNP

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis]5.4e-3435.1Show/hide
Query:  EDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIPNGLTVEKLGVNLFLFSLKSEEEQTRILKQEPWLFD
        + LL+     +LT EE+          + + GKS    L+GKL+  R  + + M+    S W    G+ V  +G NLF+F      ++ R+L   PW FD
Subjt:  EDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIPNGLTVEKLGVNLFLFSLKSEEEQTRILKQEPWLFD

Query:  KFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLD--DPLGSCWSS
        K +L+L +  P V+   +    V FWVH   LP+ L N  + E +GNAVG+F+D D       W  ++R+RV +D+ KPLRRG+K+ L   +P+   W  
Subjt:  KFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLD--DPLGSCWSS

Query:  IRYEKLPELCSFCGIIGHTAHNCSSFYMDS-GSSSQRHQYGMWLQ
         +YE+LP  C FCG +GH+   C      + GS     QYG WL+
Subjt:  IRYEKLPELCSFCGIIGHTAHNCSSFYMDS-GSSSQRHQYGMWLQ

TrEMBL top hitse value%identityAlignment
A0A1R3K847 Uncharacterized protein7.6e-3432.51Show/hide
Query:  EDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIPNGLTVEKLGVNLFLFSLKSEEEQTRILKQEPWLFD
        E L   WE FNLT +EE  E+ V+    A +       LIGKL+  R ++ DVMR      W +  GL V ++G  L++F  +SE E+ R+ +Q PW F+
Subjt:  EDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIPNGLTVEKLGVNLFLFSLKSEEEQTRILKQEPWLFD

Query:  KFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLDDPLGSCWSSIR
        K +LVL      +  + +  +   FW+   +LP+     ++ + +G++ G+ ++ D  G +  W + +R+R  +++ KPLRRG+ +   +  G    S R
Subjt:  KFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLDDPLGSCWSSIR

Query:  YEKLPELCSFCGIIGHTAHNC-SSFYMDSGSSSQRHQYGMWLQ
        YEKLP+ C  CG + HT + C  +  M   S   + +YG WL+
Subjt:  YEKLPELCSFCGIIGHTAHNC-SSFYMDSGSSSQRHQYGMWLQ

A0A6J1BSZ1 uncharacterized protein LOC1110054817.3e-5342.11Show/hide
Query:  MAFEDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIP-NGLTVEKLGVNLFLFSLKSEEEQTRILKQEP
        MA  +LLE W+ F LT EE+   VD+D  A   TGK L  SLI KL+  R IS  V++   K AW +     +V+ +G N+FLF+     ++ RIL+  P
Subjt:  MAFEDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIP-NGLTVEKLGVNLFLFSLKSEEEQTRILKQEP

Query:  WLFDKFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLDDPLGSCW
        W FD+ ++++  P+ + K   MDF++V+ WVHFF+L +   N  MA RLGNA+G F D ++      W   +RVRV+ D+ KPL RGIK+ LD P+G CW
Subjt:  WLFDKFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLDDPLGSCW

Query:  SSIRYEKLPELCSFCGIIGHTAHNCSSFYMDSGSSSQRHQYGMWLQY
          I+YE+LP+    CG + H   +CS   +D  S S+  QYG WL++
Subjt:  SSIRYEKLPELCSFCGIIGHTAHNCSSFYMDSGSSSQRHQYGMWLQY

A0A6J1D765 uncharacterized protein LOC1110179021.2e-3434.69Show/hide
Query:  EDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIPNGLTVEKLGVNLFLFSLKSEEEQTRILKQEPWLFD
        +++ + WE F  T +E  T V +DR    +T  ++   ++ KL   + IS + +R   KS W + N    E LG+N+++   KS  E++R+L   PW F+
Subjt:  EDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIPNGLTVEKLGVNLFLFSLKSEEEQTRILKQEPWLFD

Query:  KFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWK-ESIRVRVQIDITKPLRRGIKVRLDDPLGSCWSSI
        K +LVL+ P    +   M+F    FW+    +P +  ++ MA  LG  +G   + +  G   GW    IRVRV+ID++KPLRRGIK++  D     W  +
Subjt:  KFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWK-ESIRVRVQIDITKPLRRGIKVRLDDPLGSCWSSI

Query:  RYEKLPELCSFCGIIGHTAHNCSSFYMDSGSSSQRHQYGMWLQYT
        RYEKLP+ C  CG IGH+   C        +++   QYG WL+ T
Subjt:  RYEKLPELCSFCGIIGHTAHNCSSFYMDSGSSSQRHQYGMWLQYT

A0A6J1DU55 uncharacterized protein LOC1110231354.6e-6346.15Show/hide
Query:  MAFEDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIPNGLTVEKLGVNLFLFSLKSEEEQTRILKQEPW
        M  E+LL +W+KF LT EE+   +DVD  A  +  + L +SL+GKL+  R+IS DV+ +    AW + + LTVE +G NLFLF    E +  R++K  PW
Subjt:  MAFEDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIPNGLTVEKLGVNLFLFSLKSEEEQTRILKQEPW

Query:  LFDKFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLDDPLGSCWS
         FDK ++VL KP        ++F  V FW+H F+LPM   N  MA RLGNA+G FVD D   +   W  S+R+RV IDITKPLRRGIK+ +D P+G CW 
Subjt:  LFDKFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLDDPLGSCWS

Query:  SIRYEKLPELCSFCGIIGHTAHNCSSFYMDSGSSSQ-RHQYGMWLQY
         I+YE+LP+ C FCG+IGH++H+C + Y+ +   S+   +YG WL++
Subjt:  SIRYEKLPELCSFCGIIGHTAHNCSSFYMDSGSSSQ-RHQYGMWLQY

A0A6J1DX30 uncharacterized protein LOC1110248744.0e-5136.47Show/hide
Query:  MAFEDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIP-NGLTVEKLGVNLFLFSLKSEEEQTRILKQEP
        MA  DLLE W+ F LT EEE T +DVD  A A TG  L   L+GKL   R I+  VM+   ++AW +  N   V+ LG NLFLFS     ++ +I K  P
Subjt:  MAFEDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIP-NGLTVEKLGVNLFLFSLKSEEEQTRILKQEP

Query:  WLFDKFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLDDPLGSCW
        W FD+ +++++KP+ ++    +DF  +  WV FF+LP+      MA RLGNA+G F + D       W  ++RVRV +DI+KPLRRGIK+ LD P+G  W
Subjt:  WLFDKFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLDDPLGSCW

Query:  SSIRYEKLPELCSFCGIIGHTAHNCSSFYMDSGSSSQRHQYGMWLQY--TWRTT------------------SFFRSPSTSPLGQNKIMTDIVHGGTCSA
          I+YE+LP+ C  CG+                SS ++HQYG WL+Y  T + T                  SF  S STSP+G           G  SA
Subjt:  SSIRYEKLPELCSFCGIIGHTAHNCSSFYMDSGSSSQRHQYGMWLQY--TWRTT------------------SFFRSPSTSPLGQNKIMTDIVHGGTCSA

Query:  AATG--SIPRKKVGSDDGVLDGQEQSSGTVPMEISPAMENAVTFPVTATNP
         ATG  +IP +   ++      +    G  P+ I    +      ++  NP
Subjt:  AATG--SIPRKKVGSDDGVLDGQEQSSGTVPMEISPAMENAVTFPVTATNP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding1.5e-1022.49Show/hide
Query:  VMRKNFKSAWNIPNGLTVEKLGVNLFLFSLKSEEEQTRILKQEPWLFDKFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKF
        V+ +  +  W     +TV  L    F+   + EEE    L   PW      L++            D  +   WV    +P + Y+  +   +   +G+ 
Subjt:  VMRKNFKSAWNIPNGLTVEKLGVNLFLFSLKSEEEQTRILKQEPWLFDKFILVLSKPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKF

Query:  VDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLDDPLGSCWSSIRYEKLPELCSFCGIIGHTAHNC
        +  D            RV +++++ KPL+  + +  D         + YE L ++CS CGI GH  H+C
Subjt:  VDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLDDPLGSCWSSIRYEKLPELCSFCGIIGHTAHNC

AT5G36228.1 nucleic acid binding;zinc ion binding6.2e-1222.75Show/hide
Query:  NLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIPNGLTVEKLGVNLFLFSLKSEEEQTRILKQEPWLFDKFILVLSKPI
        ++ +  E  E+ +  HA      S   SL+G+++ P+  S +         W +   +    L    F    +SE +    L++ PW+F+++ + L +  
Subjt:  NLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIPNGLTVEKLGVNLFLFSLKSEEEQTRILKQEPWLFDKFILVLSKPI

Query:  PMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLDDPLGSCWSSIRYEKLPELCSF
               + F  +  WVH   +P+   +    E + + +G+ V  D           IRV+V++D T+PLR   +VR             YEKL  +C+ 
Subjt:  PMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLDDPLGSCWSSIRYEKLPELCSF

Query:  CGIIGHTAHNC
        C  + H   +C
Subjt:  CGIIGHTAHNC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTTGAGGACTTATTGGAAAATTGGGAAAAGTTTAACCTCACTGTTGAAGAAGAGGTCACTGAGGTCGATGTCGACCGTCATGCTGCGGCTGTCACTGGCAAGTC
CTTGGGATTTAGCCTCATCGGGAAATTGATCTGTCCAAGAGTCATTTCCGGTGATGTCATGAGGAAAAACTTCAAGTCAGCATGGAATATCCCTAATGGTCTCACCGTCG
AAAAGCTCGGTGTTAATTTATTCTTATTTTCATTGAAGTCGGAAGAGGAGCAAACCAGAATTCTGAAGCAAGAACCGTGGCTATTCGATAAATTCATATTGGTGCTCTCT
AAACCAATTCCCATGGTGAAAGCTCAAGCAATGGATTTTAAATCGGTAACGTTTTGGGTCCATTTCTTTGAACTCCCCATGGATCTATACAATTCTGCTATGGCAGAGCG
TCTTGGCAATGCTGTGGGAAAATTCGTTGATTACGACAATGGAGGACGGCGTCATGGATGGAAGGAAAGCATCCGAGTTCGTGTTCAGATCGATATTACTAAACCTCTCC
GACGGGGTATCAAGGTTAGGCTCGATGACCCATTGGGAAGTTGTTGGTCCTCGATTCGTTACGAGAAACTCCCGGAACTGTGTTCGTTTTGTGGCATTATCGGCCACACG
GCTCACAACTGCAGCTCCTTCTATATGGACAGTGGTTCATCCTCCCAAAGACACCAGTACGGAATGTGGTTACAGTATACATGGCGCACGACAAGCTTCTTTCGATCTCC
GAGCACAAGTCCATTGGGACAAAATAAAATCATGACTGATATTGTACATGGTGGAACTTGCTCTGCGGCGGCAACTGGATCAATCCCTAGGAAAAAAGTAGGTTCGGACG
ATGGTGTTTTGGACGGTCAAGAACAAAGTTCCGGTACTGTGCCGATGGAGATCTCGCCGGCGATGGAAAACGCCGTTACATTCCCCGTAACAGCTACTAATCCAAGTAAT
TATGACGGCAATAATGGCGAAGGTAATGCGGACCAGTCGAGAGTTAAGAAGAAATTGGACTTTGCTGACGTTATAATTTCCCCAATTAATGCACCGTTCCAATCTCAAAA
TCAGCAACCGGCGATTTTTCAGGCCACGACCCTGGACAATAATAAGCCAACATTTAAACCATTGCCTGTTGAGTCTAACAACTCAAATGGACCTGGAATGTATTCTCTCG
GTGGTTTTAATCTCCAGGAGGAAAATAAGAGAAGAGATGCCTATAGGTTCAGCCTCAATCCAATTGTTCAACAAATGGGCCCTTTTAAAGTGGCCCATGAACCTATTGTG
CAACCAGTCCCAAAGACCAATGAAAATGTTGCGAATCCACTCCCAGATGTGAAGCCTAACCCATCTTTGCCGCAAGTGGTTGTGGGCTTGCCTAATTCAAAATCTTGGAA
GAGGCGTGCACGATCTAATTTACCCGGACATGGAACTGCAACGTCGGATCCTCTCAAGAAGCGGATTGGAGATGGGCTCGCTGGTGGGTCAAAAAAAAGGCCACGCATGG
AGAATGAAGATGATGATACTCCACATGAAGAATCGGCGGAGGCTGTTGATCAGCCCCGCCGAGAGCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTTGAGGACTTATTGGAAAATTGGGAAAAGTTTAACCTCACTGTTGAAGAAGAGGTCACTGAGGTCGATGTCGACCGTCATGCTGCGGCTGTCACTGGCAAGTC
CTTGGGATTTAGCCTCATCGGGAAATTGATCTGTCCAAGAGTCATTTCCGGTGATGTCATGAGGAAAAACTTCAAGTCAGCATGGAATATCCCTAATGGTCTCACCGTCG
AAAAGCTCGGTGTTAATTTATTCTTATTTTCATTGAAGTCGGAAGAGGAGCAAACCAGAATTCTGAAGCAAGAACCGTGGCTATTCGATAAATTCATATTGGTGCTCTCT
AAACCAATTCCCATGGTGAAAGCTCAAGCAATGGATTTTAAATCGGTAACGTTTTGGGTCCATTTCTTTGAACTCCCCATGGATCTATACAATTCTGCTATGGCAGAGCG
TCTTGGCAATGCTGTGGGAAAATTCGTTGATTACGACAATGGAGGACGGCGTCATGGATGGAAGGAAAGCATCCGAGTTCGTGTTCAGATCGATATTACTAAACCTCTCC
GACGGGGTATCAAGGTTAGGCTCGATGACCCATTGGGAAGTTGTTGGTCCTCGATTCGTTACGAGAAACTCCCGGAACTGTGTTCGTTTTGTGGCATTATCGGCCACACG
GCTCACAACTGCAGCTCCTTCTATATGGACAGTGGTTCATCCTCCCAAAGACACCAGTACGGAATGTGGTTACAGTATACATGGCGCACGACAAGCTTCTTTCGATCTCC
GAGCACAAGTCCATTGGGACAAAATAAAATCATGACTGATATTGTACATGGTGGAACTTGCTCTGCGGCGGCAACTGGATCAATCCCTAGGAAAAAAGTAGGTTCGGACG
ATGGTGTTTTGGACGGTCAAGAACAAAGTTCCGGTACTGTGCCGATGGAGATCTCGCCGGCGATGGAAAACGCCGTTACATTCCCCGTAACAGCTACTAATCCAAGTAAT
TATGACGGCAATAATGGCGAAGGTAATGCGGACCAGTCGAGAGTTAAGAAGAAATTGGACTTTGCTGACGTTATAATTTCCCCAATTAATGCACCGTTCCAATCTCAAAA
TCAGCAACCGGCGATTTTTCAGGCCACGACCCTGGACAATAATAAGCCAACATTTAAACCATTGCCTGTTGAGTCTAACAACTCAAATGGACCTGGAATGTATTCTCTCG
GTGGTTTTAATCTCCAGGAGGAAAATAAGAGAAGAGATGCCTATAGGTTCAGCCTCAATCCAATTGTTCAACAAATGGGCCCTTTTAAAGTGGCCCATGAACCTATTGTG
CAACCAGTCCCAAAGACCAATGAAAATGTTGCGAATCCACTCCCAGATGTGAAGCCTAACCCATCTTTGCCGCAAGTGGTTGTGGGCTTGCCTAATTCAAAATCTTGGAA
GAGGCGTGCACGATCTAATTTACCCGGACATGGAACTGCAACGTCGGATCCTCTCAAGAAGCGGATTGGAGATGGGCTCGCTGGTGGGTCAAAAAAAAGGCCACGCATGG
AGAATGAAGATGATGATACTCCACATGAAGAATCGGCGGAGGCTGTTGATCAGCCCCGCCGAGAGCCATGA
Protein sequenceShow/hide protein sequence
MAFEDLLENWEKFNLTVEEEVTEVDVDRHAAAVTGKSLGFSLIGKLICPRVISGDVMRKNFKSAWNIPNGLTVEKLGVNLFLFSLKSEEEQTRILKQEPWLFDKFILVLS
KPIPMVKAQAMDFKSVTFWVHFFELPMDLYNSAMAERLGNAVGKFVDYDNGGRRHGWKESIRVRVQIDITKPLRRGIKVRLDDPLGSCWSSIRYEKLPELCSFCGIIGHT
AHNCSSFYMDSGSSSQRHQYGMWLQYTWRTTSFFRSPSTSPLGQNKIMTDIVHGGTCSAAATGSIPRKKVGSDDGVLDGQEQSSGTVPMEISPAMENAVTFPVTATNPSN
YDGNNGEGNADQSRVKKKLDFADVIISPINAPFQSQNQQPAIFQATTLDNNKPTFKPLPVESNNSNGPGMYSLGGFNLQEENKRRDAYRFSLNPIVQQMGPFKVAHEPIV
QPVPKTNENVANPLPDVKPNPSLPQVVVGLPNSKSWKRRARSNLPGHGTATSDPLKKRIGDGLAGGSKKRPRMENEDDDTPHEESAEAVDQPRREP