; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038853 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038853
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111022007
Genome locationchr2:29221895..29229142
RNA-Seq ExpressionLag0038853
SyntenyLag0038853
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]3.6e-3035.16Show/hide
Query:  PQFLGTGIADHGWELFCANPKSVNAQVVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLS
        P F+   I  HGW  FC +P +    +VREFYAN++  +   V V+ V+V ++  AIN+++ ++      Y + A   +DEQL   + EV IEGA WQ+S
Subjt:  PQFLGTGIADHGWELFCANPKSVNAQVVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLS

Query:  KTQKRTFQSAYLKREANTWMGFIRQKMLPTTHDSTISRERVLLAFAILRSLSINVGRIIANEISGC-WKKKVGKLFFPNTITMLCRRAGVPVDEGYVILF
             T     LKR A  W  F+  + +P+TH  T++++RVLL ++IL  +S+N+  I   EI  C   +K G L+FP+ IT L  +A VP  +   I+ 
Subjt:  KTQKRTFQSAYLKREANTWMGFIRQKMLPTTHDSTISRERVLLAFAILRSLSINVGRIIANEISGC-WKKKVGKLFFPNTITMLCRRAGVPVDEGYVILF

Query:  DKGIIDTPNLALLQRVQEV
        + G I T +++ + + + V
Subjt:  DKGIIDTPNLALLQRVQEV

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.1e-2935.38Show/hide
Query:  KEREETKKKIEEEQVKYA-ELRKRDFLFECGF-------SGDLPQFLGTGIADHGWELFCANPKSVNAQVVREFYANIVKEDGFQVIVRGVEVDWSPSAI
        K  +  K + E  + +Y   ++ R    E GF        G LP F+   I  H W+ FCA+P+     +VREFYAN+       V VRGV+V WS  AI
Subjt:  KEREETKKKIEEEQVKYA-ELRKRDFLFECGF-------SGDLPQFLGTGIADHGWELFCANPKSVNAQVVREFYANIVKEDGFQVIVRGVEVDWSPSAI

Query:  NALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQKMLPTTHDSTISRERVLLAFAILRSLSINVGR
        NA++ + + P   ++E     ++  L   +  V + GA+W +S     T   + L   A  W  F++  +LPTTH  T+S++R+LL  ++L   SINVGR
Subjt:  NALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQKMLPTTHDSTISRERVLLAFAILRSLSINVGR

Query:  IIANEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGYVILFDKGIIDTPNLALLQRVQE
        +I +EI  C  +K G LFFP+ IT LCR A  P       L + G ID   +A+ +  QE
Subjt:  IIANEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGYVILFDKGIIDTPNLALLQRVQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.8e-3832.16Show/hide
Query:  EREKAEREEREKNEAEDKEREETKKKIEEEQVKYA-ELRKRDFLFECGF-------SGDLPQFLGTGIADHGWELFCANPKSVNAQVVREFYANIVKEDG
        ER    R          K  +  K + E    +Y   ++ R    E GF        G LP F+   I  H W+ FCA+P+     +VREFYAN+   + 
Subjt:  EREKAEREEREKNEAEDKEREETKKKIEEEQVKYA-ELRKRDFLFECGF-------SGDLPQFLGTGIADHGWELFCANPKSVNAQVVREFYANIVKEDG

Query:  FQVIVRGVEVDWSPSAINALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQKMLPTTHDSTISRER
          V VRGV+V WS  AINA++ + + P   ++E     + + L   +  V   GA+W +S     T   + L   A  W  F++ ++LPTTH  T+S++R
Subjt:  FQVIVRGVEVDWSPSAINALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQKMLPTTHDSTISRER

Query:  VLLAFAILRSLSINVGRIIANEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGYVILFDKGIIDTPNLALLQR---VQEVRQ---------------GG
        +LL  ++L   SINVGR+I +EI  C  +K G LFFP+ IT LCR A  P       L + G ID   +A + +    +  +Q               G 
Subjt:  VLLAFAILRSLSINVGRIIANEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGYVILFDKGIIDTPNLALLQR---VQEVRQ---------------GG

Query:  LIHGINSIIEQLALSTSRQ-------EFAKRQTLTFWNYVKNRDASLKKALQENFSKPYPAVPAFPDDLL
        ++  + ++ ++L+    +Q       +   +Q   FW Y K RD +LKKALQ NF++P P  PAFP ++L
Subjt:  LIHGINSIIEQLALSTSRQ-------EFAKRQTLTFWNYVKNRDASLKKALQENFSKPYPAVPAFPDDLL

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]4.4e-3635.38Show/hide
Query:  VVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQK
        +VREFYAN+   +   + VRGV+V WS  AINA++ + + P   ++E     ++ +L   +  V   GA+W +S     T   + L   A  W  F++ +
Subjt:  VVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQK

Query:  MLPTTHDSTISRERVLLAFAILRSLSINVGRIIANEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGYVILFDKGIIDTPNLALLQR---VQEVRQ---
        +LPTTH   +S++R+LL  ++L   SINVGR+I +EI  C  +K G LFFP+ IT LCR A   V+E    L + G ID   +A + +    +  +Q   
Subjt:  MLPTTHDSTISRERVLLAFAILRSLSINVGRIIANEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGYVILFDKGIIDTPNLALLQR---VQEVRQ---

Query:  ------------GGLIHGINSIIEQLALSTSRQEFAKRQTLTFWNYVKNRDASLKKALQENFSKPYPAVPAFPDDLL
                    G ++  + ++ ++L    S+QE   +Q   FW Y K RD +LKKALQ NF++P P  PAFP ++L
Subjt:  ------------GGLIHGINSIIEQLALSTSRQEFAKRQTLTFWNYVKNRDASLKKALQENFSKPYPAVPAFPDDLL

XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]4.0e-2930.27Show/hide
Query:  RMWHDNKIKSKEFVKGQKVLLYNSRLKLFPGKLKSKWSGPFVVIEVFPHGAVTLQDEKDGRVFKVQKIVAAKIMLEQISRMHRKDFSRAEEQEKVTEDAA
        + W D KI  K       VLL+NSRLKLFPGKLKS+WSG FVV  V+PHGAV L+  K   +FKV      K   E I R          EQ  V  +  
Subjt:  RMWHDNKIKSKEFVKGQKVLLYNSRLKLFPGKLKSKWSGPFVVIEVFPHGAVTLQDEKDGRVFKVQKIVAAKIMLEQISRMHRKDFSRAEEQEKVTEDAA

Query:  AAVVEENLEELQEQTPGEADQRLADTDTAREENVE------ENQEQQAEMVRDEQ------VDAVSERGNEQDQEARVEVIMPEPPKRRRIKRKVGRIQV
            +E+  EL ++  G+  +   D D   +  V+      E+Q+   E    EQ      VD  SE   E    +    +      R R +  V R+  
Subjt:  AAVVEENLEELQEQTPGEADQRLADTDTAREENVE------ENQEQQAEMVRDEQ------VDAVSERGNEQDQEARVEVIMPEPPKRRRIKRKVGRIQV

Query:  IQTDTPSPPSSDSEREKAER--EE--REKNEAEDKEREETKKKIEEE--QVKYAELRKRDFLFECGF---SGDLPQFLGTGIADHGWELFCANPKSVNAQ
         Q +  + PS  ++  + +R  EE   E NE E    E+T  +++    +V+      RD L E GF      +P+++   I ++GWE   A    V+  
Subjt:  IQTDTPSPPSSDSEREKAER--EE--REKNEAEDKEREETKKKIEEE--QVKYAELRKRDFLFECGF---SGDLPQFLGTGIADHGWELFCANPKSVNAQ

Query:  VVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQK
        +V+EFY  I    G +V VRG                        NE+ V PSDEQ+ EA R +      W +S   K + +   +  +A  WM  ++ +
Subjt:  VVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQK

Query:  MLPTTHDSTISRERVLLAFAILRSLSINVGRIIANEISGCWKKKVG
        ++PT++DS+I R R ++ + +++ +  N G +I NEI  C +K  G
Subjt:  MLPTTHDSTISRERVLLAFAILRSLSINVGRIIANEISGCWKKKVG

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)5.1e-3035.38Show/hide
Query:  KEREETKKKIEEEQVKYA-ELRKRDFLFECGF-------SGDLPQFLGTGIADHGWELFCANPKSVNAQVVREFYANIVKEDGFQVIVRGVEVDWSPSAI
        K  +  K + E  + +Y   ++ R    E GF        G LP F+   I  H W+ FCA+P+     +VREFYAN+       V VRGV+V WS  AI
Subjt:  KEREETKKKIEEEQVKYA-ELRKRDFLFECGF-------SGDLPQFLGTGIADHGWELFCANPKSVNAQVVREFYANIVKEDGFQVIVRGVEVDWSPSAI

Query:  NALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQKMLPTTHDSTISRERVLLAFAILRSLSINVGR
        NA++ + + P   ++E     ++  L   +  V + GA+W +S     T   + L   A  W  F++  +LPTTH  T+S++R+LL  ++L   SINVGR
Subjt:  NALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQKMLPTTHDSTISRERVLLAFAILRSLSINVGR

Query:  IIANEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGYVILFDKGIIDTPNLALLQRVQE
        +I +EI  C  +K G LFFP+ IT LCR A  P       L + G ID   +A+ +  QE
Subjt:  IIANEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGYVILFDKGIIDTPNLALLQRVQE

A0A2P5BCG4 Uncharacterized protein (Fragment)1.3e-3832.16Show/hide
Query:  EREKAEREEREKNEAEDKEREETKKKIEEEQVKYA-ELRKRDFLFECGF-------SGDLPQFLGTGIADHGWELFCANPKSVNAQVVREFYANIVKEDG
        ER    R          K  +  K + E    +Y   ++ R    E GF        G LP F+   I  H W+ FCA+P+     +VREFYAN+   + 
Subjt:  EREKAEREEREKNEAEDKEREETKKKIEEEQVKYA-ELRKRDFLFECGF-------SGDLPQFLGTGIADHGWELFCANPKSVNAQVVREFYANIVKEDG

Query:  FQVIVRGVEVDWSPSAINALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQKMLPTTHDSTISRER
          V VRGV+V WS  AINA++ + + P   ++E     + + L   +  V   GA+W +S     T   + L   A  W  F++ ++LPTTH  T+S++R
Subjt:  FQVIVRGVEVDWSPSAINALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQKMLPTTHDSTISRER

Query:  VLLAFAILRSLSINVGRIIANEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGYVILFDKGIIDTPNLALLQR---VQEVRQ---------------GG
        +LL  ++L   SINVGR+I +EI  C  +K G LFFP+ IT LCR A  P       L + G ID   +A + +    +  +Q               G 
Subjt:  VLLAFAILRSLSINVGRIIANEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGYVILFDKGIIDTPNLALLQR---VQEVRQ---------------GG

Query:  LIHGINSIIEQLALSTSRQ-------EFAKRQTLTFWNYVKNRDASLKKALQENFSKPYPAVPAFPDDLL
        ++  + ++ ++L+    +Q       +   +Q   FW Y K RD +LKKALQ NF++P P  PAFP ++L
Subjt:  LIHGINSIIEQLALSTSRQ-------EFAKRQTLTFWNYVKNRDASLKKALQENFSKPYPAVPAFPDDLL

A0A2P5DXM3 Uncharacterized protein2.1e-3635.38Show/hide
Query:  VVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQK
        +VREFYAN+   +   + VRGV+V WS  AINA++ + + P   ++E     ++ +L   +  V   GA+W +S     T   + L   A  W  F++ +
Subjt:  VVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQK

Query:  MLPTTHDSTISRERVLLAFAILRSLSINVGRIIANEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGYVILFDKGIIDTPNLALLQR---VQEVRQ---
        +LPTTH   +S++R+LL  ++L   SINVGR+I +EI  C  +K G LFFP+ IT LCR A   V+E    L + G ID   +A + +    +  +Q   
Subjt:  MLPTTHDSTISRERVLLAFAILRSLSINVGRIIANEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGYVILFDKGIIDTPNLALLQR---VQEVRQ---

Query:  ------------GGLIHGINSIIEQLALSTSRQEFAKRQTLTFWNYVKNRDASLKKALQENFSKPYPAVPAFPDDLL
                    G ++  + ++ ++L    S+QE   +Q   FW Y K RD +LKKALQ NF++P P  PAFP ++L
Subjt:  ------------GGLIHGINSIIEQLALSTSRQEFAKRQTLTFWNYVKNRDASLKKALQENFSKPYPAVPAFPDDLL

A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC1110220071.9e-2930.27Show/hide
Query:  RMWHDNKIKSKEFVKGQKVLLYNSRLKLFPGKLKSKWSGPFVVIEVFPHGAVTLQDEKDGRVFKVQKIVAAKIMLEQISRMHRKDFSRAEEQEKVTEDAA
        + W D KI  K       VLL+NSRLKLFPGKLKS+WSG FVV  V+PHGAV L+  K   +FKV      K   E I R          EQ  V  +  
Subjt:  RMWHDNKIKSKEFVKGQKVLLYNSRLKLFPGKLKSKWSGPFVVIEVFPHGAVTLQDEKDGRVFKVQKIVAAKIMLEQISRMHRKDFSRAEEQEKVTEDAA

Query:  AAVVEENLEELQEQTPGEADQRLADTDTAREENVE------ENQEQQAEMVRDEQ------VDAVSERGNEQDQEARVEVIMPEPPKRRRIKRKVGRIQV
            +E+  EL ++  G+  +   D D   +  V+      E+Q+   E    EQ      VD  SE   E    +    +      R R +  V R+  
Subjt:  AAVVEENLEELQEQTPGEADQRLADTDTAREENVE------ENQEQQAEMVRDEQ------VDAVSERGNEQDQEARVEVIMPEPPKRRRIKRKVGRIQV

Query:  IQTDTPSPPSSDSEREKAER--EE--REKNEAEDKEREETKKKIEEE--QVKYAELRKRDFLFECGF---SGDLPQFLGTGIADHGWELFCANPKSVNAQ
         Q +  + PS  ++  + +R  EE   E NE E    E+T  +++    +V+      RD L E GF      +P+++   I ++GWE   A    V+  
Subjt:  IQTDTPSPPSSDSEREKAER--EE--REKNEAEDKEREETKKKIEEE--QVKYAELRKRDFLFECGF---SGDLPQFLGTGIADHGWELFCANPKSVNAQ

Query:  VVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQK
        +V+EFY  I    G +V VRG                        NE+ V PSDEQ+ EA R +      W +S   K + +   +  +A  WM  ++ +
Subjt:  VVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQK

Query:  MLPTTHDSTISRERVLLAFAILRSLSINVGRIIANEISGCWKKKVG
        ++PT++DS+I R R ++ + +++ +  N G +I NEI  C +K  G
Subjt:  MLPTTHDSTISRERVLLAFAILRSLSINVGRIIANEISGCWKKKVG

W9QTD9 Uncharacterized protein1.8e-3035.16Show/hide
Query:  PQFLGTGIADHGWELFCANPKSVNAQVVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLS
        P F+   I  HGW  FC +P +    +VREFYAN++  +   V V+ V+V ++  AIN+++ ++      Y + A   +DEQL   + EV IEGA WQ+S
Subjt:  PQFLGTGIADHGWELFCANPKSVNAQVVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLS

Query:  KTQKRTFQSAYLKREANTWMGFIRQKMLPTTHDSTISRERVLLAFAILRSLSINVGRIIANEISGC-WKKKVGKLFFPNTITMLCRRAGVPVDEGYVILF
             T     LKR A  W  F+  + +P+TH  T++++RVLL ++IL  +S+N+  I   EI  C   +K G L+FP+ IT L  +A VP  +   I+ 
Subjt:  KTQKRTFQSAYLKREANTWMGFIRQKMLPTTHDSTISRERVLLAFAILRSLSINVGRIIANEISGC-WKKKVGKLFFPNTITMLCRRAGVPVDEGYVILF

Query:  DKGIIDTPNLALLQRVQEV
        + G I T +++ + + + V
Subjt:  DKGIIDTPNLALLQRVQEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTTTCTAAGACTTTTATTGACTACATGACTGAGATATTGAGCCCGAGGCTCTATGGTACCGTGTGCACACAGGTAGAGATCGAGCTCCCGGTGCCTGATACACT
GCCAACGTCTGCTGAAAGTTCCAGACCAAGCTCCAGTTATTTTTCTAAACCGAGTTATGTCAGGGCTTCACCCCATGGTCAACAGGTATCGTTAGAGGAGGACAATGTCC
GTTGGCTTCTCGCCACCTTTCGGACTAAGCTAGCAAGTAGTTTGGGAGGGGGTGTGACACTATCGTCAACAATTGGGACACCACCAATTGATCTTCCCCGCTATAACTCT
GAGATTAGGAACATTATGCTGCTGAGCAACTGGAGGGAGCAAATTTTGTGCTGCAGCAAAACTAGGAACAGAAACTGCCACATCACAGCTCGAGCAATAAGAATGTGGCA
TGACAACAAAATTAAATCTAAGGAGTTTGTCAAGGGTCAAAAAGTTTTGCTTTATAATTCTAGATTAAAATTATTTCCTGGAAAATTAAAATCTAAATGGTCAGGACCGT
TTGTTGTGATTGAGGTTTTCCCCCATGGAGCAGTTACTTTGCAAGATGAAAAAGATGGGAGAGTATTCAAGGTTCAGAAGATTGTTGCAGCAAAGATAATGCTGGAGCAA
ATTTCTCGTATGCATAGGAAGGATTTTAGTAGGGCGGAAGAACAAGAAAAAGTGACAGAGGATGCGGCTGCTGCAGTGGTTGAAGAAAACCTGGAAGAACTGCAAGAACA
GACCCCAGGAGAGGCTGATCAGAGACTTGCGGATACAGATACGGCTCGAGAGGAAAATGTTGAGGAGAATCAAGAACAGCAGGCTGAGATGGTGAGAGACGAGCAGGTAG
ACGCAGTGTCTGAAAGAGGAAACGAGCAGGACCAAGAAGCTCGTGTTGAGGTAATCATGCCCGAACCGCCGAAACGTCGCCGCATAAAAAGAAAGGTCGGCCGTATTCAG
GTGATTCAGACGGATACCCCATCACCACCATCATCAGATTCCGAGAGAGAAAAGGCAGAGCGAGAGGAAAGAGAGAAGAACGAAGCTGAGGACAAAGAAAGAGAAGAAAC
AAAGAAGAAGATTGAGGAAGAGCAAGTAAAATACGCTGAGCTTCGGAAAAGGGATTTCTTGTTCGAGTGCGGTTTCAGTGGTGATCTTCCACAGTTTCTGGGGACCGGTA
TTGCAGACCATGGCTGGGAGCTATTTTGTGCGAATCCGAAATCTGTAAATGCACAGGTGGTGCGTGAATTTTATGCTAACATTGTCAAGGAGGATGGTTTCCAGGTAATT
GTCCGAGGAGTCGAGGTGGATTGGAGTCCAAGTGCTATCAATGCACTGTACAACGTTCAAAACTTCCCCCATGCGGCATATAATGAGATGGCTGTGGCGCCATCTGATGA
GCAACTAAGTGAGGCTGTAAGGGAGGTAGGAATTGAAGGGGCACAGTGGCAGTTATCCAAGACTCAGAAGAGGACATTCCAGTCGGCTTATTTGAAAAGGGAAGCGAACA
CATGGATGGGATTTATTAGACAGAAGATGCTTCCAACAACACATGACTCGACAATCTCCAGGGAACGGGTTCTCCTAGCTTTTGCCATCTTACGGTCTCTCAGTATTAAT
GTAGGGAGGATCATTGCGAATGAGATTTCTGGATGTTGGAAAAAGAAGGTGGGGAAACTATTCTTTCCAAACACAATCACGATGTTGTGCAGAAGAGCAGGGGTTCCAGT
GGATGAGGGATATGTGATCCTGTTTGATAAGGGGATCATAGACACGCCCAATTTGGCACTGCTCCAGCGTGTGCAGGAGGTACGTCAAGGTGGGCTTATCCATGGCATCA
ACTCGATTATAGAACAACTGGCACTTTCGACCAGTAGGCAGGAGTTTGCTAAAAGGCAAACTTTAACCTTCTGGAACTATGTTAAGAATCGGGATGCCAGCTTGAAGAAG
GCGCTGCAAGAGAATTTTTCAAAACCTTATCCAGCCGTTCCAGCATTCCCTGACGATCTGTTGAACCCCTGGATTCCGCCCCCACCTGTTGAAAGAGGAGAAGAGGATGA
TGAAAATGAACCGGGCCAAGAGGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCATTTTCTAAGACTTTTATTGACTACATGACTGAGATATTGAGCCCGAGGCTCTATGGTACCGTGTGCACACAGGTAGAGATCGAGCTCCCGGTGCCTGATACACT
GCCAACGTCTGCTGAAAGTTCCAGACCAAGCTCCAGTTATTTTTCTAAACCGAGTTATGTCAGGGCTTCACCCCATGGTCAACAGGTATCGTTAGAGGAGGACAATGTCC
GTTGGCTTCTCGCCACCTTTCGGACTAAGCTAGCAAGTAGTTTGGGAGGGGGTGTGACACTATCGTCAACAATTGGGACACCACCAATTGATCTTCCCCGCTATAACTCT
GAGATTAGGAACATTATGCTGCTGAGCAACTGGAGGGAGCAAATTTTGTGCTGCAGCAAAACTAGGAACAGAAACTGCCACATCACAGCTCGAGCAATAAGAATGTGGCA
TGACAACAAAATTAAATCTAAGGAGTTTGTCAAGGGTCAAAAAGTTTTGCTTTATAATTCTAGATTAAAATTATTTCCTGGAAAATTAAAATCTAAATGGTCAGGACCGT
TTGTTGTGATTGAGGTTTTCCCCCATGGAGCAGTTACTTTGCAAGATGAAAAAGATGGGAGAGTATTCAAGGTTCAGAAGATTGTTGCAGCAAAGATAATGCTGGAGCAA
ATTTCTCGTATGCATAGGAAGGATTTTAGTAGGGCGGAAGAACAAGAAAAAGTGACAGAGGATGCGGCTGCTGCAGTGGTTGAAGAAAACCTGGAAGAACTGCAAGAACA
GACCCCAGGAGAGGCTGATCAGAGACTTGCGGATACAGATACGGCTCGAGAGGAAAATGTTGAGGAGAATCAAGAACAGCAGGCTGAGATGGTGAGAGACGAGCAGGTAG
ACGCAGTGTCTGAAAGAGGAAACGAGCAGGACCAAGAAGCTCGTGTTGAGGTAATCATGCCCGAACCGCCGAAACGTCGCCGCATAAAAAGAAAGGTCGGCCGTATTCAG
GTGATTCAGACGGATACCCCATCACCACCATCATCAGATTCCGAGAGAGAAAAGGCAGAGCGAGAGGAAAGAGAGAAGAACGAAGCTGAGGACAAAGAAAGAGAAGAAAC
AAAGAAGAAGATTGAGGAAGAGCAAGTAAAATACGCTGAGCTTCGGAAAAGGGATTTCTTGTTCGAGTGCGGTTTCAGTGGTGATCTTCCACAGTTTCTGGGGACCGGTA
TTGCAGACCATGGCTGGGAGCTATTTTGTGCGAATCCGAAATCTGTAAATGCACAGGTGGTGCGTGAATTTTATGCTAACATTGTCAAGGAGGATGGTTTCCAGGTAATT
GTCCGAGGAGTCGAGGTGGATTGGAGTCCAAGTGCTATCAATGCACTGTACAACGTTCAAAACTTCCCCCATGCGGCATATAATGAGATGGCTGTGGCGCCATCTGATGA
GCAACTAAGTGAGGCTGTAAGGGAGGTAGGAATTGAAGGGGCACAGTGGCAGTTATCCAAGACTCAGAAGAGGACATTCCAGTCGGCTTATTTGAAAAGGGAAGCGAACA
CATGGATGGGATTTATTAGACAGAAGATGCTTCCAACAACACATGACTCGACAATCTCCAGGGAACGGGTTCTCCTAGCTTTTGCCATCTTACGGTCTCTCAGTATTAAT
GTAGGGAGGATCATTGCGAATGAGATTTCTGGATGTTGGAAAAAGAAGGTGGGGAAACTATTCTTTCCAAACACAATCACGATGTTGTGCAGAAGAGCAGGGGTTCCAGT
GGATGAGGGATATGTGATCCTGTTTGATAAGGGGATCATAGACACGCCCAATTTGGCACTGCTCCAGCGTGTGCAGGAGGTACGTCAAGGTGGGCTTATCCATGGCATCA
ACTCGATTATAGAACAACTGGCACTTTCGACCAGTAGGCAGGAGTTTGCTAAAAGGCAAACTTTAACCTTCTGGAACTATGTTAAGAATCGGGATGCCAGCTTGAAGAAG
GCGCTGCAAGAGAATTTTTCAAAACCTTATCCAGCCGTTCCAGCATTCCCTGACGATCTGTTGAACCCCTGGATTCCGCCCCCACCTGTTGAAAGAGGAGAAGAGGATGA
TGAAAATGAACCGGGCCAAGAGGACTGA
Protein sequenceShow/hide protein sequence
MSFSKTFIDYMTEILSPRLYGTVCTQVEIELPVPDTLPTSAESSRPSSSYFSKPSYVRASPHGQQVSLEEDNVRWLLATFRTKLASSLGGGVTLSSTIGTPPIDLPRYNS
EIRNIMLLSNWREQILCCSKTRNRNCHITARAIRMWHDNKIKSKEFVKGQKVLLYNSRLKLFPGKLKSKWSGPFVVIEVFPHGAVTLQDEKDGRVFKVQKIVAAKIMLEQ
ISRMHRKDFSRAEEQEKVTEDAAAAVVEENLEELQEQTPGEADQRLADTDTAREENVEENQEQQAEMVRDEQVDAVSERGNEQDQEARVEVIMPEPPKRRRIKRKVGRIQ
VIQTDTPSPPSSDSEREKAEREEREKNEAEDKEREETKKKIEEEQVKYAELRKRDFLFECGFSGDLPQFLGTGIADHGWELFCANPKSVNAQVVREFYANIVKEDGFQVI
VRGVEVDWSPSAINALYNVQNFPHAAYNEMAVAPSDEQLSEAVREVGIEGAQWQLSKTQKRTFQSAYLKREANTWMGFIRQKMLPTTHDSTISRERVLLAFAILRSLSIN
VGRIIANEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGYVILFDKGIIDTPNLALLQRVQEVRQGGLIHGINSIIEQLALSTSRQEFAKRQTLTFWNYVKNRDASLKK
ALQENFSKPYPAVPAFPDDLLNPWIPPPPVERGEEDDENEPGQED