CuGenDBv2

Lag0040738 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0040738
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Protein of unknown function (DUF674)
Genome location: chr13:7833432..7834868
RNA-Seq Expression: Lag0040738
Synteny: Lag0040738
Gene Ontology terms: NA
InterPro domains: IPR007750 - Protein of unknown function DUF674
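
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, the snippet below parses that string and computes the locus span. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; the function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. The source record does not confirm this.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("chr13:7833432..7834868"))
```

Under that assumption the locus spans 1437 bp, longer than the 861-nt CDS listed further down, which would be consistent with the gene model containing introns.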


Homology
GenBank top hits | e value | % identity
KAE8647976.1 hypothetical protein Csa_000363 [Cucumis sativus] | 7.0e-57 | 61.33
Query:  EVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCGSAMLLPNIECSLAAA
        EV+LKLL+D K +RV+FGEADKN+IDFLFNLLSLPLGTV+RLLK + M+G L NLY SVE LN+TY QPNQSKD++L P+V+F  S +LLPNIE    A 
Subjt:  EVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCGSAMLLPNIECSLAAA

Query:  QSRV----GQRDMRRASRRTV------ATLMRACEYREPATSASE--------GGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQEKVVT
        Q ++     +     AS  T       + + R C +  P ++ ++        GG+VKGVVTYMVMDDLSV PMSTIS ITLLNKFNIKEVGAL+EKVVT
Subjt:  QSRV----GQRDMRRASRRTV------ATLMRACEYREPATSASE--------GGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQEKVVT

Query:  LDVNKGVKLLQASLQSKTVLTDVFL
        LD   G+KLL+ASLQSKTVLTDVFL
Subjt:  LDVNKGVKLLQASLQSKTVLTDVFL

XP_004147723.1 uncharacterized protein LOC101207526 [Cucumis sativus] | 4.4e-59 | 61.78
Query:  EVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCGSAMLLPNIECSLAAA
        EV+LKLL+D K +RV+FGEADKN+IDFLFNLLSLPLGTV+RLLK + M+G L NLY SVE LN+TY QPNQSKD++L P+V+F  S +LLPNIE    A 
Subjt:  EVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCGSAMLLPNIECSLAAA

Query:  QSRV----GQRDMRRASRRTV------ATLMRACEYREPATSASE--------GGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQEKVVT
        Q ++     +     AS  T       + + R C +  P ++ ++        GG+VKGVVTYMVMDDLSV PMSTIS ITLLNKFNIKEVGAL+EKVVT
Subjt:  QSRV----GQRDMRRASRRTV------ATLMRACEYREPATSASE--------GGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQEKVVT

Query:  LDVNKGVKLLQASLQSKTVLTDVFL
        LDV++G+KLL+ASLQSKTVLTDVFL
Subjt:  LDVNKGVKLLQASLQSKTVLTDVFL

XP_008461735.1 PREDICTED: uncharacterized protein LOC103500268 [Cucumis melo] | 1.8e-60 | 63.23
Query:  EVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCGSAMLLPNIECSLAAA
        EVRLKLL+D + +RV+FGEADKNLIDFLFNLLSLPLGTV+RLLK + M G L NLY+SVE LN+TY QPNQSKD +L P+V+F  S +LLPNIE      
Subjt:  EVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCGSAMLLPNIECSLAAA

Query:  QSRV-GQR-DMRRASRRTVA------TLMRACEYREPATSASE--------GGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQEKVVTLD
        +  + G R     AS  T         + R C    P  + ++        GG+VKGVVTYMVMDDLSV PMSTIS ITLLNKFNIKEVGAL+EKV+TLD
Subjt:  QSRV-GQR-DMRRASRRTVA------TLMRACEYREPATSASE--------GGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQEKVVTLD

Query:  VNKGVKLLQASLQSKTVLTDVFL
        VN+GVKLL+ASLQSKTVLTDVFL
Subjt:  VNKGVKLLQASLQSKTVLTDVFL

XP_022138964.1 uncharacterized protein LOC111010013 [Momordica charantia] | 1.8e-65 | 62.98
Query:  MAEKEVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCGSA--MLLPNIE
        MA   VRLKLL+D K +RV+FGEADKN+IDFLFNLLSLPLGTV+RLLK + M GCLGNLY+SVETLN+TY QPNQSKD +L P+V+FCGS+  MLLPNI+
Subjt:  MAEKEVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCGSA--MLLPNIE

Query:  CSLAAAQSRVGQRDMRRASRRTVA----TLMRACE---------YREPATSAS--------EGGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEV
         S AA    +         RR+V+     +   C           + P+ SA+        EGG+VKGVVTYMVMDDLSV PMSTIS I LLNKFN+KEV
Subjt:  CSLAAAQSRVGQRDMRRASRRTVA----TLMRACE---------YREPATSAS--------EGGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEV

Query:  GALQEKVVTLDVNKGVKLLQASLQSKTVLTDVFLR
        GAL+EKVVTLDVN+GVKLL+ASL SKTVLTDVF+R
Subjt:  GALQEKVVTLDVNKGVKLLQASLQSKTVLTDVFLR

XP_022139194.1 uncharacterized protein LOC111010162 [Momordica charantia] | 1.0e-52 | 57.33
Query:  MAEKEVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCG-SAMLLPNIEC
        MA   VRLK L+D K +RV+FGEAD+N IDFLFNLLSLPLGTVVR LK + M GCLGNLY+SVETLN+TY QPNQSKD++L P   +CG SAMLLP+   
Subjt:  MAEKEVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCG-SAMLLPNIEC

Query:  SLAAAQSRVGQRDMRRASRRTVA------------TLMRACEYREPATSA-SEGGYVK-GVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQEKVV
        S  +    V Q +         A            ++ +   + +P      EGG+VK G VTYMVMDDLSV+PMS ISC+ LLNKFN+ +VG L+EK++
Subjt:  SLAAAQSRVGQRDMRRASRRTVA------------TLMRACEYREPATSA-SEGGYVK-GVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQEKVV

Query:  TLDVNKGVKLLQASLQSKTVLTDVF
        TLDVN+GVKLL+ASLQS T+LTDVF
Subjt:  TLDVNKGVKLLQASLQSKTVLTDVF

TrEMBL top hits | e value | % identity
A0A1Q3CAG0 DUF674 domain-containing protein (Fragment) | 2.1e-51 | 58.8
Query:  LKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCGSAM--LLPNIECS-----
        LKLL+D K  RV+F EA K+ +DFLFNLLSLP+GTV+RLL  + M GCLGNLYDS+E L +TY QPNQ KD +LNP++      +  LLP IE S     
Subjt:  LKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCGSAM--LLPNIECS-----

Query:  LAAAQSRVGQRDMRRASRRTVATLMR-ACEYREP----ATSASEGGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQEKVVTLDVNKGVKL
             S V     RRA       LMR    Y  P     TS+SEGGYVKGVVTYMVMDDL V PMSTIS ITLLN+FNI EVG L+EKV+ + +++GVKL
Subjt:  LAAAQSRVGQRDMRRASRRTVATLMR-ACEYREP----ATSASEGGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQEKVVTLDVNKGVKL

Query:  LQASLQSKTVLTDVFL
        L+ASLQSKT LTDVFL
Subjt:  LQASLQSKTVLTDVFL

A0A1S3CGQ2 uncharacterized protein LOC103500268 | 8.6e-61 | 63.23
Query:  EVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCGSAMLLPNIECSLAAA
        EVRLKLL+D + +RV+FGEADKNLIDFLFNLLSLPLGTV+RLLK + M G L NLY+SVE LN+TY QPNQSKD +L P+V+F  S +LLPNIE      
Subjt:  EVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCGSAMLLPNIECSLAAA

Query:  QSRV-GQR-DMRRASRRTVA------TLMRACEYREPATSASE--------GGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQEKVVTLD
        +  + G R     AS  T         + R C    P  + ++        GG+VKGVVTYMVMDDLSV PMSTIS ITLLNKFNIKEVGAL+EKV+TLD
Subjt:  QSRV-GQR-DMRRASRRTVA------TLMRACEYREPATSASE--------GGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQEKVVTLD

Query:  VNKGVKLLQASLQSKTVLTDVFL
        VN+GVKLL+ASLQSKTVLTDVFL
Subjt:  VNKGVKLLQASLQSKTVLTDVFL

A0A5A7U8V2 DUF674 domain-containing protein | 8.6e-61 | 63.23
Query:  EVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCGSAMLLPNIECSLAAA
        EVRLKLL+D + +RV+FGEADKNLIDFLFNLLSLPLGTV+RLLK + M G L NLY+SVE LN+TY QPNQSKD +L P+V+F  S +LLPNIE      
Subjt:  EVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCGSAMLLPNIECSLAAA

Query:  QSRV-GQR-DMRRASRRTVA------TLMRACEYREPATSASE--------GGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQEKVVTLD
        +  + G R     AS  T         + R C    P  + ++        GG+VKGVVTYMVMDDLSV PMSTIS ITLLNKFNIKEVGAL+EKV+TLD
Subjt:  QSRV-GQR-DMRRASRRTVA------TLMRACEYREPATSASE--------GGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQEKVVTLD

Query:  VNKGVKLLQASLQSKTVLTDVFL
        VN+GVKLL+ASLQSKTVLTDVFL
Subjt:  VNKGVKLLQASLQSKTVLTDVFL

A0A6J1CBJ8 uncharacterized protein LOC111010013 | 8.9e-66 | 62.98
Query:  MAEKEVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCGSA--MLLPNIE
        MA   VRLKLL+D K +RV+FGEADKN+IDFLFNLLSLPLGTV+RLLK + M GCLGNLY+SVETLN+TY QPNQSKD +L P+V+FCGS+  MLLPNI+
Subjt:  MAEKEVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCGSA--MLLPNIE

Query:  CSLAAAQSRVGQRDMRRASRRTVA----TLMRACE---------YREPATSAS--------EGGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEV
         S AA    +         RR+V+     +   C           + P+ SA+        EGG+VKGVVTYMVMDDLSV PMSTIS I LLNKFN+KEV
Subjt:  CSLAAAQSRVGQRDMRRASRRTVA----TLMRACE---------YREPATSAS--------EGGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEV

Query:  GALQEKVVTLDVNKGVKLLQASLQSKTVLTDVFLR
        GAL+EKVVTLDVN+GVKLL+ASL SKTVLTDVF+R
Subjt:  GALQEKVVTLDVNKGVKLLQASLQSKTVLTDVFLR

A0A6J1CC87 uncharacterized protein LOC111010162 | 5.1e-53 | 57.33
Query:  MAEKEVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCG-SAMLLPNIEC
        MA   VRLK L+D K +RV+FGEAD+N IDFLFNLLSLPLGTVVR LK + M GCLGNLY+SVETLN+TY QPNQSKD++L P   +CG SAMLLP+   
Subjt:  MAEKEVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCG-SAMLLPNIEC

Query:  SLAAAQSRVGQRDMRRASRRTVA------------TLMRACEYREPATSA-SEGGYVK-GVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQEKVV
        S  +    V Q +         A            ++ +   + +P      EGG+VK G VTYMVMDDLSV+PMS ISC+ LLNKFN+ +VG L+EK++
Subjt:  SLAAAQSRVGQRDMRRASRRTVA------------TLMRACEYREPATSA-SEGGYVK-GVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQEKVV

Query:  TLDVNKGVKLLQASLQSKTVLTDVF
        TLDVN+GVKLL+ASLQS T+LTDVF
Subjt:  TLDVNKGVKLLQASLQSKTVLTDVF

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT3G09110.1 Protein of unknown function (DUF674) | 5.1e-13 | 27.83
Query:  SKVMAEKEVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEK-----MSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCGS--
        +K   + +  L+LL+D++  RV+  EA K+ +D L +LL+LP+GT+VRLL+  +     + GCL NLY SV  ++   ++    K  +L+PR +  GS  
Subjt:  SKVMAEKEVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEK-----MSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCGS--

Query:  ---AMLLPNIECSLAAAQSRVGQRDMRRASRRTVATLMRAC---EYRE--PATSASEGGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQE
            + + + E +           +  R     V+T+   C    +RE       ++G +     ++++ DDL V   S    + +LN F       LQE
Subjt:  ---AMLLPNIECSLAAAQSRVGQRDMRRASRRTVATLMRAC---EYRE--PATSASEGGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQE

Query:  KVVTLDVNKGVKLLQASLQSKTVLTDVFLR
         ++ +   + + LL     S+  LTD FLR
Subjt:  KVVTLDVNKGVKLLQASLQSKTVLTDVFLR

AT5G01120.1 Protein of unknown function (DUF674) | 7.3e-12 | 27.9
Query:  VRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMS-----GCLGNLYDSVETLNETYWQPNQSKDTVLNP---RVTFCGSAMLLPN-
        + LKLL+D++  +VVF EA  + +D LF+  +LP+GT+VRLL+    S     GC  N+Y SV ++   ++     K  +L P       C +  L  + 
Subjt:  VRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMS-----GCLGNLYDSVETLNETYWQPNQSKDTVLNP---RVTFCGSAMLLPN-

Query:  ---IECSLAAAQSRVGQRDMR----RASRRTVATLMRACEYREPATSASEGG---------YVKGVVT-YMVMDDLSVMPMSTISCITLLNKFNIKEVGA
            +C +     R GQ        + SR +    M      E      EGG         +V+G  T +++ DDL V   S  S + +L      +   
Subjt:  ---IECSLAAAQSRVGQRDMR----RASRRTVATLMRACEYREPATSASEGG---------YVKGVVT-YMVMDDLSVMPMSTISCITLLNKFNIKEVGA

Query:  LQEKVVTLDVNKGVKLLQASLQSKTVLTDVFLR
        L E ++ +++ +   LL     S T LTD FL+
Subjt:  LQEKVVTLDVNKGVKLLQASLQSKTVLTDVFLR

AT5G01130.1 Protein of unknown function (DUF674) | 2.1e-14 | 30.77
Query:  EKEVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMS-----GCLGNLYDSVETLNETYWQPNQSKDTVLNPR-VTFCGSAMLLPN
        E +V L+L +D++  +VV  EA K  +D LF+LL+LP+GT++RLL+  + S     GC  NLY SV  +    ++ +  K  +L+PR V       L+ N
Subjt:  EKEVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMS-----GCLGNLYDSVETLNETYWQPNQSKDTVLNPR-VTFCGSAMLLPN

Query:  I---ECSLAAAQSRVGQRDMRRASRRTVATLMRACEYREP---ATSASEGGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQEKVVTLDVN
        I   E  L               SR    + M   E++EP     ++S    V GV  +++ DDL V   ST   +  L      ++  L+E +V +   
Subjt:  I---ECSLAAAQSRVGQRDMRRASRRTVATLMRACEYREP---ATSASEGGYVKGVVTYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQEKVVTLDVN

Query:  KGVKLLQASLQSKTVLTDVFL
        + + LL+    SK  LT+ FL
Subjt:  KGVKLLQASLQSKTVLTDVFL

AT5G43240.1 Protein of unknown function (DUF674) | 7.3e-12 | 27.2
Query:  SKVMAEKEVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMS-----GCLGNLYDSVETLNETYWQPNQSKDTVLNP---RVTFCG
        S  +    ++LKLL+D++  +VVF EA K+ +D LF+  +LP+GT+VRLL+  K S     GC  N+Y SV ++   ++     K  +L P       C 
Subjt:  SKVMAEKEVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMS-----GCLGNLYDSVETLNETYWQPNQSKDTVLNP---RVTFCG

Query:  SAML-LPNIECSLAAAQSRVGQRDMRRASRRTVATLMRAC--------EYREPATSASEGGYVKGVV-------TYMVMDDLSVMPMSTISCITLLNKFN
        +  L + + E +      +  +R+    S     T   +C        +       AS G  V+G V       ++M+ DDL V   S    + +L    
Subjt:  SAML-LPNIECSLAAAQSRVGQRDMRRASRRTVATLMRAC--------EYREPATSASEGGYVKGVV-------TYMVMDDLSVMPMSTISCITLLNKFN

Query:  IKEVGALQEKVVTLDVNKGVKLLQASLQSKTVLTDVFLR
          +   L EK+  +++ +   LL+    S   LTD FL+
Subjt:  IKEVGALQEKVVTLDVNKGVKLLQASLQSKTVLTDVFLR

AT5G43240.3 Protein of unknown function (DUF674) | 7.3e-12 | 27.2
Query:  SKVMAEKEVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMS-----GCLGNLYDSVETLNETYWQPNQSKDTVLNP---RVTFCG
        S  +    ++LKLL+D++  +VVF EA K+ +D LF+  +LP+GT+VRLL+  K S     GC  N+Y SV ++   ++     K  +L P       C 
Subjt:  SKVMAEKEVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLSLPLGTVVRLLKNEKMS-----GCLGNLYDSVETLNETYWQPNQSKDTVLNP---RVTFCG

Query:  SAML-LPNIECSLAAAQSRVGQRDMRRASRRTVATLMRAC--------EYREPATSASEGGYVKGVV-------TYMVMDDLSVMPMSTISCITLLNKFN
        +  L + + E +      +  +R+    S     T   +C        +       AS G  V+G V       ++M+ DDL V   S    + +L    
Subjt:  SAML-LPNIECSLAAAQSRVGQRDMRRASRRTVATLMRAC--------EYREPATSASEGGYVKGVV-------TYMVMDDLSVMPMSTISCITLLNKFN

Query:  IKEVGALQEKVVTLDVNKGVKLLQASLQSKTVLTDVFLR
          +   L EK+  +++ +   LL+    S   LTD FL+
Subjt:  IKEVGALQEKVVTLDVNKGVKLLQASLQSKTVLTDVFLR


Sequences
CDS sequence
ATGGTGGCTCAACTGTGTTTGTCCAAGAGGTATTCACAATACTCCAACTTCAGAGGAGTAAGAATGGATAAGAAGCTCTCTGTCGCTTTCATGTTATTGAAAAGGATATC
CATGAGTTTTACTTGTGCAAATCAACTGCTCGTTCCCATTGCCGCCGTTCTATTTCTGATGGTCCTAATGCGCTATGTCCAAATATATAGTAATAGTAAAAGTAAAGTAA
TGGCCGAAAAGGAAGTGAGATTGAAACTTCTAGTGGACAAAAAATCACGAAGAGTTGTTTTCGGCGAAGCGGACAAAAACCTGATCGACTTCCTTTTCAACTTACTTTCC
CTCCCACTTGGGACAGTGGTCAGGCTCTTGAAAAACGAAAAAATGTCAGGGTGCTTGGGGAATTTGTATGACAGTGTTGAAACTTTGAACGAAACGTATTGGCAGCCAAA
TCAGAGCAAAGACACCGTTTTGAACCCTAGAGTTACATTCTGTGGTTCGGCCATGCTTCTGCCTAATATTGAATGTTCCTTAGCTGCAGCCCAATCTCGCGTAGGTCAAC
GAGATATGAGGCGCGCGAGCAGACGCACCGTTGCAACTTTAATGCGCGCTTGTGAATACAGAGAACCCGCTACGAGTGCTTCAGAGGGAGGGTATGTGAAAGGAGTGGTG
ACTTACATGGTGATGGATGATTTGAGTGTGATGCCAATGTCCACCATCTCCTGCATTACTCTCTTGAACAAGTTCAATATCAAAGAAGTTGGTGCTTTGCAGGAGAAGGT
CGTCACTCTGGATGTCAATAAGGGTGTGAAATTGCTCCAGGCCTCTCTACAATCCAAGACTGTTCTCACAGATGTCTTCCTCAGGAGTTAA
mRNA sequence
ATGGTGGCTCAACTGTGTTTGTCCAAGAGGTATTCACAATACTCCAACTTCAGAGGAGTAAGAATGGATAAGAAGCTCTCTGTCGCTTTCATGTTATTGAAAAGGATATC
CATGAGTTTTACTTGTGCAAATCAACTGCTCGTTCCCATTGCCGCCGTTCTATTTCTGATGGTCCTAATGCGCTATGTCCAAATATATAGTAATAGTAAAAGTAAAGTAA
TGGCCGAAAAGGAAGTGAGATTGAAACTTCTAGTGGACAAAAAATCACGAAGAGTTGTTTTCGGCGAAGCGGACAAAAACCTGATCGACTTCCTTTTCAACTTACTTTCC
CTCCCACTTGGGACAGTGGTCAGGCTCTTGAAAAACGAAAAAATGTCAGGGTGCTTGGGGAATTTGTATGACAGTGTTGAAACTTTGAACGAAACGTATTGGCAGCCAAA
TCAGAGCAAAGACACCGTTTTGAACCCTAGAGTTACATTCTGTGGTTCGGCCATGCTTCTGCCTAATATTGAATGTTCCTTAGCTGCAGCCCAATCTCGCGTAGGTCAAC
GAGATATGAGGCGCGCGAGCAGACGCACCGTTGCAACTTTAATGCGCGCTTGTGAATACAGAGAACCCGCTACGAGTGCTTCAGAGGGAGGGTATGTGAAAGGAGTGGTG
ACTTACATGGTGATGGATGATTTGAGTGTGATGCCAATGTCCACCATCTCCTGCATTACTCTCTTGAACAAGTTCAATATCAAAGAAGTTGGTGCTTTGCAGGAGAAGGT
CGTCACTCTGGATGTCAATAAGGGTGTGAAATTGCTCCAGGCCTCTCTACAATCCAAGACTGTTCTCACAGATGTCTTCCTCAGGAGTTAA
Protein sequence
MVAQLCLSKRYSQYSNFRGVRMDKKLSVAFMLLKRISMSFTCANQLLVPIAAVLFLMVLMRYVQIYSNSKSKVMAEKEVRLKLLVDKKSRRVVFGEADKNLIDFLFNLLS
LPLGTVVRLLKNEKMSGCLGNLYDSVETLNETYWQPNQSKDTVLNPRVTFCGSAMLLPNIECSLAAAQSRVGQRDMRRASRRTVATLMRACEYREPATSASEGGYVKGVV
TYMVMDDLSVMPMSTISCITLLNKFNIKEVGALQEKVVTLDVNKGVKLLQASLQSKTVLTDVFLRS
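
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code; the record does not say which translation table was used. The names `CODON_TABLE` and `translate` are illustrative only. Codons containing ambiguous bases are rendered as `X`.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string into one-letter amino acids ('*' marks a stop)."""
    # Drop line breaks and other whitespace so wrapped sequences can be pasted in.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


if __name__ == "__main__":
    # First 30 nt and last 12 nt of the CDS listed above.
    print(translate("ATGGTGGCTCAACTGTGTTTGTCCAAGAGG"))
    print(translate("CTCAGGAGTTAA"))
```

Pasting the full CDS into `translate` and stripping the trailing `*` should give a string that can be compared directly with the protein sequence in this record.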