; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0037402 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0037402
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDUF1985 domain-containing protein
Genome locationchr2:5896422..5898792
RNA-Seq ExpressionLag0037402
SyntenyLag0037402
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038883715.1 uncharacterized protein LOC120074618 isoform X1 [Benincasa hispida]1.6e-3435.69Show/hide
Query:  KIPLQLITHLIRRLCVSPNDDTIAFNLEGNVIQFGLKEFSIITGLNCNPFPVENIDDGDELDSEIKQSFFKHKDTIDRKDLGTAFSICQSPLLDEDDDPL
        K   QL  HL+RR C S N + + FNLEG + +FG+KEFS+ITGLNC   P   ID       +  + +F  + TI R  L   F+      +D+  +  
Subjt:  KIPLQLITHLIRRLCVSPNDDTIAFNLEGNVIQFGLKEFSIITGLNCNPFPVENIDDGDELDSEIKQSFFKHKDTIDRKDLGTAFSICQSPLLDEDDDPL

Query:  IVKFSSLYLLECLIVPKQHHNRVSWRHVKIFDNEELILNYPWGRLSYDLTIEYIKKA-STNTTGAIYLQGFPIVLIYWALEIIPKLSEETLGFARKIQSP
        +VK + LY+LE  I+ KQ    ++  +  + D++E    YPWGR+SY++TI+++KKA  +N   AI + G    L+ WA E IP L   +   A ++   
Subjt:  IVKFSSLYLLECLIVPKQHHNRVSWRHVKIFDNEELILNYPWGRLSYDLTIEYIKKA-STNTTGAIYLQGFPIVLIYWALEIIPKLSEETLGFARKIQSP

Query:  TSTPRMTHWEILDTPDWSQFDEKVFSAEDFSVVPLVPTDEEMASKNMEHF--IKEMEKEKEERMEREKN
          TPRM +W     P+W    EKVF ++ F V PL+ T  EM    M  F  +K  E++ + ++++E N
Subjt:  TSTPRMTHWEILDTPDWSQFDEKVFSAEDFSVVPLVPTDEEMASKNMEHF--IKEMEKEKEERMEREKN

XP_038883716.1 uncharacterized protein LOC120074618 isoform X2 [Benincasa hispida]1.6e-3435.69Show/hide
Query:  KIPLQLITHLIRRLCVSPNDDTIAFNLEGNVIQFGLKEFSIITGLNCNPFPVENIDDGDELDSEIKQSFFKHKDTIDRKDLGTAFSICQSPLLDEDDDPL
        K   QL  HL+RR C S N + + FNLEG + +FG+KEFS+ITGLNC   P   ID       +  + +F  + TI R  L   F+      +D+  +  
Subjt:  KIPLQLITHLIRRLCVSPNDDTIAFNLEGNVIQFGLKEFSIITGLNCNPFPVENIDDGDELDSEIKQSFFKHKDTIDRKDLGTAFSICQSPLLDEDDDPL

Query:  IVKFSSLYLLECLIVPKQHHNRVSWRHVKIFDNEELILNYPWGRLSYDLTIEYIKKA-STNTTGAIYLQGFPIVLIYWALEIIPKLSEETLGFARKIQSP
        +VK + LY+LE  I+ KQ    ++  +  + D++E    YPWGR+SY++TI+++KKA  +N   AI + G    L+ WA E IP L   +   A ++   
Subjt:  IVKFSSLYLLECLIVPKQHHNRVSWRHVKIFDNEELILNYPWGRLSYDLTIEYIKKA-STNTTGAIYLQGFPIVLIYWALEIIPKLSEETLGFARKIQSP

Query:  TSTPRMTHWEILDTPDWSQFDEKVFSAEDFSVVPLVPTDEEMASKNMEHF--IKEMEKEKEERMEREKN
          TPRM +W     P+W    EKVF ++ F V PL+ T  EM    M  F  +K  E++ + ++++E N
Subjt:  TSTPRMTHWEILDTPDWSQFDEKVFSAEDFSVVPLVPTDEEMASKNMEHF--IKEMEKEKEERMEREKN

XP_038883717.1 uncharacterized protein LOC120074618 isoform X3 [Benincasa hispida]1.6e-3435.69Show/hide
Query:  KIPLQLITHLIRRLCVSPNDDTIAFNLEGNVIQFGLKEFSIITGLNCNPFPVENIDDGDELDSEIKQSFFKHKDTIDRKDLGTAFSICQSPLLDEDDDPL
        K   QL  HL+RR C S N + + FNLEG + +FG+KEFS+ITGLNC   P   ID       +  + +F  + TI R  L   F+      +D+  +  
Subjt:  KIPLQLITHLIRRLCVSPNDDTIAFNLEGNVIQFGLKEFSIITGLNCNPFPVENIDDGDELDSEIKQSFFKHKDTIDRKDLGTAFSICQSPLLDEDDDPL

Query:  IVKFSSLYLLECLIVPKQHHNRVSWRHVKIFDNEELILNYPWGRLSYDLTIEYIKKA-STNTTGAIYLQGFPIVLIYWALEIIPKLSEETLGFARKIQSP
        +VK + LY+LE  I+ KQ    ++  +  + D++E    YPWGR+SY++TI+++KKA  +N   AI + G    L+ WA E IP L   +   A ++   
Subjt:  IVKFSSLYLLECLIVPKQHHNRVSWRHVKIFDNEELILNYPWGRLSYDLTIEYIKKA-STNTTGAIYLQGFPIVLIYWALEIIPKLSEETLGFARKIQSP

Query:  TSTPRMTHWEILDTPDWSQFDEKVFSAEDFSVVPLVPTDEEMASKNMEHF--IKEMEKEKEERMEREKN
          TPRM +W     P+W    EKVF ++ F V PL+ T  EM    M  F  +K  E++ + ++++E N
Subjt:  TSTPRMTHWEILDTPDWSQFDEKVFSAEDFSVVPLVPTDEEMASKNMEHF--IKEMEKEKEERMEREKN

XP_038883718.1 uncharacterized protein LOC120074618 isoform X4 [Benincasa hispida]1.6e-3435.69Show/hide
Query:  KIPLQLITHLIRRLCVSPNDDTIAFNLEGNVIQFGLKEFSIITGLNCNPFPVENIDDGDELDSEIKQSFFKHKDTIDRKDLGTAFSICQSPLLDEDDDPL
        K   QL  HL+RR C S N + + FNLEG + +FG+KEFS+ITGLNC   P   ID       +  + +F  + TI R  L   F+      +D+  +  
Subjt:  KIPLQLITHLIRRLCVSPNDDTIAFNLEGNVIQFGLKEFSIITGLNCNPFPVENIDDGDELDSEIKQSFFKHKDTIDRKDLGTAFSICQSPLLDEDDDPL

Query:  IVKFSSLYLLECLIVPKQHHNRVSWRHVKIFDNEELILNYPWGRLSYDLTIEYIKKA-STNTTGAIYLQGFPIVLIYWALEIIPKLSEETLGFARKIQSP
        +VK + LY+LE  I+ KQ    ++  +  + D++E    YPWGR+SY++TI+++KKA  +N   AI + G    L+ WA E IP L   +   A ++   
Subjt:  IVKFSSLYLLECLIVPKQHHNRVSWRHVKIFDNEELILNYPWGRLSYDLTIEYIKKA-STNTTGAIYLQGFPIVLIYWALEIIPKLSEETLGFARKIQSP

Query:  TSTPRMTHWEILDTPDWSQFDEKVFSAEDFSVVPLVPTDEEMASKNMEHF--IKEMEKEKEERMEREKN
          TPRM +W     P+W    EKVF ++ F V PL+ T  EM    M  F  +K  E++ + ++++E N
Subjt:  TSTPRMTHWEILDTPDWSQFDEKVFSAEDFSVVPLVPTDEEMASKNMEHF--IKEMEKEKEERMEREKN

XP_038883719.1 uncharacterized protein LOC120074618 isoform X5 [Benincasa hispida]1.6e-3435.69Show/hide
Query:  KIPLQLITHLIRRLCVSPNDDTIAFNLEGNVIQFGLKEFSIITGLNCNPFPVENIDDGDELDSEIKQSFFKHKDTIDRKDLGTAFSICQSPLLDEDDDPL
        K   QL  HL+RR C S N + + FNLEG + +FG+KEFS+ITGLNC   P   ID       +  + +F  + TI R  L   F+      +D+  +  
Subjt:  KIPLQLITHLIRRLCVSPNDDTIAFNLEGNVIQFGLKEFSIITGLNCNPFPVENIDDGDELDSEIKQSFFKHKDTIDRKDLGTAFSICQSPLLDEDDDPL

Query:  IVKFSSLYLLECLIVPKQHHNRVSWRHVKIFDNEELILNYPWGRLSYDLTIEYIKKA-STNTTGAIYLQGFPIVLIYWALEIIPKLSEETLGFARKIQSP
        +VK + LY+LE  I+ KQ    ++  +  + D++E    YPWGR+SY++TI+++KKA  +N   AI + G    L+ WA E IP L   +   A ++   
Subjt:  IVKFSSLYLLECLIVPKQHHNRVSWRHVKIFDNEELILNYPWGRLSYDLTIEYIKKA-STNTTGAIYLQGFPIVLIYWALEIIPKLSEETLGFARKIQSP

Query:  TSTPRMTHWEILDTPDWSQFDEKVFSAEDFSVVPLVPTDEEMASKNMEHF--IKEMEKEKEERMEREKN
          TPRM +W     P+W    EKVF ++ F V PL+ T  EM    M  F  +K  E++ + ++++E N
Subjt:  TSTPRMTHWEILDTPDWSQFDEKVFSAEDFSVVPLVPTDEEMASKNMEHF--IKEMEKEKEERMEREKN

TrEMBL top hitse value%identityAlignment
A0A5A7TFY0 Uncharacterized protein4.1e-2835.89Show/hide
Query:  IFDNEELILNYPWGRLSYDLTIEYIKKASTNTTGAIYLQGFPIVLIYWALEIIPKLSEETLGFARKIQSPTSTPRMTHWEILDTPDWSQFDEKVFSAEDF
        + D+EE +LN+PWGRLS++LT+EY++K + + T    LQ FP +L+ WALEIIPKLSE   G A +I     +PR+ +W++   P W       F + DF
Subjt:  IFDNEELILNYPWGRLSYDLTIEYIKKASTNTTGAIYLQGFPIVLIYWALEIIPKLSEETLGFARKIQSPTSTPRMTHWEILDTPDWSQFDEKVFSAEDF

Query:  SVVPLVPTDEEMASKNMEHFIKEMEKEKEERMEREKNEMEKKRHEEEEEKRKREENEENEKRKREEKEEEEKREQGKNEKEKERISLLEHDIKNLKEGQK
        +V+P  PT +E+ S N + F+ +             NE+EK+R EE E        +EN+K   EE   +E ++  + E     I  ++ D+++LK    
Subjt:  SVVPLVPTDEEMASKNMEHFIKEMEKEKEERMEREKNEMEKKRHEEEEEKRKREENEENEKRKREEKEEEEKREQGKNEKEKERISLLEHDIKNLKEGQK

Query:  ELKNGQKEILRLLNNLLEIVTPKDFSEKQSQESLHKNCPQTSSLEKLE---FMHATDQFIQVAEASEQVANKDPID----NIDRIVH
         ++NGQ E+L LLNN++ I+  +  +EKQSQ S H+N   T SL+ LE    +H  D      E  EQ  N + ++    NID+ ++
Subjt:  ELKNGQKEILRLLNNLLEIVTPKDFSEKQSQESLHKNCPQTSSLEKLE---FMHATDQFIQVAEASEQVANKDPID----NIDRIVH

A0A5D3CVC6 Uncharacterized protein1.3e-2936.24Show/hide
Query:  IFDNEELILNYPWGRLSYDLTIEYIKKASTNTTGAIYLQGFPIVLIYWALEIIPKLSEETLGFARKIQSPTSTPRMTHWEILDTPDWSQFDEKVFSAEDF
        + D+EE +LN+PWGRLS +LT+EY++KA+ + T    LQGFP +L+ WALEIIPKLS+   G A +I+    +PR+ +W++   P W+      F + DF
Subjt:  IFDNEELILNYPWGRLSYDLTIEYIKKASTNTTGAIYLQGFPIVLIYWALEIIPKLSEETLGFARKIQSPTSTPRMTHWEILDTPDWSQFDEKVFSAEDF

Query:  SVVPLVPTDEEMASKNMEHFIKEMEKEKEERMEREKNEMEKKRHEEEEEKRKREENEENEKRKREEKEEEEKREQGKNEKEKERISLLEHDIKNLKEGQK
        +V+P  PT +E+ S N + F+ +             NE+EK+R EE E        +EN+K   EE   +E ++  + E     I  ++ D+++LK    
Subjt:  SVVPLVPTDEEMASKNMEHFIKEMEKEKEERMEREKNEMEKKRHEEEEEKRKREENEENEKRKREEKEEEEKREQGKNEKEKERISLLEHDIKNLKEGQK

Query:  ELKNGQKEILRLLNNLLEIVTPKDFSEKQSQESLHKNCPQTSSLEKLE---FMHATDQFIQVAEASEQVANKDPID----NIDRIVH
         ++NGQ E+L LLNN++ I+  +  +EKQSQ S H+N   T SL++LE    +H  D      E  EQ  N + ++    NID+ ++
Subjt:  ELKNGQKEILRLLNNLLEIVTPKDFSEKQSQESLHKNCPQTSSLEKLE---FMHATDQFIQVAEASEQVANKDPID----NIDRIVH

A0A5D3CVR6 Uncharacterized protein2.7e-2733.93Show/hide
Query:  IFDNEELILNYPWGRLSYDLTIEYIKKASTNTTGAIYLQGFPIVLIYWALEIIPKLSEETLGFARKIQSPTSTPRMTHWEILDTPDWSQFDEKVFSAEDF
        + D+EE +LN+PWGRLS++LT+EY++KA+ + T    LQGFP VL+ WALEIIPKLSE   G A +I     + R+ +W++   P W+      F + DF
Subjt:  IFDNEELILNYPWGRLSYDLTIEYIKKASTNTTGAIYLQGFPIVLIYWALEIIPKLSEETLGFARKIQSPTSTPRMTHWEILDTPDWSQFDEKVFSAEDF

Query:  SVVPLVPTDEEMASKNMEHFIKEMEKEKEERMEREKNEMEKKRHEEEEEKRKREENEENEKRKREEKEEEEKREQGKNEKEKERISLLEHDIKNLKEGQK
        +V+P  PT +E+ S N + F+ +             NE+EK+R EE E        +EN+K   EE   +E ++  + E     I  ++ D+++LK    
Subjt:  SVVPLVPTDEEMASKNMEHFIKEMEKEKEERMEREKNEMEKKRHEEEEEKRKREENEENEKRKREEKEEEEKREQGKNEKEKERISLLEHDIKNLKEGQK

Query:  ELKNGQKEILRLLNNLLEIVTPKDFSEKQSQESLHKNCPQTSSLEKLE---FMHATDQFIQVAEASEQVANKDPID----NIDRIVHYATSYAAPEIEDV
         ++NGQ E+L LLNN++ I+  +  +EKQSQ S H+N   T SL++LE    +H  D      E  EQ  N + ++    NID+ ++     A    E++
Subjt:  ELKNGQKEILRLLNNLLEIVTPKDFSEKQSQESLHKNCPQTSSLEKLE---FMHATDQFIQVAEASEQVANKDPID----NIDRIVHYATSYAAPEIEDV

Query:  ------IEDEGMMNKKKKKEKRKKRRKMKMKKI
               ED+ M  + + K   K   + +M ++
Subjt:  ------IEDEGMMNKKKKKEKRKKRRKMKMKKI

A0A6J1BX50 uncharacterized protein LOC1110055248.6e-3435Show/hide
Query:  KIPLQLITHLIRRLCVSPNDDTIAFNLEGNVIQFGLKEFSIITGLNCNPFPVENIDDGDELDSEIKQSFFKHKDTIDRKDLGTAFSICQSPLLDEDDDPL
        K   QL  HLIRR C S N + + FNLEG V +FG+K+F++ITG+NC   P   ID      +   + +F  + TI R  L   F       +D+  D  
Subjt:  KIPLQLITHLIRRLCVSPNDDTIAFNLEGNVIQFGLKEFSIITGLNCNPFPVENIDDGDELDSEIKQSFFKHKDTIDRKDLGTAFSICQSPLLDEDDDPL

Query:  IVKFSSLYLLECLIVPKQHHNRVSWRHVKIFDNEELILNYPWGRLSYDLTIEYIKKA-STNTTGAIYLQGFPIVLIYWALEIIPKLSEETLGFARKIQSP
        +VK + LY+LE  ++ KQ    ++  +  + D++E    YPWGR+SY++TI+++KKA  +N   AI + GFP  L+ WA E IP LS  +  FA KI S 
Subjt:  IVKFSSLYLLECLIVPKQHHNRVSWRHVKIFDNEELILNYPWGRLSYDLTIEYIKKA-STNTTGAIYLQGFPIVLIYWALEIIPKLSEETLGFARKIQSP

Query:  TSTPRMTHWEILDTPDWSQFDEKVFSAEDFSVVPLVPTDEEMASKNMEHFIKEMEKEKEER---MEREKNEMEKKRHEEE
           PRM +W     P+W    EK+F ++ F V P+  TD E+  + M   +  +E+ K++    +++E+N   +  + ++
Subjt:  TSTPRMTHWEILDTPDWSQFDEKVFSAEDFSVVPLVPTDEEMASKNMEHFIKEMEKEKEER---MEREKNEMEKKRHEEE

W9SF50 DUF1985 domain-containing protein1.7e-2930.39Show/hide
Query:  MKKIPLQLITHLIRRLCVSPNDDTIAFNLEGNVIQFGLKEFSIITGLNCNPFPVENIDDGDELDSEIKQSFFKHKDTIDRKDLGTAFSICQSPLLDEDDD
        +KK P QLI HLI R C     + + F++EG +++FG+KEF++ITGLNC+ +P   I +    +S  K+ FF+   ++ R  L   F   +    DED  
Subjt:  MKKIPLQLITHLIRRLCVSPNDDTIAFNLEGNVIQFGLKEFSIITGLNCNPFPVENIDDGDELDSEIKQSFFKHKDTIDRKDLGTAFSICQSPLLDEDDD

Query:  PLIVKFSSLYLLECLIVPKQHHNRVSWRHVKIFDNEELILNYPWGRLSYDLTIEYIKKASTNTTGAIY-LQGFPIVLIYWALEIIPKLSEETLGFARKIQ
          IVK + LY LE L++PK+  N +   H+K+ DN EL  NYPWGRLSY++TI YIK++  +     Y + GFP  +I WA E IP L ++ +  A++I 
Subjt:  PLIVKFSSLYLLECLIVPKQHHNRVSWRHVKIFDNEELILNYPWGRLSYDLTIEYIKKASTNTTGAIY-LQGFPIVLIYWALEIIPKLSEETLGFARKIQ

Query:  SPTSTPRMTHWEILDTPDWSQFDEKVFSAEDFSVVPLVPTDEEMASKNMEHFIKEMEKEKEERMEREKNEMEKKRHEEEEEKRKREENEENEKRKREEKE
        +    PR+ +WE    P + +  ++VF + +  V  ++P+ EEM    M  F KE +K        E N +E+++ E + E          + R+ E + 
Subjt:  SPTSTPRMTHWEILDTPDWSQFDEKVFSAEDFSVVPLVPTDEEMASKNMEHFIKEMEKEKEERMEREKNEMEKKRHEEEEEKRKREENEENEKRKREEKE

Query:  EEEKREQGKNEKEKERISLLEHDIKNLKEGQKELKNGQKEILRLLNNLLEIVT--PKDFSEKQSQESLHKNCPQTSSLEKLEFMHATDQFIQVAEASEQV
              QG+ +   + I  L   ++ +++ Q E+K   +E+  +L  +   V   P  F  K S +       +   +EK       D+  ++ E  EQ 
Subjt:  EEEKREQGKNEKEKERISLLEHDIKNLKEGQKELKNGQKEILRLLNNLLEIVT--PKDFSEKQSQESLHKNCPQTSSLEKLEFMHATDQFIQVAEASEQV

Query:  ANKDPIDNIDRIVHYATSYAAPEIEDVIEDEGMMNKKKKKE
          KD  + ID          + ++ + IE +G      +KE
Subjt:  ANKDPIDNIDRIVHYATSYAAPEIEDVIEDEGMMNKKKKKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAAAATTCCTTTACAACTCATCACACACCTCATTAGGAGGTTGTGTGTTTCCCCTAATGATGATACCATTGCTTTCAATCTTGAGGGCAATGTAATACAATTTGG
ATTAAAAGAATTTTCAATCATTACTGGATTAAATTGCAACCCATTTCCAGTTGAAAATATTGATGATGGGGATGAGTTGGACTCAGAAATTAAACAATCCTTCTTCAAAC
ATAAAGACACAATTGATAGGAAGGATTTAGGTACTGCATTCAGTATTTGTCAAAGCCCTTTGTTAGATGAGGATGATGATCCTTTAATTGTTAAATTTTCATCCTTATAT
CTATTGGAATGTCTTATAGTACCCAAACAACATCACAATAGGGTTAGCTGGAGGCATGTAAAAATCTTTGACAATGAAGAGTTAATCCTAAATTATCCATGGGGGAGGTT
GTCATATGATTTAACCATAGAATATATAAAAAAAGCCTCCACAAATACCACAGGAGCCATATATCTCCAAGGATTTCCAATAGTCCTAATATATTGGGCATTGGAAATCA
TTCCGAAATTATCTGAGGAAACCTTAGGATTTGCTAGGAAAATCCAAAGCCCCACCTCTACCCCAAGGATGACTCATTGGGAGATATTGGACACGCCAGATTGGAGCCAA
TTCGATGAAAAGGTTTTCTCCGCTGAAGATTTCTCTGTTGTTCCATTAGTGCCAACTGATGAAGAAATGGCATCAAAAAATATGGAACACTTCATCAAAGAAATGGAGAA
GGAGAAGGAGGAAAGGATGGAAAGAGAAAAGAATGAAATGGAAAAGAAAAGACACGAGGAAGAAGAAGAAAAAAGAAAAAGAGAAGAAAATGAAGAAAATGAAAAAAGAA
AAAGAGAAGAAAAGGAAGAAGAAGAAAAAAGAGAACAAGGAAAGAATGAAAAAGAAAAAGAAAGGATATCATTGTTGGAACATGATATCAAAAATTTGAAGGAAGGACAA
AAAGAATTAAAAAATGGACAAAAAGAGATTTTAAGATTGCTAAATAATTTGCTGGAAATTGTCACACCAAAAGATTTTTCTGAAAAACAATCTCAAGAATCTCTTCACAA
AAATTGTCCTCAAACTTCTAGTCTTGAAAAACTAGAATTTATGCATGCTACTGATCAATTTATACAAGTTGCTGAAGCTTCTGAACAAGTTGCCAATAAAGACCCAATTG
ACAACATTGACAGGATTGTTCATTATGCAACCTCATATGCTGCTCCAGAGATAGAAGATGTGATAGAAGATGAGGGGATGATGAACAAGAAGAAGAAGAAAGAGAAAAGG
AAGAAGAGAAGGAAAATGAAAATGAAGAAGATATGGATGAAGAAGAAAAAGAAGAAGAAGAGAACAGGGAAAGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAAAATTCCTTTACAACTCATCACACACCTCATTAGGAGGTTGTGTGTTTCCCCTAATGATGATACCATTGCTTTCAATCTTGAGGGCAATGTAATACAATTTGG
ATTAAAAGAATTTTCAATCATTACTGGATTAAATTGCAACCCATTTCCAGTTGAAAATATTGATGATGGGGATGAGTTGGACTCAGAAATTAAACAATCCTTCTTCAAAC
ATAAAGACACAATTGATAGGAAGGATTTAGGTACTGCATTCAGTATTTGTCAAAGCCCTTTGTTAGATGAGGATGATGATCCTTTAATTGTTAAATTTTCATCCTTATAT
CTATTGGAATGTCTTATAGTACCCAAACAACATCACAATAGGGTTAGCTGGAGGCATGTAAAAATCTTTGACAATGAAGAGTTAATCCTAAATTATCCATGGGGGAGGTT
GTCATATGATTTAACCATAGAATATATAAAAAAAGCCTCCACAAATACCACAGGAGCCATATATCTCCAAGGATTTCCAATAGTCCTAATATATTGGGCATTGGAAATCA
TTCCGAAATTATCTGAGGAAACCTTAGGATTTGCTAGGAAAATCCAAAGCCCCACCTCTACCCCAAGGATGACTCATTGGGAGATATTGGACACGCCAGATTGGAGCCAA
TTCGATGAAAAGGTTTTCTCCGCTGAAGATTTCTCTGTTGTTCCATTAGTGCCAACTGATGAAGAAATGGCATCAAAAAATATGGAACACTTCATCAAAGAAATGGAGAA
GGAGAAGGAGGAAAGGATGGAAAGAGAAAAGAATGAAATGGAAAAGAAAAGACACGAGGAAGAAGAAGAAAAAAGAAAAAGAGAAGAAAATGAAGAAAATGAAAAAAGAA
AAAGAGAAGAAAAGGAAGAAGAAGAAAAAAGAGAACAAGGAAAGAATGAAAAAGAAAAAGAAAGGATATCATTGTTGGAACATGATATCAAAAATTTGAAGGAAGGACAA
AAAGAATTAAAAAATGGACAAAAAGAGATTTTAAGATTGCTAAATAATTTGCTGGAAATTGTCACACCAAAAGATTTTTCTGAAAAACAATCTCAAGAATCTCTTCACAA
AAATTGTCCTCAAACTTCTAGTCTTGAAAAACTAGAATTTATGCATGCTACTGATCAATTTATACAAGTTGCTGAAGCTTCTGAACAAGTTGCCAATAAAGACCCAATTG
ACAACATTGACAGGATTGTTCATTATGCAACCTCATATGCTGCTCCAGAGATAGAAGATGTGATAGAAGATGAGGGGATGATGAACAAGAAGAAGAAGAAAGAGAAAAGG
AAGAAGAGAAGGAAAATGAAAATGAAGAAGATATGGATGAAGAAGAAAAAGAAGAAGAAGAGAACAGGGAAAGAATAA
Protein sequenceShow/hide protein sequence
MKKIPLQLITHLIRRLCVSPNDDTIAFNLEGNVIQFGLKEFSIITGLNCNPFPVENIDDGDELDSEIKQSFFKHKDTIDRKDLGTAFSICQSPLLDEDDDPLIVKFSSLY
LLECLIVPKQHHNRVSWRHVKIFDNEELILNYPWGRLSYDLTIEYIKKASTNTTGAIYLQGFPIVLIYWALEIIPKLSEETLGFARKIQSPTSTPRMTHWEILDTPDWSQ
FDEKVFSAEDFSVVPLVPTDEEMASKNMEHFIKEMEKEKEERMEREKNEMEKKRHEEEEEKRKREENEENEKRKREEKEEEEKREQGKNEKEKERISLLEHDIKNLKEGQ
KELKNGQKEILRLLNNLLEIVTPKDFSEKQSQESLHKNCPQTSSLEKLEFMHATDQFIQVAEASEQVANKDPIDNIDRIVHYATSYAAPEIEDVIEDEGMMNKKKKKEKR
KKRRKMKMKKIWMKKKKKKKRTGKE