; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021423 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021423
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDUF4283 domain-containing protein
Genome locationscaffold4:463135..472600
RNA-Seq ExpressionSpg021423
SyntenySpg021423
Gene Ontology termsGO:0008152 - metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR029021 - Protein-tyrosine phosphatase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039949.1 hypothetical protein E6C27_scaffold122G002290 [Cucumis melo var. makuwa]9.1e-3028.53Show/hide
Query:  IHLSRPHLNWLELTLVELLQSPVHLFFRKKLRDSNGTVQLSKFNSQQGWFLECSVWPFSGGKKSVQVPVGYAKNGWLIFWEMI---RDFLMKFVETKSVE
        I ++   L WL++T   LL +P    F  + R  +  + + K ++++G+  E       G K  + VP G  K GW +F +M+   +    K   T+   
Subjt:  IHLSRPHLNWLELTLVELLQSPVHLFFRKKLRDSNGTVQLSKFNSQQGWFLECSVWPFSGGKKSVQVPVGYAKNGWLIFWEMI---RDFLMKFVETKSVE

Query:  NISKKSKYEVSSTSVNNTSKNLDRSYADMV------RVKSGEPHFGSHTKKLPEISSFW------VRKEKEVVDLKLDEFCVVSRMFAHNDWKEVKQALE
        N  K  +    S   +  S+   ++Y + V       + S      S TK  P +  F       +RKE++  D+  ++  ++SR   H+DW ++   L+
Subjt:  NISKKSKYEVSSTSVNNTSKNLDRSYADMV------RVKSGEPHFGSHTKKLPEISSFW------VRKEKEVVDLKLDEFCVVSRMFAHNDWKEVKQALE

Query:  NYFHSK---VLLNPFMADKALVKLKDS-----LKFDGKWKLIGNFHLKIENWSCHKHFHPEVIEGYGGWIALKNLPLPFWNRYIFEIIGNHFGGLVSISS
        +    K       PF ADKAL+ +KD      L  +  W  +G F++K E WS + H   +VI  YGGW   + +PL  WN   F  IG   GG +  + 
Subjt:  NYFHSK---VLLNPFMADKALVKLKDS-----LKFDGKWKLIGNFHLKIENWSCHKHFHPEVIEGYGGWIALKNLPLPFWNRYIFEIIGNHFGGLVSISS

Query:  QTLNLLDCSEARIEVKRNFCGFLPAEIVVTDKIHGNFALR
        +++N L+ +EA I+VK N+ GFLPA I + D+   +F ++
Subjt:  QTLNLLDCSEARIEVKRNFCGFLPAEIVVTDKIHGNFALR

KAA0047189.1 hypothetical protein E6C27_scaffold83G00690 [Cucumis melo var. makuwa]9.7e-3234.6Show/hide
Query:  WVRKEKEVVDLKLDEFCVVSRMFAHNDWKEVKQALENYFHSKVLLNPFMADKALVKL-----KDSLKFDGKWKLIGNFHLKIENWSCHKHFHPEVIEGYG
        WV +  EV+    +   +++++FA +D +++++ LENYF +K+++NP   + AL+ L     KD +  +GKW+++G+F+LK E W  HK+  P  ++GYG
Subjt:  WVRKEKEVVDLKLDEFCVVSRMFAHNDWKEVKQALENYFHSKVLLNPFMADKALVKL-----KDSLKFDGKWKLIGNFHLKIENWSCHKHFHPEVIEGYG

Query:  GWIALKNLPLPFWNRYIFEIIGNHFGGLVSISSQTLNLLDCSEARIEVKRNFCGFLPAEIVVTDKIHGNFALRFGDISPLDPPRFIPMDLSLSDFDNEID
        GW+ +KNL    W     E                      SEARI+VK N CGF+P+ I + D   GN  L FGD   L+PP      + +SDF   I 
Subjt:  GWIALKNLPLPFWNRYIFEIIGNHFGGLVSISSQTLNLLDCSEARIEVKRNFCGFLPAEIVVTDKIHGNFALRFGDISPLDPPRFIPMDLSLSDFDNEID

Query:  LKRVSQIMMDE
        L R+ +++ DE
Subjt:  LKRVSQIMMDE

KAA0063414.1 uncharacterized protein E6C27_scaffold508G00510 [Cucumis melo var. makuwa]1.5e-3230.23Show/hide
Query:  GWFLECSVWPFSGGKKSVQVPVGYAKNGWLIFWEMIRDFLMKFVETKSVENISKKSKYEVSSTSVNNTSKNLDRSYADMVRVKSGEPHFGSHTKKLPEI-
        GW L C+VWP SGG+  + +PVG  + GW+ F  MI+DFL                       SV+ + +++     DM+ +    P F     + P   
Subjt:  GWFLECSVWPFSGGKKSVQVPVGYAKNGWLIFWEMIRDFLMKFVETKSVENISKKSKYEVSSTSVNNTSKNLDRSYADMVRVKSGEPHFGSHTKKLPEI-

Query:  --SSFWVRKEKEVVDLKLDEFCVVSRMFAHNDWKEVKQALENYFHSKVLLNPFMADKALVKLKDSLKFDGKWKLIGNFHLKIENWSCHKHFHPEVIEGYG
          SS WV K  EV+                 D+K +                                                         +V++GYG
Subjt:  --SSFWVRKEKEVVDLKLDEFCVVSRMFAHNDWKEVKQALENYFHSKVLLNPFMADKALVKLKDSLKFDGKWKLIGNFHLKIENWSCHKHFHPEVIEGYG

Query:  GWIALKNLPLPFWNRYIFEIIGNHFGGLVSISSQTLNLLDCSEARIEVKRNFCGFLPAEIVVTDKIHGNFALRFGDISPLDPPRFIPMDLSLSDFDNEID
        GWI++KNLPL +W+  +++ IG  FGG  SIS +T+NL++CSEA+I+V +N CGFLPA + + D    N  L FGDI  L+ P+ I   L +S  +N ID
Subjt:  GWIALKNLPLPFWNRYIFEIIGNHFGGLVSISSQTLNLLDCSEARIEVKRNFCGFLPAEIVVTDKIHGNFALRFGDISPLDPPRFIPMDLSLSDFDNEID

Query:  LKRVSQIMMDE
        L R++Q+++DE
Subjt:  LKRVSQIMMDE

TYJ98837.1 putative 3,4-dihydroxy-2-butanone kinase isoform X5 [Cucumis melo var. makuwa]1.7e-2854.87Show/hide
Query:  GKWKLIGNFHLKIENWSCHKHFHPEVIEGYGGWIALKNLPLPFWNRYIFEIIGNHFGGLVSISSQTLNLLDCSEARIEVKRNFCGFLPAEIVVTDKIHGN
        GKW+  GNFHLKIE W   KH  P V +GYGGW+ +KNLPL +W R   E+IG+HF GL  I+ +TLNL + SEARI+VK+N CGF+P+ I +TD    N
Subjt:  GKWKLIGNFHLKIENWSCHKHFHPEVIEGYGGWIALKNLPLPFWNRYIFEIIGNHFGGLVSISSQTLNLLDCSEARIEVKRNFCGFLPAEIVVTDKIHGN

Query:  FALRFGDISPLDP
          L FGD   L+P
Subjt:  FALRFGDISPLDP

XP_038904899.1 uncharacterized protein LOC120091119 isoform X2 [Benincasa hispida]8.8e-3346.71Show/hide
Query:  LKDSLKFDGKWKLIGNFHLKIENWSCHKHFHPEVIEGYGGWIALKNLPLPFWNRYIFEIIGNHFGGLVSISSQTLNLLDCSEARIEVKRNFCGFLPAEIV
        L + +   GKW+  G+FHLK E W+   H  P  + GYGGWI++KNLPL +W +  FE IG +FGGL SI+ + LNL+   +A I+VK N CGF+PA I 
Subjt:  LKDSLKFDGKWKLIGNFHLKIENWSCHKHFHPEVIEGYGGWIALKNLPLPFWNRYIFEIIGNHFGGLVSISSQTLNLLDCSEARIEVKRNFCGFLPAEIV

Query:  VTDKIHGNFALRFGDISPLDPPRFIPMDLSLSDFDNEIDLKRVSQIMMDEEI
        V+++  G+  L FGDIS  +PP  +  DL  SDF N IDL R++++   E I
Subjt:  VTDKIHGNFALRFGDISPLDPPRFIPMDLSLSDFDNEIDLKRVSQIMMDEEI

TrEMBL top hitse value%identityAlignment
A0A5A7U128 Uncharacterized protein4.7e-3234.6Show/hide
Query:  WVRKEKEVVDLKLDEFCVVSRMFAHNDWKEVKQALENYFHSKVLLNPFMADKALVKL-----KDSLKFDGKWKLIGNFHLKIENWSCHKHFHPEVIEGYG
        WV +  EV+    +   +++++FA +D +++++ LENYF +K+++NP   + AL+ L     KD +  +GKW+++G+F+LK E W  HK+  P  ++GYG
Subjt:  WVRKEKEVVDLKLDEFCVVSRMFAHNDWKEVKQALENYFHSKVLLNPFMADKALVKL-----KDSLKFDGKWKLIGNFHLKIENWSCHKHFHPEVIEGYG

Query:  GWIALKNLPLPFWNRYIFEIIGNHFGGLVSISSQTLNLLDCSEARIEVKRNFCGFLPAEIVVTDKIHGNFALRFGDISPLDPPRFIPMDLSLSDFDNEID
        GW+ +KNL    W     E                      SEARI+VK N CGF+P+ I + D   GN  L FGD   L+PP      + +SDF   I 
Subjt:  GWIALKNLPLPFWNRYIFEIIGNHFGGLVSISSQTLNLLDCSEARIEVKRNFCGFLPAEIVVTDKIHGNFALRFGDISPLDPPRFIPMDLSLSDFDNEID

Query:  LKRVSQIMMDE
        L R+ +++ DE
Subjt:  LKRVSQIMMDE

A0A5A7V878 DUF4283 domain-containing protein7.3e-3330.23Show/hide
Query:  GWFLECSVWPFSGGKKSVQVPVGYAKNGWLIFWEMIRDFLMKFVETKSVENISKKSKYEVSSTSVNNTSKNLDRSYADMVRVKSGEPHFGSHTKKLPEI-
        GW L C+VWP SGG+  + +PVG  + GW+ F  MI+DFL                       SV+ + +++     DM+ +    P F     + P   
Subjt:  GWFLECSVWPFSGGKKSVQVPVGYAKNGWLIFWEMIRDFLMKFVETKSVENISKKSKYEVSSTSVNNTSKNLDRSYADMVRVKSGEPHFGSHTKKLPEI-

Query:  --SSFWVRKEKEVVDLKLDEFCVVSRMFAHNDWKEVKQALENYFHSKVLLNPFMADKALVKLKDSLKFDGKWKLIGNFHLKIENWSCHKHFHPEVIEGYG
          SS WV K  EV+                 D+K +                                                         +V++GYG
Subjt:  --SSFWVRKEKEVVDLKLDEFCVVSRMFAHNDWKEVKQALENYFHSKVLLNPFMADKALVKLKDSLKFDGKWKLIGNFHLKIENWSCHKHFHPEVIEGYG

Query:  GWIALKNLPLPFWNRYIFEIIGNHFGGLVSISSQTLNLLDCSEARIEVKRNFCGFLPAEIVVTDKIHGNFALRFGDISPLDPPRFIPMDLSLSDFDNEID
        GWI++KNLPL +W+  +++ IG  FGG  SIS +T+NL++CSEA+I+V +N CGFLPA + + D    N  L FGDI  L+ P+ I   L +S  +N ID
Subjt:  GWIALKNLPLPFWNRYIFEIIGNHFGGLVSISSQTLNLLDCSEARIEVKRNFCGFLPAEIVVTDKIHGNFALRFGDISPLDPPRFIPMDLSLSDFDNEID

Query:  LKRVSQIMMDE
        L R++Q+++DE
Subjt:  LKRVSQIMMDE

A0A5D3BI91 Putative 3,4-dihydroxy-2-butanone kinase isoform X58.3e-2954.87Show/hide
Query:  GKWKLIGNFHLKIENWSCHKHFHPEVIEGYGGWIALKNLPLPFWNRYIFEIIGNHFGGLVSISSQTLNLLDCSEARIEVKRNFCGFLPAEIVVTDKIHGN
        GKW+  GNFHLKIE W   KH  P V +GYGGW+ +KNLPL +W R   E+IG+HF GL  I+ +TLNL + SEARI+VK+N CGF+P+ I +TD    N
Subjt:  GKWKLIGNFHLKIENWSCHKHFHPEVIEGYGGWIALKNLPLPFWNRYIFEIIGNHFGGLVSISSQTLNLLDCSEARIEVKRNFCGFLPAEIVVTDKIHGN

Query:  FALRFGDISPLDP
          L FGD   L+P
Subjt:  FALRFGDISPLDP

A0A5D3DJE1 DUF4283 domain-containing protein7.0e-2826.09Show/hide
Query:  IHLSRPHLNWLELTLVELLQSPVHLFFRKKLRDSNGTVQLSKFNSQQGWFLECSVWPFSGGKKSVQVPVGYAKNGWLIFWEMIRDFLMKFVETKSVENIS
        I +S   L+W++ TL  L+ +P    F  + RDS   + + K  + +G   E         K  + VP G  K+GW+ F  MI   +    +T+    + 
Subjt:  IHLSRPHLNWLELTLVELLQSPVHLFFRKKLRDSNGTVQLSKFNSQQGWFLECSVWPFSGGKKSVQVPVGYAKNGWLIFWEMIRDFLMKFVETKSVENIS

Query:  KKSKYEVSSTSVNNTSKNLDRSYADMVRVKSGEPHFGSHTKKLPEISSFWVRKEKEVVDLKLDEFCVVSRMFAHNDWKEVKQALENYFHSKVLLNPFMAD
        + S     S  ++   ++  R+  +          + S        +SF      ++    L+   V+ R F H+DW ++ Q L          N F A+
Subjt:  KKSKYEVSSTSVNNTSKNLDRSYADMVRVKSGEPHFGSHTKKLPEISSFWVRKEKEVVDLKLDEFCVVSRMFAHNDWKEVKQALENYFHSKVLLNPFMAD

Query:  KALVKLKDSLKF-----DGKWKLIGNFHLKIENWSCHKHFHPEVIEGYGGWIALKNLPLPFWNRYIFEIIGNHFGGLVSISSQTLNLLDCSEARIEVKRN
        KALV    ++       +  W  +G + ++ E WS   H  P++I  YGGW   + +PL  WN   F+ IG   GGL+ ++ +T +  +  EARI+V+ N
Subjt:  KALVKLKDSLKF-----DGKWKLIGNFHLKIENWSCHKHFHPEVIEGYGGWIALKNLPLPFWNRYIFEIIGNHFGGLVSISSQTLNLLDCSEARIEVKRN

Query:  FCGFLPAEIVVTDKIHGNFALR
        + GFLPA + + D     F+++
Subjt:  FCGFLPAEIVVTDKIHGNFALR

A0A5D3DLT1 DUF4283 domain-containing protein4.4e-3028.53Show/hide
Query:  IHLSRPHLNWLELTLVELLQSPVHLFFRKKLRDSNGTVQLSKFNSQQGWFLECSVWPFSGGKKSVQVPVGYAKNGWLIFWEMI---RDFLMKFVETKSVE
        I ++   L WL++T   LL +P    F  + R  +  + + K ++++G+  E       G K  + VP G  K GW +F +M+   +    K   T+   
Subjt:  IHLSRPHLNWLELTLVELLQSPVHLFFRKKLRDSNGTVQLSKFNSQQGWFLECSVWPFSGGKKSVQVPVGYAKNGWLIFWEMI---RDFLMKFVETKSVE

Query:  NISKKSKYEVSSTSVNNTSKNLDRSYADMV------RVKSGEPHFGSHTKKLPEISSFW------VRKEKEVVDLKLDEFCVVSRMFAHNDWKEVKQALE
        N  K  +    S   +  S+   ++Y + V       + S      S TK  P +  F       +RKE++  D+  ++  ++SR   H+DW ++   L+
Subjt:  NISKKSKYEVSSTSVNNTSKNLDRSYADMV------RVKSGEPHFGSHTKKLPEISSFW------VRKEKEVVDLKLDEFCVVSRMFAHNDWKEVKQALE

Query:  NYFHSK---VLLNPFMADKALVKLKDS-----LKFDGKWKLIGNFHLKIENWSCHKHFHPEVIEGYGGWIALKNLPLPFWNRYIFEIIGNHFGGLVSISS
        +    K       PF ADKAL+ +KD      L  +  W  +G F++K E WS + H   +VI  YGGW   + +PL  WN   F  IG   GG +  + 
Subjt:  NYFHSK---VLLNPFMADKALVKLKDS-----LKFDGKWKLIGNFHLKIENWSCHKHFHPEVIEGYGGWIALKNLPLPFWNRYIFEIIGNHFGGLVSISS

Query:  QTLNLLDCSEARIEVKRNFCGFLPAEIVVTDKIHGNFALR
        +++N L+ +EA I+VK N+ GFLPA I + D+   +F ++
Subjt:  QTLNLLDCSEARIEVKRNFCGFLPAEIVVTDKIHGNFALR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G62010.1 unknown protein2.4e-1267.35Show/hide
Query:  NACDADSLQVHGVAIPTILGIQNVLKYVGAQKDRQQAQVLWINLREEPV
        N   ADSL+VHGVAIPT +GI+NVL+++GA KD +Q +VLWI+LREEPV
Subjt:  NACDADSLQVHGVAIPTILGIQNVLKYVGAQKDRQQAQVLWINLREEPV

AT3G62010.2 unknown protein2.4e-1267.35Show/hide
Query:  NACDADSLQVHGVAIPTILGIQNVLKYVGAQKDRQQAQVLWINLREEPV
        N   ADSL+VHGVAIPT +GI+NVL+++GA KD +Q +VLWI+LREEPV
Subjt:  NACDADSLQVHGVAIPTILGIQNVLKYVGAQKDRQQAQVLWINLREEPV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAAAATTCATCTTTCTAGGCCTCACTTGAATTGGCTTGAGTTAACTTTAGTCGAATTATTGCAAAGCCCGGTTCATTTGTTCTTTCGAAAGAAATTAAGAGATTC
AAATGGAACAGTTCAGTTATCGAAGTTCAATTCCCAGCAAGGTTGGTTCCTTGAATGCTCCGTTTGGCCCTTTTCTGGTGGAAAGAAAAGTGTGCAAGTTCCTGTGGGTT
ATGCCAAGAACGGTTGGTTAATATTTTGGGAAATGATTAGAGACTTTCTTATGAAATTTGTGGAAACAAAGTCTGTTGAGAATATTTCAAAGAAGTCTAAATATGAAGTA
TCATCCACGTCGGTTAATAATACTAGTAAGAATTTGGATAGAAGTTATGCTGATATGGTGAGGGTTAAATCTGGGGAACCTCATTTTGGTTCTCACACTAAGAAGCTGCC
AGAAATTTCTTCATTTTGGGTTAGAAAAGAAAAAGAAGTGGTGGACTTAAAGCTAGATGAATTTTGTGTGGTATCTAGAATGTTTGCACATAATGATTGGAAGGAAGTAA
AGCAAGCTTTGGAAAATTATTTTCATTCTAAGGTTTTACTCAATCCTTTTATGGCTGATAAAGCCTTGGTAAAACTGAAAGATAGCTTGAAATTTGATGGCAAATGGAAA
CTTATTGGAAACTTTCATTTGAAAATTGAAAATTGGTCGTGTCACAAACATTTTCATCCTGAGGTGATTGAAGGATATGGAGGTTGGATAGCCTTGAAGAACTTACCTTT
GCCATTCTGGAATCGTTACATTTTTGAAATCATTGGAAATCACTTTGGTGGATTGGTTAGCATTTCTTCTCAAACATTGAACCTTTTAGATTGTTCAGAAGCTCGAATAG
AGGTGAAAAGGAATTTTTGTGGATTTTTACCTGCTGAAATTGTGGTTACGGATAAGATTCATGGAAACTTTGCTCTTCGTTTTGGTGATATATCTCCTTTAGACCCTCCA
CGTTTTATTCCTATGGATTTATCATTGAGTGATTTTGATAATGAAATTGATTTAAAAAGAGTTTCCCAGATTATGATGGATGAAGAAATTTCCTCATCACAAGAAGATAT
TAATTCCTTCAATGATCAAGGACTAAACCCTCCAGCCAAGCAAAATCGGGTAAATCAAGAAATCTCTTCATCTTCTAAAGACCCCAACGGCATTTTTGTTGAATTTTCCT
GTTCCAAGGAAGCATTAATTGAAAAGGTTAATGGTGGTTTAATGAGGCCGGTTGAGTTTTCTAAAAAGAATGCTTTGTTGCAAGAAACGGCTATTAATACCATTGTTAAG
GACATTAATGCCAATGTTTCAGTTATTAAGAATGCATTAATTGATGGTGCTGTGCATGAGTCCCAGGAGTTACTACTCACGCCTATTTATGATCCTACTTCAGGTTTGAA
GAGAAGTAATGCTGCTGGTTTTGAAGAAAATGAATCATCTTCTAAAGATCCCAACGGTATTATGGATGATTCACTGGAAAAATTGAATGATGGAGGTTCACATTTAGTAA
ATTCGGTTGAGTTATTACACGAAAATGCCTTTAGTGACTTGGTTTCACATTCCAAGGAAGCATTAATTAAAGAGGTTAATTGTAATTTAATTGGGCCGGTTGAGATTTCA
AAAGAGAAGAGTGTTTTGTTGCAAGAAAGAGATTTTAATGCCAACGGAAAGGGTATTAATGCCATCGGTTCAGATATTCAAGGAGCGTTAACTGATGGAGCTTTGAATGA
GTCTCAGGATTTATTATTCACGCCTATTCATGACCCACCTTCGGATTTGAAGAGTTGTAATGCAGCTGGTTTGAAAGAAAACGAACAGAATGTTTCTAAGGCTTTAAAGA
AGAAATATGAATCATTTCCTCTTCATTATTCTCGAAGGAAGTGTGAAAAGTCAGATATTTTGGACTCAATTCCCATTAATTCCAATTATAACCCTGATGTTATTGAAGAA
TGTTGTTCTCAATTTTTGCTCTCTGCTTTGAATCAGCCTAGGTACTGTCAAACTAATCTTAATGAGTTATCAAATTCCACTTCATCCAATCAGTATATTCTTTCAAACAT
TCAATCGTCTTCCTTAACAAAGGGGGTTTTTATTCCTTCATCCAAAGTTGAAATTAAAGTTGATCAATCATATTCATCTCCTATTGATTCTGATGATGATTCAGTGGTGA
GTATTAGTAGTGTTGAGGCTGAAAATCAGTATTTGAATGATGAAAACAATGAATTATTGGAGGAAGACTCTTTTGCAATGGCTTTTAATCAGATTTTCCAGAATAATGAT
GATGTTTCTGAAGTTCAGTTGAATGCTTGTGATGCCGATTCACTACAAGTCCACGGTGTTGCTATTCCTACAATTCTTGGAATTCAGAATGTTCTCAAGTATGTTGGAGC
TCAAAAAGATAGGCAACAAGCTCAAGTTCTTTGGATTAACCTTCGGGAAGAACCGGTAAATCCTCTCATCGCAATAATATGTTGTTTTTATGCCGATAGAGCTTTGCTAA
CACGTGAAAACGAAAATAAATCATCTACCCACTATAGGGTAAACAAATGGTATAGGGTGGGTGAATTCTCTCTCAAATTCAAGCAATGGAATAGCGTTGATAGCAAAGAA
GACCGGTTGATCCCATCTTATGGTGGCTGGATAAAGATTAAAAACCTCCCTCTTGATAAATGGGATGAGAAAACCCTTGGATACATTGGAGACTCTTGCGGAGGATATTT
TGAGATCGCCAACAAGACTTTATCTCGAATAGACCTCATAGAAGCATCCATTGAGGTATCCATGGAACTTTGCCGCCGGCAACTCTCCGATAAGATTTCTGAATCACAAA
AAGAAGCTAAACAAGTGGCGGGAATTCTTGGGCCGTACCCCCAATCTGACCCATGGCCGTTTGTAACCTGCCCACAACTTGTACAATCCTGTCCTATACTTGATCGTTCT
TGTAGTAGAACTGATAGCTTTACGCAAACAGGCACTAAGCATTCCTCGAATCCTCCAAACCTTGTCTTTGACTCTGAGGATGATTTCTCCAGTCCTTATCCTTCCTCCCC
ATCTCCTTCCACACCTATCCCCCTCCCAATATAG
mRNA sequenceShow/hide mRNA sequence
ATGCAGAAAATTCATCTTTCTAGGCCTCACTTGAATTGGCTTGAGTTAACTTTAGTCGAATTATTGCAAAGCCCGGTTCATTTGTTCTTTCGAAAGAAATTAAGAGATTC
AAATGGAACAGTTCAGTTATCGAAGTTCAATTCCCAGCAAGGTTGGTTCCTTGAATGCTCCGTTTGGCCCTTTTCTGGTGGAAAGAAAAGTGTGCAAGTTCCTGTGGGTT
ATGCCAAGAACGGTTGGTTAATATTTTGGGAAATGATTAGAGACTTTCTTATGAAATTTGTGGAAACAAAGTCTGTTGAGAATATTTCAAAGAAGTCTAAATATGAAGTA
TCATCCACGTCGGTTAATAATACTAGTAAGAATTTGGATAGAAGTTATGCTGATATGGTGAGGGTTAAATCTGGGGAACCTCATTTTGGTTCTCACACTAAGAAGCTGCC
AGAAATTTCTTCATTTTGGGTTAGAAAAGAAAAAGAAGTGGTGGACTTAAAGCTAGATGAATTTTGTGTGGTATCTAGAATGTTTGCACATAATGATTGGAAGGAAGTAA
AGCAAGCTTTGGAAAATTATTTTCATTCTAAGGTTTTACTCAATCCTTTTATGGCTGATAAAGCCTTGGTAAAACTGAAAGATAGCTTGAAATTTGATGGCAAATGGAAA
CTTATTGGAAACTTTCATTTGAAAATTGAAAATTGGTCGTGTCACAAACATTTTCATCCTGAGGTGATTGAAGGATATGGAGGTTGGATAGCCTTGAAGAACTTACCTTT
GCCATTCTGGAATCGTTACATTTTTGAAATCATTGGAAATCACTTTGGTGGATTGGTTAGCATTTCTTCTCAAACATTGAACCTTTTAGATTGTTCAGAAGCTCGAATAG
AGGTGAAAAGGAATTTTTGTGGATTTTTACCTGCTGAAATTGTGGTTACGGATAAGATTCATGGAAACTTTGCTCTTCGTTTTGGTGATATATCTCCTTTAGACCCTCCA
CGTTTTATTCCTATGGATTTATCATTGAGTGATTTTGATAATGAAATTGATTTAAAAAGAGTTTCCCAGATTATGATGGATGAAGAAATTTCCTCATCACAAGAAGATAT
TAATTCCTTCAATGATCAAGGACTAAACCCTCCAGCCAAGCAAAATCGGGTAAATCAAGAAATCTCTTCATCTTCTAAAGACCCCAACGGCATTTTTGTTGAATTTTCCT
GTTCCAAGGAAGCATTAATTGAAAAGGTTAATGGTGGTTTAATGAGGCCGGTTGAGTTTTCTAAAAAGAATGCTTTGTTGCAAGAAACGGCTATTAATACCATTGTTAAG
GACATTAATGCCAATGTTTCAGTTATTAAGAATGCATTAATTGATGGTGCTGTGCATGAGTCCCAGGAGTTACTACTCACGCCTATTTATGATCCTACTTCAGGTTTGAA
GAGAAGTAATGCTGCTGGTTTTGAAGAAAATGAATCATCTTCTAAAGATCCCAACGGTATTATGGATGATTCACTGGAAAAATTGAATGATGGAGGTTCACATTTAGTAA
ATTCGGTTGAGTTATTACACGAAAATGCCTTTAGTGACTTGGTTTCACATTCCAAGGAAGCATTAATTAAAGAGGTTAATTGTAATTTAATTGGGCCGGTTGAGATTTCA
AAAGAGAAGAGTGTTTTGTTGCAAGAAAGAGATTTTAATGCCAACGGAAAGGGTATTAATGCCATCGGTTCAGATATTCAAGGAGCGTTAACTGATGGAGCTTTGAATGA
GTCTCAGGATTTATTATTCACGCCTATTCATGACCCACCTTCGGATTTGAAGAGTTGTAATGCAGCTGGTTTGAAAGAAAACGAACAGAATGTTTCTAAGGCTTTAAAGA
AGAAATATGAATCATTTCCTCTTCATTATTCTCGAAGGAAGTGTGAAAAGTCAGATATTTTGGACTCAATTCCCATTAATTCCAATTATAACCCTGATGTTATTGAAGAA
TGTTGTTCTCAATTTTTGCTCTCTGCTTTGAATCAGCCTAGGTACTGTCAAACTAATCTTAATGAGTTATCAAATTCCACTTCATCCAATCAGTATATTCTTTCAAACAT
TCAATCGTCTTCCTTAACAAAGGGGGTTTTTATTCCTTCATCCAAAGTTGAAATTAAAGTTGATCAATCATATTCATCTCCTATTGATTCTGATGATGATTCAGTGGTGA
GTATTAGTAGTGTTGAGGCTGAAAATCAGTATTTGAATGATGAAAACAATGAATTATTGGAGGAAGACTCTTTTGCAATGGCTTTTAATCAGATTTTCCAGAATAATGAT
GATGTTTCTGAAGTTCAGTTGAATGCTTGTGATGCCGATTCACTACAAGTCCACGGTGTTGCTATTCCTACAATTCTTGGAATTCAGAATGTTCTCAAGTATGTTGGAGC
TCAAAAAGATAGGCAACAAGCTCAAGTTCTTTGGATTAACCTTCGGGAAGAACCGGTAAATCCTCTCATCGCAATAATATGTTGTTTTTATGCCGATAGAGCTTTGCTAA
CACGTGAAAACGAAAATAAATCATCTACCCACTATAGGGTAAACAAATGGTATAGGGTGGGTGAATTCTCTCTCAAATTCAAGCAATGGAATAGCGTTGATAGCAAAGAA
GACCGGTTGATCCCATCTTATGGTGGCTGGATAAAGATTAAAAACCTCCCTCTTGATAAATGGGATGAGAAAACCCTTGGATACATTGGAGACTCTTGCGGAGGATATTT
TGAGATCGCCAACAAGACTTTATCTCGAATAGACCTCATAGAAGCATCCATTGAGGTATCCATGGAACTTTGCCGCCGGCAACTCTCCGATAAGATTTCTGAATCACAAA
AAGAAGCTAAACAAGTGGCGGGAATTCTTGGGCCGTACCCCCAATCTGACCCATGGCCGTTTGTAACCTGCCCACAACTTGTACAATCCTGTCCTATACTTGATCGTTCT
TGTAGTAGAACTGATAGCTTTACGCAAACAGGCACTAAGCATTCCTCGAATCCTCCAAACCTTGTCTTTGACTCTGAGGATGATTTCTCCAGTCCTTATCCTTCCTCCCC
ATCTCCTTCCACACCTATCCCCCTCCCAATATAG
Protein sequenceShow/hide protein sequence
MQKIHLSRPHLNWLELTLVELLQSPVHLFFRKKLRDSNGTVQLSKFNSQQGWFLECSVWPFSGGKKSVQVPVGYAKNGWLIFWEMIRDFLMKFVETKSVENISKKSKYEV
SSTSVNNTSKNLDRSYADMVRVKSGEPHFGSHTKKLPEISSFWVRKEKEVVDLKLDEFCVVSRMFAHNDWKEVKQALENYFHSKVLLNPFMADKALVKLKDSLKFDGKWK
LIGNFHLKIENWSCHKHFHPEVIEGYGGWIALKNLPLPFWNRYIFEIIGNHFGGLVSISSQTLNLLDCSEARIEVKRNFCGFLPAEIVVTDKIHGNFALRFGDISPLDPP
RFIPMDLSLSDFDNEIDLKRVSQIMMDEEISSSQEDINSFNDQGLNPPAKQNRVNQEISSSSKDPNGIFVEFSCSKEALIEKVNGGLMRPVEFSKKNALLQETAINTIVK
DINANVSVIKNALIDGAVHESQELLLTPIYDPTSGLKRSNAAGFEENESSSKDPNGIMDDSLEKLNDGGSHLVNSVELLHENAFSDLVSHSKEALIKEVNCNLIGPVEIS
KEKSVLLQERDFNANGKGINAIGSDIQGALTDGALNESQDLLFTPIHDPPSDLKSCNAAGLKENEQNVSKALKKKYESFPLHYSRRKCEKSDILDSIPINSNYNPDVIEE
CCSQFLLSALNQPRYCQTNLNELSNSTSSNQYILSNIQSSSLTKGVFIPSSKVEIKVDQSYSSPIDSDDDSVVSISSVEAENQYLNDENNELLEEDSFAMAFNQIFQNND
DVSEVQLNACDADSLQVHGVAIPTILGIQNVLKYVGAQKDRQQAQVLWINLREEPVNPLIAIICCFYADRALLTRENENKSSTHYRVNKWYRVGEFSLKFKQWNSVDSKE
DRLIPSYGGWIKIKNLPLDKWDEKTLGYIGDSCGGYFEIANKTLSRIDLIEASIEVSMELCRRQLSDKISESQKEAKQVAGILGPYPQSDPWPFVTCPQLVQSCPILDRS
CSRTDSFTQTGTKHSSNPPNLVFDSEDDFSSPYPSSPSPSTPIPLPI