; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015991 (gene) of Snake gourd v1 genome

Gene IDTan0015991
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF789)
Genome locationLG09:69993127..69997346
RNA-Seq ExpressionTan0015991
SyntenyTan0015991
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589615.1 hypothetical protein SDJN03_15038, partial [Cucurbita argyrosperma subsp. sororia]7.7e-20387.28Show/hide
Query:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN
        M GTALQFGGIKGEDRFYIPVRARK+YNQQK SRRPTKTDETE+PSS+    VVASTT PSKPLTPQ KSNLERFLDAT+PSV AQYFSKTTMRGWRTC+
Subjt:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN

Query:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH
        I+FQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLS IQI+GE AA+RSDSKSRLANEDSDLDSS+DTSS+GSI+YEFGKSCN SREQWVH
Subjt:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH

Query:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT
        HHLACE+ + MRKTSL DEHST QEGFSSDDGDA  P S LLFQFLEQDLPYQRVPLADKIFDLAYQ+PGLKTLRSCDI  ASW+SVAWYPIYRIPTGPT
Subjt:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIR NGHGQAP MI+ ND DGIPKVSLPVFG+ASYKLKGSIWAQN + EHQMANSLMQAA++WLRR QVNQPDFQFFASH TYW
Subjt:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW

Query:  R
        R
Subjt:  R

XP_022134722.1 uncharacterized protein LOC111006925 [Momordica charantia]4.8e-20588.28Show/hide
Query:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN
        MFGTALQFGGIKGEDRFYIPV+ARKNYNQQK SRRP K+DETESPSSD K+KVVASTTKPSKPLTPQ KSNLERFLDAT PSVPAQYFSKTTMRGWRTC+
Subjt:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN

Query:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH
        I+FQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLS IQI+GE +AMRSDSK RLA EDSDLDSSRDTSSDGSIEYE GK+   SREQWVH
Subjt:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH

Query:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT
          L CENTLKMR  S+ DEH  IQEGFSSDDGDAGNP SVLLFQF EQDLPYQRVPLADKIFDLAYQYPGLK+LRSCDIQ ASWVSVAWYPIYRIPTGPT
Subjt:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIR NGHGQAPVMI+ ND+DG+PKVSLPVFG+ASYKLKGSIWAQNG+ EHQMANSLMQAAD WLR  QV+QPDFQFFASHGTYW
Subjt:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW

Query:  R
        R
Subjt:  R

XP_022921943.1 uncharacterized protein LOC111430050 [Cucurbita moschata]1.2e-20387.53Show/hide
Query:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN
        M GTALQFGGIKGEDRFYIPVRARK+YNQQK SRRPTKTDETE+PSS+    VVASTT PSKPLTPQ KSNLERFLDAT+PSVPAQYFSKTTMRGWRTC+
Subjt:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN

Query:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH
        I+FQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLS IQI+GE AA+RSDSKSRLANEDSDLDSS+DTSS+GSI+YEFGKSCN SREQWVH
Subjt:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH

Query:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT
        HHLACE+ + MRKTSL DEHST QEGFSSDDGDA  P S LLFQFLEQDLPYQRVPLADKIFDLAYQ+PGLKTLRSCDI  ASW+SVAWYPIYRIPTGPT
Subjt:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIR NGHGQAP MI+ ND DGIPKVSLPVFG+ASYKLKGSIWAQN + EHQMANSLMQAA++WLRR QVNQPDFQFFASH TYW
Subjt:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW

Query:  R
        R
Subjt:  R

XP_022987436.1 uncharacterized protein LOC111484983 [Cucurbita maxima]6.5e-20287.28Show/hide
Query:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN
        M GTALQFGGIKGEDRFYIPVRARK YNQQK SRRPTKTDETE+PS    SKVVASTT PSKPLTPQ KSNLERFLDAT+PSVPAQYFSKTTMRGWRTC+
Subjt:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN

Query:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH
        I+FQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLS IQI+GE AA+RSDSKSRLANEDSDLDSSRDTSS+GSI+YEFGKSCN SREQWVH
Subjt:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH

Query:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT
        HHLAC++ L +RKTSL DEHST QEGFSSDDGDA  P S LLFQFLEQDLPYQRVPLADKIFDLAYQ+PGLKTLRSCDI  ASW+SVAWYPIYRIPTGPT
Subjt:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIR NGHGQAP MI+ ND DGIPKVSLPVFG+ASYKLKGSIWAQN + E+QM NSLMQAA++WLRR QVNQPDFQFFASH TYW
Subjt:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW

Query:  R
        R
Subjt:  R

XP_023516127.1 uncharacterized protein LOC111780081 [Cucurbita pepo subsp. pepo]1.6e-20387.78Show/hide
Query:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN
        M GTALQFGGIKGEDRFYIPVRARK+YNQQK SRRPTKTDETE+PS    SKVVASTT PSKPLTPQ KSNLERFLDAT+PSVPAQYFSKTTMRGWRTC+
Subjt:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN

Query:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH
        I+FQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLS IQI+GE AA+RSDSKSRLANEDSDLDSSRDTSS+GSI+YEFGKSCN SREQWVH
Subjt:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH

Query:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT
        HHLACE+ + MRKTSL DEHST QEGFSSDDGDA  P S LLFQFLEQDLPYQRVPLADKIFDLAYQ+PGLKTLRSCDI  ASW+SVAWYPIYRIPTGPT
Subjt:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIR NGHGQAP MI+ ND DGIPKVSLPVFG+ASYKLKGSIWAQN + EHQMANSLMQAA++WLRR QVNQPDFQFFAS+ TYW
Subjt:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW

Query:  R
        R
Subjt:  R

TrEMBL top hitse value%identityAlignment
A0A5A7USF1 Uncharacterized protein8.9e-18982.29Show/hide
Query:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN
        M GTALQFGGIKGEDRFYIPVRARKNYNQQK SRRPTKTDETES S    SKVV  TTKP + LTPQ KSNLERFL+ATRPSVPAQYFSKTTMR WRTC+
Subjt:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN

Query:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH
        I+FQPYF+LNDLWESFKEWSAYGAGVPLVL+GGDSVVQYYVPYLS IQI+GE AA+RSDS  RLA EDSDLDSSRDTSSDGSI+Y+ GKS N SREQW H
Subjt:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH

Query:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT
         HLACEN  KMRKTSL+DE   +QEGF SDDGDAG P S LLFQFLEQDLPYQRVPLADKIF+LAYQ+PGLKTLRSCDI  ASWVSVAWYPIYRIPTGPT
Subjt:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTP + N H   PVM++  D+D I K+SLPVFGMASYKLKGSIW QNGIN+HQ ANSLMQAAD+WLR  QV+QPDFQFF+SHGTYW
Subjt:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW

Query:  R
        R
Subjt:  R

A0A6J1C0E1 uncharacterized protein LOC1110069252.3e-20588.28Show/hide
Query:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN
        MFGTALQFGGIKGEDRFYIPV+ARKNYNQQK SRRP K+DETESPSSD K+KVVASTTKPSKPLTPQ KSNLERFLDAT PSVPAQYFSKTTMRGWRTC+
Subjt:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN

Query:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH
        I+FQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLS IQI+GE +AMRSDSK RLA EDSDLDSSRDTSSDGSIEYE GK+   SREQWVH
Subjt:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH

Query:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT
          L CENTLKMR  S+ DEH  IQEGFSSDDGDAGNP SVLLFQF EQDLPYQRVPLADKIFDLAYQYPGLK+LRSCDIQ ASWVSVAWYPIYRIPTGPT
Subjt:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIR NGHGQAPVMI+ ND+DG+PKVSLPVFG+ASYKLKGSIWAQNG+ EHQMANSLMQAAD WLR  QV+QPDFQFFASHGTYW
Subjt:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW

Query:  R
        R
Subjt:  R

A0A6J1E577 uncharacterized protein LOC1114300505.8e-20487.53Show/hide
Query:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN
        M GTALQFGGIKGEDRFYIPVRARK+YNQQK SRRPTKTDETE+PSS+    VVASTT PSKPLTPQ KSNLERFLDAT+PSVPAQYFSKTTMRGWRTC+
Subjt:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN

Query:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH
        I+FQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLS IQI+GE AA+RSDSKSRLANEDSDLDSS+DTSS+GSI+YEFGKSCN SREQWVH
Subjt:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH

Query:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT
        HHLACE+ + MRKTSL DEHST QEGFSSDDGDA  P S LLFQFLEQDLPYQRVPLADKIFDLAYQ+PGLKTLRSCDI  ASW+SVAWYPIYRIPTGPT
Subjt:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIR NGHGQAP MI+ ND DGIPKVSLPVFG+ASYKLKGSIWAQN + EHQMANSLMQAA++WLRR QVNQPDFQFFASH TYW
Subjt:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW

Query:  R
        R
Subjt:  R

A0A6J1JE68 uncharacterized protein LOC1114849833.2e-20287.28Show/hide
Query:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN
        M GTALQFGGIKGEDRFYIPVRARK YNQQK SRRPTKTDETE+PS    SKVVASTT PSKPLTPQ KSNLERFLDAT+PSVPAQYFSKTTMRGWRTC+
Subjt:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN

Query:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH
        I+FQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLS IQI+GE AA+RSDSKSRLANEDSDLDSSRDTSS+GSI+YEFGKSCN SREQWVH
Subjt:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH

Query:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT
        HHLAC++ L +RKTSL DEHST QEGFSSDDGDA  P S LLFQFLEQDLPYQRVPLADKIFDLAYQ+PGLKTLRSCDI  ASW+SVAWYPIYRIPTGPT
Subjt:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIR NGHGQAP MI+ ND DGIPKVSLPVFG+ASYKLKGSIWAQN + E+QM NSLMQAA++WLRR QVNQPDFQFFASH TYW
Subjt:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW

Query:  R
        R
Subjt:  R

A0A6J1KHH0 uncharacterized protein LOC1114957872.4e-18983.04Show/hide
Query:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN
        M GTALQFGGIKGEDRFYIPVRARKNYNQ+  SRR TKTDETE+ S    +K+VAST   S PL  QPK+NLERFLDATRPSVPAQ+FSKT+MRGW +C 
Subjt:  MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCN

Query:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH
          FQPYF+LNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLS IQI+G+PA +RSDSKSR+ +EDSDLDSSRDTSSDGS EYEFG SCNFS+EQWVH
Subjt:  IDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVH

Query:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT
        H LACENTLKMRKTSLSDEHS I+E  SSDD     P SVLLFQFLE D PYQRVPLADKIFDLAYQYPGLKTLRSCDIQ+ASWVSVAWYPIYRIPTGPT
Subjt:  HHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPT

Query:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW
        LKDLDACFLTYHSLSTPIR N HGQ   +I+SNDVDG PK+SLPVFGMASYKLKGSIWAQNG+NEHQMA+SLMQAAD+WL+  QVNQPDFQFFASHGTYW
Subjt:  LKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYW

Query:  R
        R
Subjt:  R

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)4.4e-8753.82Show/hide
Query:  SNLERFLDATRPSVPAQYFSKTTMRGWRTCNIDFQ-PYFLLNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANE
        SN+ERFLD+  PSVPA Y SKT +R     +++ Q PYFLL D+WESF EWSAYG GVPL LN   D V QYYVP LS IQ++ +  A+ S  ++R   E
Subjt:  SNLERFLDATRPSVPAQYFSKTTMRGWRTCNIDFQ-PYFLLNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANE

Query:  DSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVHHHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQ
        +S+ D  RD+SS+GS   E  +   +S+EQ          + +M K SL  EH   QE  SSDDG+  +    L+F++LE+DLPY R P ADK+ DLA +
Subjt:  DSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVHHHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQ

Query:  YPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRDNGHGQAPVMIH-SNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEH
        +P LKTLRSCD+  +SW SVAWYPIY+IPTGPTLKDLDACFLTYHSL TP +  G G     +H     + + K+ LPVFG+ASYKL+GS+W   G + H
Subjt:  YPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRDNGHGQAPVMIH-SNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEH

Query:  QMANSLMQAADQWLRRCQVNQPDFQFF
        Q+ANSL QAAD WLR  QVN PDF FF
Subjt:  QMANSLMQAADQWLRRCQVNQPDFQFF

AT2G01260.1 Protein of unknown function (DUF789)7.4e-8747.99Show/hide
Query:  MFGTALQF-GGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTC
        M G   Q   G  G+D FY   + R+  NQ+    R  ++D +  PSS        S  K     +    SNL+RFL++  PSVPAQ+ SKT +R  R  
Subjt:  MFGTALQF-GGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTC

Query:  NIDFQ---PYFLLNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSR
        + D+    PYF+L D+W+SF EWSAYG GVPLVLN   D V+QYYVP LSAIQI+    A+ S  KSR   + SD D  RD+SSD S +         S 
Subjt:  NIDFQ---PYFLLNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSR

Query:  EQWVHHHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRI
         + V   + C         SL D+H   QE  SSDDG+       L+F++LE+DLPY R P ADK+ DLA Q+P L TLRSCD+  +SW SVAWYPIYRI
Subjt:  EQWVHHHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRI

Query:  PTGPTLKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFF
        PTGPTLKDLDACFLTYHSL T     G  Q+  +    + +   K+SLPVFG+ASYK +GS+W   G +EHQ+ NSL QAAD+WL  C V+ PDF FF
Subjt:  PTGPTLKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFF

AT2G01260.2 Protein of unknown function (DUF789)2.8e-6548.29Show/hide
Query:  MFGTALQF-GGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTC
        M G   Q   G  G+D FY   + R+  NQ+    R  ++D +  PSS        S  K     +    SNL+RFL++  PSVPAQ+ SKT +R  R  
Subjt:  MFGTALQF-GGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTC

Query:  NIDFQ---PYFLLNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSR
        + D+    PYF+L D+W+SF EWSAYG GVPLVLN   D V+QYYVP LSAIQI+    A+ S  KSR   + SD D  RD+SSD S +         S 
Subjt:  NIDFQ---PYFLLNDLWESFKEWSAYGAGVPLVLNGG-DSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSR

Query:  EQWVHHHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRI
         + V   + C         SL D+H   QE  SSDDG+       L+F++LE+DLPY R P ADK+ DLA Q+P L TLRSCD+  +SW SVAWYPIYRI
Subjt:  EQWVHHHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRI

Query:  PTGPTLKDLDACFLTYHSLST
        PTGPTLKDLDACFLTYHSL T
Subjt:  PTGPTLKDLDACFLTYHSLST

AT4G16100.1 Protein of unknown function (DUF789)4.2e-8243.88Show/hide
Query:  IKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSS----DTKSKV-----------------VASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFS
        I+GE+RFY P   RK   ++++ R   +  E E   +    D K KV                 V S    +   T    SNL RFLD T P V  Q+  
Subjt:  IKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSS----DTKSKV-----------------VASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFS

Query:  KTTMRGWRTCNIDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGK
         T+ +GWRT   +++PYFLLNDLW+SF+EWSAYG GVPL+LNG DSVVQYYVPYLS IQ++ +P+  R+ +  R   E+SD DS RD SSDGS       
Subjt:  KTTMRGWRTCNIDFQPYFLLNDLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGK

Query:  SCNFSREQWVHHHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDA-GNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVA
         C              E +  + + SL ++      G SSD+ +A  N P  L+F++LE  +P+ R PL DKI +L+ Q+P L+T RSCD+  +SWVSVA
Subjt:  SCNFSREQWVHHHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDA-GNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVA

Query:  WYPIYRIPTGPTLKDLDACFLTYHSLSTPIR--DNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWA-QNGINEHQMANSLMQAADQWLRRCQV
        WYPIYRIP G +L++LDACFLT+HSLSTP R   N  GQ+     S+      K+ LP FG+ASYK K S W+ ++ ++E+Q   +L++ A++WLRR +V
Subjt:  WYPIYRIPTGPTLKDLDACFLTYHSLSTPIR--DNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWA-QNGINEHQMANSLMQAADQWLRRCQV

Query:  NQPDFQFFASH-GTYWR
          PDF+ F SH G+ WR
Subjt:  NQPDFQFFASH-GTYWR

AT5G49220.1 Protein of unknown function (DUF789)9.8e-7139.27Show/hide
Query:  GTALQFGGIKGEDRFYIPVRARKNYNQ---QKQSRRPTKTDE--------------TESPS--------SDTKSKVVASTTKPSKPLTPQPK--------
        G ++    I+GE+RFY P   R+   +   Q+Q R   + D+              T +P         S++KS+VV S ++     +            
Subjt:  GTALQFGGIKGEDRFYIPVRARKNYNQ---QKQSRRPTKTDE--------------TESPS--------SDTKSKVVASTTKPSKPLTPQPK--------

Query:  -SNLERFLDATRPSVPAQYFSKTTMRGWRTCNIDFQPYFLLNDLWESFKEWSAYGAGV-----PLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSR
         SNL+RFL+ T P VPA+ F   +    +T   D   YF+L DLWESF EWSAYGAGV     PL ++G DS VQYYVPYLS IQ++ +P       K R
Subjt:  -SNLERFLDATRPSVPAQYFSKTTMRGWRTCNIDFQPYFLLNDLWESFKEWSAYGAGV-----PLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSR

Query:  LANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVHHHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFD
            D++  SS  +S+  ++  +                       ++ + SL D+  +I    SS + +  NP   LLF++LE + P+ R PLA+KI D
Subjt:  LANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVHHHLACENTLKMRKTSLSDEHSTIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFD

Query:  LAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGI
        LA + P L T RSCD+  +SWVSV+WYPIYRIP GPTL++LDACFLT+HSLST    +  G        +D     K+ LP FG+ASYKLK S+W QN I
Subjt:  LAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRDNGHGQAPVMIHSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGI

Query:  NEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYWR
         E Q   SL+QAAD+WL+R QV+ PD++FF S+    R
Subjt:  NEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYWR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGGAACTGCGTTGCAGTTTGGGGGAATCAAAGGTGAGGATCGGTTTTATATTCCGGTAAGGGCACGAAAGAATTATAATCAGCAAAAGCAGTCGAGGAGACCTAC
CAAGACCGATGAAACTGAGAGCCCATCTTCAGATACGAAGAGTAAAGTTGTGGCTTCTACTACAAAGCCTTCTAAGCCATTAACTCCTCAGCCTAAGAGCAACTTAGAGA
GATTCTTGGACGCCACAAGGCCTTCAGTTCCAGCGCAGTACTTCTCTAAGACAACTATGAGGGGTTGGAGGACTTGTAATATTGATTTTCAACCTTATTTCCTTCTGAAT
GATCTGTGGGAGTCTTTCAAGGAGTGGAGTGCATACGGTGCTGGAGTTCCTTTAGTGCTTAATGGAGGTGACTCCGTTGTTCAATATTACGTTCCATATTTGTCTGCTAT
CCAAATATTTGGTGAACCTGCTGCAATGAGATCAGATTCTAAGTCCAGGCTGGCTAATGAGGACAGTGATCTTGACTCTTCTAGAGATACAAGCAGCGATGGAAGCATTG
AATATGAATTTGGAAAAAGCTGTAATTTTTCTAGAGAACAGTGGGTTCATCACCATTTAGCTTGTGAAAACACTCTTAAAATGAGAAAGACGTCTTTAAGTGATGAACAT
AGCACGATACAAGAAGGTTTTTCGAGTGATGATGGGGATGCTGGAAATCCTCCAAGTGTTTTGCTCTTTCAGTTTCTTGAGCAAGATCTTCCTTATCAACGTGTACCATT
GGCTGACAAGATATTTGATCTTGCTTACCAATATCCTGGTTTGAAAACTTTAAGAAGTTGCGATATCCAGTCAGCCAGTTGGGTCTCTGTAGCATGGTACCCAATATACC
GTATACCCACTGGTCCGACATTAAAAGATTTGGATGCTTGCTTCTTGACATATCATTCCCTTTCCACACCCATTAGAGATAATGGACATGGTCAGGCACCGGTGATGATA
CATTCAAATGACGTTGATGGTATCCCAAAGGTATCCTTGCCTGTTTTTGGAATGGCTTCTTACAAGCTGAAAGGTTCGATTTGGGCGCAAAATGGCATCAACGAGCATCA
AATGGCGAATTCCCTCATGCAGGCAGCAGACCAATGGCTGAGGCGCTGTCAGGTCAATCAACCCGATTTTCAGTTCTTTGCATCGCATGGTACATACTGGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATTTACCCCAGGAAGTATAATTTCTAACAAAAGAACCCACAAAAGTTTGATTTAATTAATTAAACTGAATTTCTACATTTTTGGTATATCATTATTATGTCGTCATCTTC
TTTTATAACACACTCTGGTTCAAGCTTCTTCCTCTCTCTCTTTCTCTCTCTCTGCACTTTCCTTCTCGCCTGGTTTCTTAATTCACAAAAAACGCATTTGGGTCATTCAT
TGCGTGCATTCTTCAAATATTTGCTACGACGTTTTTGAATCGATTTCCAGTTCTCTTCTTCATTTAATCGATTTGAATCCCTCTGTTATTTGGGTTTCTCTCTGATTCCG
ATATTGTTTTGCGAACATCCTCCATTGAATTCAGTGTTCTCTAGACAAGATTTCGAATCTTTTCTTTTGCAAGAAGAAAGCAGAGTTTAAAGAATCTGCATCTCTTTTCT
TTTAAACGCCATTTCATCGGCGTTGTACTACTTGGGAGTTTAAAGAATTTCATACTTTCCTTTTCTTCTCGTCTTTTTGTATTCTGGTTGGTTGAGCTCATTTTTCCCAT
TTTAGAGAAAAGGTAGAGAACTTTGGGGTTCCCCATTATAATCTGTGCAATTTCCTCCGTTTCCTTTGAAACCAAACGGAGGATTCTTCTTTCATTTTTTTTGGTTTAGA
CATAGAAATGTTTGGAACTGCGTTGCAGTTTGGGGGAATCAAAGGTGAGGATCGGTTTTATATTCCGGTAAGGGCACGAAAGAATTATAATCAGCAAAAGCAGTCGAGGA
GACCTACCAAGACCGATGAAACTGAGAGCCCATCTTCAGATACGAAGAGTAAAGTTGTGGCTTCTACTACAAAGCCTTCTAAGCCATTAACTCCTCAGCCTAAGAGCAAC
TTAGAGAGATTCTTGGACGCCACAAGGCCTTCAGTTCCAGCGCAGTACTTCTCTAAGACAACTATGAGGGGTTGGAGGACTTGTAATATTGATTTTCAACCTTATTTCCT
TCTGAATGATCTGTGGGAGTCTTTCAAGGAGTGGAGTGCATACGGTGCTGGAGTTCCTTTAGTGCTTAATGGAGGTGACTCCGTTGTTCAATATTACGTTCCATATTTGT
CTGCTATCCAAATATTTGGTGAACCTGCTGCAATGAGATCAGATTCTAAGTCCAGGCTGGCTAATGAGGACAGTGATCTTGACTCTTCTAGAGATACAAGCAGCGATGGA
AGCATTGAATATGAATTTGGAAAAAGCTGTAATTTTTCTAGAGAACAGTGGGTTCATCACCATTTAGCTTGTGAAAACACTCTTAAAATGAGAAAGACGTCTTTAAGTGA
TGAACATAGCACGATACAAGAAGGTTTTTCGAGTGATGATGGGGATGCTGGAAATCCTCCAAGTGTTTTGCTCTTTCAGTTTCTTGAGCAAGATCTTCCTTATCAACGTG
TACCATTGGCTGACAAGATATTTGATCTTGCTTACCAATATCCTGGTTTGAAAACTTTAAGAAGTTGCGATATCCAGTCAGCCAGTTGGGTCTCTGTAGCATGGTACCCA
ATATACCGTATACCCACTGGTCCGACATTAAAAGATTTGGATGCTTGCTTCTTGACATATCATTCCCTTTCCACACCCATTAGAGATAATGGACATGGTCAGGCACCGGT
GATGATACATTCAAATGACGTTGATGGTATCCCAAAGGTATCCTTGCCTGTTTTTGGAATGGCTTCTTACAAGCTGAAAGGTTCGATTTGGGCGCAAAATGGCATCAACG
AGCATCAAATGGCGAATTCCCTCATGCAGGCAGCAGACCAATGGCTGAGGCGCTGTCAGGTCAATCAACCCGATTTTCAGTTCTTTGCATCGCATGGTACATACTGGAGA
TGACAAGGAATGACTCAAGATATGTCTATGCCCCTGCTTCCATGCTGCCCACTCAAAATACCAAAATAACGAACTCGTGCCTGTATCAACCATCAGACCTGCCCCTCAAT
ACTGGTAATGATACATTTTGATTTTTGGGTGGTTTGTTCTTTCTGGGAATACATTTCATAGAGGCAGTTAAGAGAAAAACATGTCTGAAATTGAAATGGGAGTAGTAGGA
GAACAATCAGGGCTCATGTGAAGTGGAATTTTAAGGGCAAAAGCAATATGGAAACTGATGGCAATCAAGATGAAGGTTGTCAATCATTTTACAGTTTGTGTCCTTGGAAA
CTTGTGAGAAGTTATTCTCTTTTTCTACACTCCCATCAAATTTAGAGTCTTTTGTCAAAACTGTATGTCAAGAAGTACAAATTTCTATGTGTATATTACGTATGTTTTGA
TGGTGACCCTATCTTTTTTAGATTCAAAGAAGTATAGATGAAAAAGATTAGATGGAATTTTATATTGAAAGGAGTCATATTATCTTTATAGCATTTTATATTTCAGATAT
ATGCGCCATTCTTGGATTGAACAGGAATCATATTCCCCATCCAATATATTGGTAAAAATTGC
Protein sequenceShow/hide protein sequence
MFGTALQFGGIKGEDRFYIPVRARKNYNQQKQSRRPTKTDETESPSSDTKSKVVASTTKPSKPLTPQPKSNLERFLDATRPSVPAQYFSKTTMRGWRTCNIDFQPYFLLN
DLWESFKEWSAYGAGVPLVLNGGDSVVQYYVPYLSAIQIFGEPAAMRSDSKSRLANEDSDLDSSRDTSSDGSIEYEFGKSCNFSREQWVHHHLACENTLKMRKTSLSDEH
STIQEGFSSDDGDAGNPPSVLLFQFLEQDLPYQRVPLADKIFDLAYQYPGLKTLRSCDIQSASWVSVAWYPIYRIPTGPTLKDLDACFLTYHSLSTPIRDNGHGQAPVMI
HSNDVDGIPKVSLPVFGMASYKLKGSIWAQNGINEHQMANSLMQAADQWLRRCQVNQPDFQFFASHGTYWR