; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS020799 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS020799
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionNeuronal PAS domain-containing protein 4
Genome locationscaffold487:606584..608431
RNA-Seq ExpressionMS020799
SyntenyMS020799
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6599961.1 hypothetical protein SDJN03_05194, partial [Cucurbita argyrosperma subsp. sororia]2.5e-24383.94Show/hide
Query:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLI-DEESISCLLLNFVHD
        M  C FPD YSWIQNLPPLSQWK TS STSICSSSS+NSSLN VA K+LHS T+T SVIAD S PISLW+S+PLK STKS++L+ D+ESIS LLLN + D
Subjt:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLI-DEESISCLLLNFVHD

Query:  VLHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILEL
        VLHYGS+QQKNFS  FL+LNITFN KEIFN+ FL L+FLICIYEAPT LR DCLTTLK+HL N QSRQ SKMLMKLLGSN+EEQWMRS+NLAITNWI+EL
Subjt:  VLHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILEL

Query:  KANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSK
        KANSC LKTPSPLFSYSFSTHGLWKVQLYCP+IAMD IENS NPSTDERLQ SLNYHQLEG+LQFNYKAEVREKW++LRVHVDNIRC+II LVND LLSK
Subjt:  KANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSK

Query:  RGVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGK
        RGVG SEK+FPSRISLQLTP LQTNIMSVSVSKSSDNP I+VGTEKT EAGFEP+  YPGLKLAVGE+VT SLKPWKFEQ VYGNT ILNWYLHDSSDGK
Subjt:  RGVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGK

Query:  EVASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP
        EVAS KPSKLALINPRAWFRDRYSSA RPFN+QGGVIFAGDEYGESVWWKI+  AR KT+EWEIRGWIW+TYWPNKHKTFYTETRRLEFKE LH+SIP
Subjt:  EVASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP

XP_022146787.1 uncharacterized protein LOC111015909 [Momordica charantia]6.1e-24282.49Show/hide
Query:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLIDEESISCLLLNFVHDV
        M SC F D YSWI+NLPPLSQWK TS S +IC+SSSTNSSL+ +A K+LHSPT+TFS++A  SFPISLWTSKPL ISTKS++L+DEESIS LLLN VHDV
Subjt:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLIDEESISCLLLNFVHDV

Query:  LHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILELK
        L+YGSNQQK+ +LNFL+ N+ FNSKE FNLAFLTL+FLICIYEAPT LRSDCLTTLKHHLANC+SRQ SK+LMKLLGSNLEEQWMRS+NL+ITNWI ELK
Subjt:  LHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILELK

Query:  ANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSKR
         NS T+KTPSPLFSYSFST GLWKVQLYCP+IAMDN+ENSSNPS DERLQFSLNYHQLEGVLQFN+KA VREKW DLRVHVDNIRCDII+LVN+TLLS+R
Subjt:  ANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSKR

Query:  GVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGKE
        G G SEKHFPSRISL++TP +QTNIMSVSVSKSSDNP IDVG EKTFEAGFEP    P LKLA+GE+V++SLKPWKFEQFV+GN A LNWYLHDSSDGKE
Subjt:  GVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGKE

Query:  VASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP
        VAST+PSKLALINP+AWFRDRYSSA+RPFNKQGGVIFAGDEYGESVWWKI+  ARGK MEWEIRGWIW+TYWPNKH+TFYTETRRLEFKE LHL IP
Subjt:  VASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP

XP_022146788.1 uncharacterized protein LOC111015910 [Momordica charantia]1.3e-28999.8Show/hide
Query:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLIDEESISCLLLNFVHDV
        MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLIDEESISCLLLNFVHDV
Subjt:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLIDEESISCLLLNFVHDV

Query:  LHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILELK
        LHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILELK
Subjt:  LHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILELK

Query:  ANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSKR
        ANSCTLKTPSPLFSYSFST GLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSKR
Subjt:  ANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSKR

Query:  GVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGKE
        GVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGKE
Subjt:  GVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGKE

Query:  VASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP
        VASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP
Subjt:  VASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP

XP_022941838.1 uncharacterized protein LOC111447080 [Cucurbita moschata]5.5e-24383.94Show/hide
Query:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLI-DEESISCLLLNFVHD
        M  C FPD YSWIQNLPPLSQWK TS STSICSSSS+NSSLN VA K+LHS T+T SVIAD S PISLW+S+PLK STKS++L+ D+ESIS LLLN + D
Subjt:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLI-DEESISCLLLNFVHD

Query:  VLHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILEL
        VLHYGS+QQKNFS  FL+LNITFN KEIFN+ FL L+FLICIYEAPT LR DCLTTLK+HL N QSRQ SKMLMKLLGSN+EEQWMRS+NLAITNWI+EL
Subjt:  VLHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILEL

Query:  KANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSK
        KANSC LKTPSPLFSYSFSTHGLWKVQLYCP+IAMD IENS NPSTDERLQ SLNYHQLEGVLQFNYKAEVREKW++LRVHVDNIRC+II LVND LLSK
Subjt:  KANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSK

Query:  RGVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGK
        RGVG SEK+FPSR+SLQLTP LQTNIMSVSVSKSSDNP I+VGTEKT EAGFEP+  YPGLKLAVGE+VT SLKPWKFEQ VYGNT ILNWYLHDSSDGK
Subjt:  RGVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGK

Query:  EVASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP
        EVAS KPSKLALINPRAWFRDRYSSA RPFN+QGGVIFAGDEYGESVWWKI+  AR KT+EWEIRGWIW+TYWPNKHKTFYTETRRLEFKE LH+SIP
Subjt:  EVASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP

XP_023544488.1 uncharacterized protein LOC111804047 [Cucurbita pepo subsp. pepo]9.5e-24383.94Show/hide
Query:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLI-DEESISCLLLNFVHD
        M  C FPD YSWIQNLPPLSQWK TS STSICSSSS+NSSLN VA K+LHS T+T SVIAD S PISLW+S+PLK STKS++L+ D+ESIS LLLN + D
Subjt:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLI-DEESISCLLLNFVHD

Query:  VLHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILEL
        VLHYGS+QQKNFS  FL+LNITFN KEIFN+ FL L+FLICIYEAPT LR DCLTTLK+HL N QSRQ SKMLMKLLGSN+EEQWMRS+NLAITNWI+EL
Subjt:  VLHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILEL

Query:  KANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSK
        KANSC LKTPSPLFSYSFSTHGLWKVQLYCP+IAMD IENS NPSTDERLQ SLNYHQLEGVLQFNYKAEVREKW++LRVHVDNIRC+II LVND LLSK
Subjt:  KANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSK

Query:  RGVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGK
        RGVG SEK+FPSRISLQLTP LQTNIMSVSVSKSSDNP I+VG EKT EAGFEP+  YPGLKLAVGE+VT SLKPWKFEQ VYGNT ILNWYLHDSSDGK
Subjt:  RGVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGK

Query:  EVASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP
        EVAS KPSKLALINPRAWFRDRYSSA RPFN+QGGVIFAGDEYGESVWWKI+  AR KT+EWEIRGWIW+TYWPNKHKTFYTETRRLEFKE LH+SIP
Subjt:  EVASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP

TrEMBL top hitse value%identityAlignment
A0A6J1CZH7 uncharacterized protein LOC1110159106.5e-29099.8Show/hide
Query:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLIDEESISCLLLNFVHDV
        MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLIDEESISCLLLNFVHDV
Subjt:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLIDEESISCLLLNFVHDV

Query:  LHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILELK
        LHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILELK
Subjt:  LHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILELK

Query:  ANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSKR
        ANSCTLKTPSPLFSYSFST GLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSKR
Subjt:  ANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSKR

Query:  GVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGKE
        GVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGKE
Subjt:  GVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGKE

Query:  VASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP
        VASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP
Subjt:  VASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP

A0A6J1D0J9 uncharacterized protein LOC1110159093.0e-24282.49Show/hide
Query:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLIDEESISCLLLNFVHDV
        M SC F D YSWI+NLPPLSQWK TS S +IC+SSSTNSSL+ +A K+LHSPT+TFS++A  SFPISLWTSKPL ISTKS++L+DEESIS LLLN VHDV
Subjt:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLIDEESISCLLLNFVHDV

Query:  LHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILELK
        L+YGSNQQK+ +LNFL+ N+ FNSKE FNLAFLTL+FLICIYEAPT LRSDCLTTLKHHLANC+SRQ SK+LMKLLGSNLEEQWMRS+NL+ITNWI ELK
Subjt:  LHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILELK

Query:  ANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSKR
         NS T+KTPSPLFSYSFST GLWKVQLYCP+IAMDN+ENSSNPS DERLQFSLNYHQLEGVLQFN+KA VREKW DLRVHVDNIRCDII+LVN+TLLS+R
Subjt:  ANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSKR

Query:  GVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGKE
        G G SEKHFPSRISL++TP +QTNIMSVSVSKSSDNP IDVG EKTFEAGFEP    P LKLA+GE+V++SLKPWKFEQFV+GN A LNWYLHDSSDGKE
Subjt:  GVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGKE

Query:  VASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP
        VAST+PSKLALINP+AWFRDRYSSA+RPFNKQGGVIFAGDEYGESVWWKI+  ARGK MEWEIRGWIW+TYWPNKH+TFYTETRRLEFKE LHL IP
Subjt:  VASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP

A0A6J1FT79 uncharacterized protein LOC1114470802.7e-24383.94Show/hide
Query:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLI-DEESISCLLLNFVHD
        M  C FPD YSWIQNLPPLSQWK TS STSICSSSS+NSSLN VA K+LHS T+T SVIAD S PISLW+S+PLK STKS++L+ D+ESIS LLLN + D
Subjt:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLI-DEESISCLLLNFVHD

Query:  VLHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILEL
        VLHYGS+QQKNFS  FL+LNITFN KEIFN+ FL L+FLICIYEAPT LR DCLTTLK+HL N QSRQ SKMLMKLLGSN+EEQWMRS+NLAITNWI+EL
Subjt:  VLHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILEL

Query:  KANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSK
        KANSC LKTPSPLFSYSFSTHGLWKVQLYCP+IAMD IENS NPSTDERLQ SLNYHQLEGVLQFNYKAEVREKW++LRVHVDNIRC+II LVND LLSK
Subjt:  KANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSK

Query:  RGVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGK
        RGVG SEK+FPSR+SLQLTP LQTNIMSVSVSKSSDNP I+VGTEKT EAGFEP+  YPGLKLAVGE+VT SLKPWKFEQ VYGNT ILNWYLHDSSDGK
Subjt:  RGVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGK

Query:  EVASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP
        EVAS KPSKLALINPRAWFRDRYSSA RPFN+QGGVIFAGDEYGESVWWKI+  AR KT+EWEIRGWIW+TYWPNKHKTFYTETRRLEFKE LH+SIP
Subjt:  EVASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP

A0A6J1FUY0 uncharacterized protein LOC1114470813.3e-24182.09Show/hide
Query:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLIDEESISCLLLNFVHDV
        M SC FPD YSWIQ LPPLSQWK +S STSIC+S+S +SSL  VA KNLHSPT+T S+IAD SFPISLWTSKPLK ST S++L DEE++S LLLN VHDV
Subjt:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLIDEESISCLLLNFVHDV

Query:  LHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILELK
        L+YGSNQ+KN S   L+L+IT +S+EIFNLAFLTLIFLICIYEAPT LRS+CL TLKHHLAN  SRQ SK+LMKLLGSNLEEQWMRS+NLAITNW+LELK
Subjt:  LHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILELK

Query:  ANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSKR
        AN  TLKTPSPL+SYSFSTHGLWKVQLYCP+IAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNY+  VR+KW+D+RVHVDNIRCDI++LVN+TLLS+R
Subjt:  ANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSKR

Query:  GVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGKE
        GVG SEKHFPSRISLQLTP   TNIMSVSVSKSS+NP I++GTE+TFEAGFEP+  YPGLKL+VGE+  VSLKPWKFEQFV+GN A LNWYLHDSSDGKE
Subjt:  GVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGKE

Query:  VASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP
        VASTKPSKL LINP+AWFRDRYSSAHRPFNKQGGVIFAGDEYGE+VWWKI+  ARGKTMEWEIRGWIW+TYWPNKHKTFYTET+RLEFKE LHLSIP
Subjt:  VASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP

A0A6J1JZ56 uncharacterized protein LOC1114910281.9e-24182.29Show/hide
Query:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLIDEESISCLLLNFVHDV
        M SC FPD YSWIQ LPPLSQWK +S STSIC+S+S +SSL  VA KNLHSPT+T S+IAD SFPISLWTSKPLK ST S++L DEE++S LLLN VHDV
Subjt:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLIDEESISCLLLNFVHDV

Query:  LHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILELK
        L+YGSNQ+KN S   L+L+IT +SK+IFNLAFLTLIFLICIYEAPT LRS+CL TLKHHLAN  SRQ SK+LMKLLGSNLEEQWMRS+NLAITNW+LELK
Subjt:  LHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILELK

Query:  ANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSKR
        AN  TLKTPSPL+SYSFSTHGLWKVQLYCP+IAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNY+  VR+KW+D+RVHVDNIRCDII+LVN+TLLS+R
Subjt:  ANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSKR

Query:  GVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGKE
        GVG SEKHFPSRISLQLTP   TNIMSVSVSKSS+NP I+VGTE+TFEAGFEP+  YPGLKL+VGE+  VSLKPWKFEQFV+GN A LNWYLHDSSDGKE
Subjt:  GVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGKE

Query:  VASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP
        VASTKPSKLALINP+AWFRDRYSSAHRPFNKQGGVIFAGDEYGE+VWWKI+  ARGKTMEWEI+GWIW+TYWPNKHKTFYTET+RLEFKE +HLSIP
Subjt:  VASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15020.1 unknown protein5.1e-4526.34Show/hide
Query:  DFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNL----HSPTLTFSVIADISFPISLWTSKPLKISTKSTSLIDEESISCLLLNFVHDVLHY
        D +SWI  LP   ++  + +      +     S+   A + L     S +LTF+V+A+     +L  S  + +S  +  L  E+    L+L  + +++  
Subjt:  DFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNL----HSPTLTFSVIADISFPISLWTSKPLKISTKSTSLIDEESISCLLLNFVHDVLHY

Query:  GSNQQKNFSLNFLELNI---------------TFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLAN---CQSRQTSKMLMKLLGSNLEEQWM
                   F +L I               +F+S  +FNL  LT +F +C+++AP+++ S     L     N   CQ     +  +  LG + E   +
Subjt:  GSNQQKNFSLNFLELNI---------------TFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLAN---CQSRQTSKMLMKLLGSNLEEQWM

Query:  RSVNLAITNWI---------LELKANSCTLKTPSPL-FSYSFSTHGLWKVQLYCPLIAMDNIENSSN---------PSTDER---LQFSLNYHQLEGVLQ
        R+ + A++ W+         L LK  S +L     L FSY+   HGLW ++ Y P+++M+   NSSN         P  + +   L+++L++ Q E ++Q
Subjt:  RSVNLAITNWI---------LELKANSCTLKTPSPL-FSYSFSTHGLWKVQLYCPLIAMDNIENSSN---------PSTDER---LQFSLNYHQLEGVLQ

Query:  FNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSKRGVG---------RSEKHFPSRISLQLTPIL-QTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEP
        F Y  +  E ++ +   VDNIR  + +L       K GVG           E++FPSR+ + L P L  +++  +S+ +S+ N   D+   +  +  F  
Subjt:  FNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSKRGVG---------RSEKHFPSRISLQLTPIL-QTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEP

Query:  AAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGKEVASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEED
            P +K     +    +K W+ EQ   GN A+ +  L+D   G+EV + KP            +         F K GG++F  DEYG+ V W++  +
Subjt:  AAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGKEVASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEED

Query:  ARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSI
          G  ++W + G IW+TYWPNK  T + ETR +E+ + + L +
Subjt:  ARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSI

AT2G40390.1 unknown protein2.6e-16657.52Show/hide
Query:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPT-LTFSVIADISFPISLWTSKPLK-ISTKSTSLIDEESISCLLLNFVH
        M SC  PD ++W+Q LPPLS WK    S  ICS +S++ SLNF  T+   SP   TFS++A+   PI+L+ SK  + IST ST+ ++E  IS LL+ FV 
Subjt:  MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPT-LTFSVIADISFPISLWTSKPLK-ISTKSTSLIDEESISCLLLNFVH

Query:  DVLHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILE
         VL+Y + ++   S+    L  T N K++FNLAF T +FLICIYEAPT LR+ CL T+K  L  C+SRQ SK+LM  LGSNLEEQWMRS+NLAITNWI+E
Subjt:  DVLHYGSNQQKNFSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILE

Query:  LKANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLS
        +KA    LK+PSPLFSY+FST GLWKV +YCP++AM+ +E+ ++   DERL FSLNYHQLEGV+Q N++  VREKW ++ V++DN+RCDII+LVN+ LLS
Subjt:  LKANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLS

Query:  KRGVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDG
        +RG+G  EKHFPSRISLQLTP  Q+NI+ VSV KSS+NP  +   EK  EA  +P   + GLK++  E+ T S+KPWKFE++V+G +A L W+LHD  DG
Subjt:  KRGVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDG

Query:  KEVASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP
        +EV+S+KPSK++++NPRAWF++RYSSA RPF KQGGV+FAGD YG+SV WK+++ A GK ME+E++G +W+TYWPNKH TFY++TR+LEFKE L+L++P
Subjt:  KEVASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP

AT5G64190.1 unknown protein3.2e-14853.97Show/hide
Query:  FPDFYSWIQNLPPLSQWKVTSTSTSICSSSS--TNSSLNFVATKNLHSPTLTFSVIADIS--FPISLWTSK-PLKISTKSTSLIDEESISCLLLNFVHDV
        FPD ++WIQN+P +++W+ TS    IC S+S   NS+LN  A K+     +TFS+I   +   P+ LWT+K  L I+  S +  DE +I  LL NFV  +
Subjt:  FPDFYSWIQNLPPLSQWKVTSTSTSICSSSS--TNSSLNFVATKNLHSPTLTFSVIADIS--FPISLWTSK-PLKISTKSTSLIDEESISCLLLNFVHDV

Query:  LHYGSNQQKNFSLNFLELNITFNS-------KEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAIT
        L Y SN     S N+  + I  +        K+I N   LTL F++C+YEAP  LR +CL TLK+HL  C +R+ +  LMKLLGSNLEEQWMR+VNLA T
Subjt:  LHYGSNQQKNFSLNFLELNITFNS-------KEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAIT

Query:  NWILELKANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVN
        NWI+E + +  T  T +PLFSY+ S +GLWKVQLYCP+ AM+ +E SSNP+ D RL FSL ++QLEGV+QFN+K  VR+ W+D+ V +DNIR D+I+LVN
Subjt:  NWILELKANSCTLKTPSPLFSYSFSTHGLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVN

Query:  DTLLSKRGVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLH
        + L+S+RG G  EKHFPSRISLQLTP LQT+ +SVSVSKSS+NP  +   E++ E  F+P  +  GL++A  E+ T+++ PWK EQ V G TA LNW L+
Subjt:  DTLLSKRGVGRSEKHFPSRISLQLTPILQTNIMSVSVSKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLH

Query:  DSS-DGKEVASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETL
        DSS  G+EV STKPS+ ++++PR+WF+DRY+ A+R F ++GGVIFAGDEYGESV WKI + A G TMEWEI+G+IW+TYWPNK+KTFY ETRRLEF + L
Subjt:  DSS-DGKEVASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGDEYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETL

Query:  HLSI
        +L+I
Subjt:  HLSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTTCTTGTTGCTTTCCTGATTTCTATTCCTGGATACAGAACCTACCACCACTTTCTCAATGGAAAGTAACTTCAACTTCTACATCCATATGCTCTTCAAGCTCAAC
CAACTCCTCTCTGAATTTTGTTGCTACCAAAAACCTTCATTCTCCAACCCTTACTTTCTCAGTTATTGCAGATATCAGCTTTCCTATATCCCTTTGGACATCAAAGCCCT
TGAAGATCAGCACCAAATCCACAAGTTTAATAGATGAAGAAAGCATATCCTGTCTCTTGCTTAACTTTGTTCATGATGTTCTTCATTATGGCTCAAACCAACAAAAGAAT
TTTAGCCTTAATTTCCTTGAACTCAACATCACTTTTAACTCGAAAGAAATCTTCAATCTCGCATTTCTTACCCTCATATTTCTGATCTGCATTTACGAAGCTCCAACCAA
ACTCCGTTCGGATTGTCTTACAACTCTCAAGCATCATTTGGCAAATTGCCAGTCCAGACAGACATCAAAAATGCTGATGAAACTGTTGGGATCTAATCTAGAGGAGCAAT
GGATGAGGTCCGTAAACCTTGCAATCACCAACTGGATATTGGAGCTCAAGGCCAACAGCTGCACCCTGAAAACACCCTCACCATTGTTCTCTTATTCATTTTCAACACAT
GGGTTGTGGAAAGTTCAACTCTATTGCCCTCTCATTGCAATGGATAATATTGAGAACTCAAGCAATCCCTCAACTGATGAAAGATTGCAATTCTCTTTAAATTATCACCA
GCTTGAAGGGGTTCTGCAGTTCAATTACAAGGCCGAGGTTCGAGAAAAGTGGGTTGATCTGAGGGTTCACGTTGATAACATAAGGTGCGACATCATCCAGCTTGTGAACG
ACACTCTCTTGTCCAAGCGAGGAGTCGGCAGATCAGAAAAGCACTTCCCATCACGAATCTCGCTGCAACTCACTCCAATTCTACAGACAAACATAATGAGCGTCTCAGTA
AGCAAATCATCGGACAACCCTACAATTGACGTGGGAACCGAAAAAACCTTCGAAGCCGGATTCGAACCGGCAGCAGCTTACCCAGGCCTGAAATTAGCAGTCGGAGAGAG
CGTAACGGTGAGCCTTAAGCCATGGAAGTTCGAGCAGTTCGTGTATGGCAACACGGCGATCCTGAACTGGTACCTCCACGACAGTTCAGACGGGAAAGAGGTGGCCTCCA
CCAAGCCATCGAAACTTGCGCTGATAAACCCAAGAGCTTGGTTCCGGGACCGATACTCGAGCGCTCACCGGCCGTTCAACAAACAGGGAGGGGTCATATTCGCCGGAGAT
GAGTACGGAGAGAGTGTTTGGTGGAAGATTGAGGAAGATGCGAGAGGGAAAACCATGGAATGGGAGATCAGAGGTTGGATTTGGGTAACTTATTGGCCAAACAAACACAA
AACATTCTACACTGAAACCAGGAGGCTGGAATTCAAGGAGACTCTCCATCTTTCAATTCCT
mRNA sequenceShow/hide mRNA sequence
ATGACTTCTTGTTGCTTTCCTGATTTCTATTCCTGGATACAGAACCTACCACCACTTTCTCAATGGAAAGTAACTTCAACTTCTACATCCATATGCTCTTCAAGCTCAAC
CAACTCCTCTCTGAATTTTGTTGCTACCAAAAACCTTCATTCTCCAACCCTTACTTTCTCAGTTATTGCAGATATCAGCTTTCCTATATCCCTTTGGACATCAAAGCCCT
TGAAGATCAGCACCAAATCCACAAGTTTAATAGATGAAGAAAGCATATCCTGTCTCTTGCTTAACTTTGTTCATGATGTTCTTCATTATGGCTCAAACCAACAAAAGAAT
TTTAGCCTTAATTTCCTTGAACTCAACATCACTTTTAACTCGAAAGAAATCTTCAATCTCGCATTTCTTACCCTCATATTTCTGATCTGCATTTACGAAGCTCCAACCAA
ACTCCGTTCGGATTGTCTTACAACTCTCAAGCATCATTTGGCAAATTGCCAGTCCAGACAGACATCAAAAATGCTGATGAAACTGTTGGGATCTAATCTAGAGGAGCAAT
GGATGAGGTCCGTAAACCTTGCAATCACCAACTGGATATTGGAGCTCAAGGCCAACAGCTGCACCCTGAAAACACCCTCACCATTGTTCTCTTATTCATTTTCAACACAT
GGGTTGTGGAAAGTTCAACTCTATTGCCCTCTCATTGCAATGGATAATATTGAGAACTCAAGCAATCCCTCAACTGATGAAAGATTGCAATTCTCTTTAAATTATCACCA
GCTTGAAGGGGTTCTGCAGTTCAATTACAAGGCCGAGGTTCGAGAAAAGTGGGTTGATCTGAGGGTTCACGTTGATAACATAAGGTGCGACATCATCCAGCTTGTGAACG
ACACTCTCTTGTCCAAGCGAGGAGTCGGCAGATCAGAAAAGCACTTCCCATCACGAATCTCGCTGCAACTCACTCCAATTCTACAGACAAACATAATGAGCGTCTCAGTA
AGCAAATCATCGGACAACCCTACAATTGACGTGGGAACCGAAAAAACCTTCGAAGCCGGATTCGAACCGGCAGCAGCTTACCCAGGCCTGAAATTAGCAGTCGGAGAGAG
CGTAACGGTGAGCCTTAAGCCATGGAAGTTCGAGCAGTTCGTGTATGGCAACACGGCGATCCTGAACTGGTACCTCCACGACAGTTCAGACGGGAAAGAGGTGGCCTCCA
CCAAGCCATCGAAACTTGCGCTGATAAACCCAAGAGCTTGGTTCCGGGACCGATACTCGAGCGCTCACCGGCCGTTCAACAAACAGGGAGGGGTCATATTCGCCGGAGAT
GAGTACGGAGAGAGTGTTTGGTGGAAGATTGAGGAAGATGCGAGAGGGAAAACCATGGAATGGGAGATCAGAGGTTGGATTTGGGTAACTTATTGGCCAAACAAACACAA
AACATTCTACACTGAAACCAGGAGGCTGGAATTCAAGGAGACTCTCCATCTTTCAATTCCT
Protein sequenceShow/hide protein sequence
MTSCCFPDFYSWIQNLPPLSQWKVTSTSTSICSSSSTNSSLNFVATKNLHSPTLTFSVIADISFPISLWTSKPLKISTKSTSLIDEESISCLLLNFVHDVLHYGSNQQKN
FSLNFLELNITFNSKEIFNLAFLTLIFLICIYEAPTKLRSDCLTTLKHHLANCQSRQTSKMLMKLLGSNLEEQWMRSVNLAITNWILELKANSCTLKTPSPLFSYSFSTH
GLWKVQLYCPLIAMDNIENSSNPSTDERLQFSLNYHQLEGVLQFNYKAEVREKWVDLRVHVDNIRCDIIQLVNDTLLSKRGVGRSEKHFPSRISLQLTPILQTNIMSVSV
SKSSDNPTIDVGTEKTFEAGFEPAAAYPGLKLAVGESVTVSLKPWKFEQFVYGNTAILNWYLHDSSDGKEVASTKPSKLALINPRAWFRDRYSSAHRPFNKQGGVIFAGD
EYGESVWWKIEEDARGKTMEWEIRGWIWVTYWPNKHKTFYTETRRLEFKETLHLSIP