CuGenDBv2

MC09g1068 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC09g1068
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Neuronal PAS domain-containing protein 4
Genome location: MC09:16545167..16551000
RNA-Seq Expression: MC09g1068
Synteny: MC09g1068
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
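The genome location in this record has the form `chromosome:start..end`. The following is a minimal sketch of how such a string could be split into its parts and the span length computed. It assumes the coordinates are 1-based and inclusive, which the page does not state; `Location` and `parse_location` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'chrom:start..end' coordinate string."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'MC09:16545167..16551000'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(chrom, start, end)


loc = parse_location("MC09:16545167..16551000")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span covers the full gene region, including untranslated regions and any introns, so it is expected to be longer than the CDS listed further down.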


Homology
GenBank top hits (e value, % identity, alignment)
XP_022146787.1 uncharacterized protein LOC111015909 [Momordica charantia]; e value: 0.0; % identity: 100
Query:  MEKTSLVSLRKEYKESSILLSNLHALMASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPL
        MEKTSLVSLRKEYKESSILLSNLHALMASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPL
Subjt:  MEKTSLVSLRKEYKESSILLSNLHALMASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPL

Query:  NISTKSSNLLDEESISSLLLNCVHDVLYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMK
        NISTKSSNLLDEESISSLLLNCVHDVLYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMK
Subjt:  NISTKSSNLLDEESISSLLLNCVHDVLYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMK

Query:  LLGSNLEEQWMRSLNLSITNWISELKVNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKW
        LLGSNLEEQWMRSLNLSITNWISELKVNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKW
Subjt:  LLGSNLEEQWMRSLNLSITNWISELKVNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKW

Query:  TDLRVHVDNIRCDIIRLVNETLLSERGAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEPPNPALKLAIGETVSMSLKPWK
        TDLRVHVDNIRCDIIRLVNETLLSERGAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEPPNPALKLAIGETVSMSLKPWK
Subjt:  TDLRVHVDNIRCDIIRLVNETLLSERGAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEPPNPALKLAIGETVSMSLKPWK

Query:  FEQFVHGNAATLNWYLHDSSDGKEVASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKH
        FEQFVHGNAATLNWYLHDSSDGKEVASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKH
Subjt:  FEQFVHGNAATLNWYLHDSSDGKEVASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKH

Query:  RTFYTETRRLEFKEILHLPIP
        RTFYTETRRLEFKEILHLPIP
Subjt:  RTFYTETRRLEFKEILHLPIP

XP_022146788.1 uncharacterized protein LOC111015910 [Momordica charantia]; e value: 1.26e-307; % identity: 82.49
Query:  MASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPLNISTKSSNLLDEESISSLLLNCVHDV
        M SC F D YSWI+NLPPLSQWK TS S +IC+SSSTNSSL+ +A K+LHSPT+TFS++A  SFPISLWTSKPL ISTKS++L+DEESIS LLLN VHDV
Subjt:  MASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPLNISTKSSNLLDEESISSLLLNCVHDV

Query:  LYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWISELK
        L+YGSNQQK+ +LNFL+ N+ FNSKE FNLAFLTL+FLICIYEAPT LRSDCLTTLKHHLANC+SRQ SK+LMKLLGSNLEEQWMRS+NL+ITNWI ELK
Subjt:  LYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWISELK

Query:  VNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLSER
         NS T+KTPSPLFSYSFST GLWKVQLYCP+IAMDN+ENSSNPS DERLQFSLNYHQLEGVLQFN+KA VREKW DLRVHVDNIRCDII+LVN+TLLS+R
Subjt:  VNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLSER

Query:  GAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEPPN--PALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE
        G G SEKHFPSRISL++TP +QTNIMSVSVSKSSDNP IDVG EKTFEAGFEP    P LKLA+GE+V++SLKPWKFEQFV+GN A LNWYLHDSSDGKE
Subjt:  GAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEPPN--PALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE

Query:  VASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPIP
        VAST+PSKLALINP+AWFRDRYSSA+RPFNKQGGVIFAGDEYGESVWWKI+  ARGK MEWEIRGWIW+TYWPNKH+TFYTETRRLEFKE LHL IP
Subjt:  VASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPIP

XP_022941840.1 uncharacterized protein LOC111447081 [Cucurbita moschata]; e value: 8.88e-308; % identity: 81.89
Query:  MASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPLNISTKSSNLLDEESISSLLLNCVHDV
        MASCSF DVYSWI+ LPPLSQWKT+SIS +IC S+S +SSL ++AAK+LHSPTIT SI+A FSFPISLWTSKPL  ST SSNL DEE++S+LLLNCVHDV
Subjt:  MASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPLNISTKSSNLLDEESISSLLLNCVHDV

Query:  LYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWISELK
        LYYGSNQ+K+S+   LK ++  +S+E FNLAFLTL+FLICIYEAPTDLRS+CL TLKHHLAN  SRQISKVLMKLLGSNLEEQWMRS+NL+ITNW+ ELK
Subjt:  LYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWISELK

Query:  VNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLSER
         N RT+KTPSPL+SYSFST GLWKVQLYCPIIAMDN+ENSSNPS DERLQFSLNYHQLEGVLQFN++ VVR+KW D+RVHVDNIRCDI+RLVNETLLSER
Subjt:  VNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLSER

Query:  GAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEP--PNPALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE
        G GGSEKHFPSRISL++TPT  TNIMSVSVSKSS+NP+I++G E+TFEAGFEP  P P LKL++GET  +SLKPWKFEQFVHGNAATLNWYLHDSSDGKE
Subjt:  GAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEP--PNPALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE

Query:  VASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPIP
        VAST+PSKL LINPKAWFRDRYSSA+RPFNKQGGVIFAGDEYGE+VWWKID KARGK MEWEIRGWIWLTYWPNKH+TFYTET+RLEFKEILHL IP
Subjt:  VASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPIP

XP_022995522.1 uncharacterized protein LOC111491028 [Cucurbita maxima]; e value: 6.26e-308; % identity: 82.09
Query:  MASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPLNISTKSSNLLDEESISSLLLNCVHDV
        MASCSF DVYSWI+ LPPLSQWKT+SIS +IC S+S +SSL ++AAK+LHSPTIT SI+A FSFPISLWTSKPL  ST SSNL DEE++S+LLLNCVHDV
Subjt:  MASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPLNISTKSSNLLDEESISSLLLNCVHDV

Query:  LYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWISELK
        LYYGSNQ+K+S+   LK ++  +SK+ FNLAFLTL+FLICIYEAPTDLRS+CL TLKHHLAN  SRQISKVLMKLLGSNLEEQWMRS+NL+ITNW+ ELK
Subjt:  LYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWISELK

Query:  VNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLSER
         N RT+KTPSPL+SYSFST GLWKVQLYCPIIAMDN+ENSSNPS DERLQFSLNYHQLEGVLQFN++ VVR+KW D+RVHVDNIRCDIIRLVNETLLSER
Subjt:  VNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLSER

Query:  GAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEP--PNPALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE
        G GGSEKHFPSRISL++TPT  TNIMSVSVSKSS+NP+I+VG E+TFEAGFEP  P P LKL++GET  +SLKPWKFEQFVHGNAATLNWYLHDSSDGKE
Subjt:  GAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEP--PNPALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE

Query:  VASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPIP
        VAST+PSKLALINPKAWFRDRYSSA+RPFNKQGGVIFAGDEYGE+VWWKID KARGK MEWEI+GWIWLTYWPNKH+TFYTET+RLEFKEI+HL IP
Subjt:  VASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPIP

XP_023544514.1 uncharacterized protein LOC111804061 [Cucurbita pepo subsp. pepo]; e value: 1.54e-308; % identity: 82.29
Query:  MASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPLNISTKSSNLLDEESISSLLLNCVHDV
        MASCSF DVYSWI+ LPPLSQWKTTSIS +IC S+S +SSL ++AAK+LHSPTIT SI+A FSFPISLWTSKPL  ST SSNL DEE++S+LLLNCVHDV
Subjt:  MASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPLNISTKSSNLLDEESISSLLLNCVHDV

Query:  LYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWISELK
        LYYGSNQ+K+S+   LK ++  +S+E FNLAFLTL+FLICIYEAPTDLRS+CL TLKHHLAN  SRQISKVLMKLLGSNLE+QWMRS+NL+ITNW+ ELK
Subjt:  LYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWISELK

Query:  VNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLSER
         N RT+KTPSPL+SYSFS+ GLWKVQLYCPIIAMDN+ENSSNPS DERLQFSLNYHQLEGVLQFN++ VVR+KW D+RVHVDNIRCDIIRLVNETLLSER
Subjt:  VNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLSER

Query:  GAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEP--PNPALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE
        G GGSEKHFPSRISL++TPT  TNIMSVSVSKSS+NP+I+VG E+TFEAGFEP  P P LKL++GET  +SLKPWKFEQFVHGNAATLNWYLHDSSDGKE
Subjt:  GAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEP--PNPALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE

Query:  VASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPIP
        VAST+PSKLALINPKAWFRDRYSSA+RPFNKQGGVIFAGDEYGE+VWWKID KARGK MEWEIRGWIWLTYWPNKH+TFYTET+RLEFKEILHL IP
Subjt:  VASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPIP

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CZH7 uncharacterized protein LOC111015910; e value: 6.11e-308; % identity: 82.49
Query:  MASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPLNISTKSSNLLDEESISSLLLNCVHDV
        M SC F D YSWI+NLPPLSQWK TS S +IC+SSSTNSSL+ +A K+LHSPT+TFS++A  SFPISLWTSKPL ISTKS++L+DEESIS LLLN VHDV
Subjt:  MASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPLNISTKSSNLLDEESISSLLLNCVHDV

Query:  LYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWISELK
        L+YGSNQQK+ +LNFL+ N+ FNSKE FNLAFLTL+FLICIYEAPT LRSDCLTTLKHHLANC+SRQ SK+LMKLLGSNLEEQWMRS+NL+ITNWI ELK
Subjt:  LYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWISELK

Query:  VNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLSER
         NS T+KTPSPLFSYSFST GLWKVQLYCP+IAMDN+ENSSNPS DERLQFSLNYHQLEGVLQFN+KA VREKW DLRVHVDNIRCDII+LVN+TLLS+R
Subjt:  VNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLSER

Query:  GAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEPPN--PALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE
        G G SEKHFPSRISL++TP +QTNIMSVSVSKSSDNP IDVG EKTFEAGFEP    P LKLA+GE+V++SLKPWKFEQFV+GN A LNWYLHDSSDGKE
Subjt:  GAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEPPN--PALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE

Query:  VASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPIP
        VAST+PSKLALINP+AWFRDRYSSA+RPFNKQGGVIFAGDEYGESVWWKI+  ARGK MEWEIRGWIW+TYWPNKH+TFYTETRRLEFKE LHL IP
Subjt:  VASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPIP

A0A6J1D0J9 uncharacterized protein LOC111015909; e value: 0.0; % identity: 100
Query:  MEKTSLVSLRKEYKESSILLSNLHALMASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPL
        MEKTSLVSLRKEYKESSILLSNLHALMASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPL
Subjt:  MEKTSLVSLRKEYKESSILLSNLHALMASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPL

Query:  NISTKSSNLLDEESISSLLLNCVHDVLYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMK
        NISTKSSNLLDEESISSLLLNCVHDVLYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMK
Subjt:  NISTKSSNLLDEESISSLLLNCVHDVLYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMK

Query:  LLGSNLEEQWMRSLNLSITNWISELKVNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKW
        LLGSNLEEQWMRSLNLSITNWISELKVNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKW
Subjt:  LLGSNLEEQWMRSLNLSITNWISELKVNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKW

Query:  TDLRVHVDNIRCDIIRLVNETLLSERGAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEPPNPALKLAIGETVSMSLKPWK
        TDLRVHVDNIRCDIIRLVNETLLSERGAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEPPNPALKLAIGETVSMSLKPWK
Subjt:  TDLRVHVDNIRCDIIRLVNETLLSERGAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEPPNPALKLAIGETVSMSLKPWK

Query:  FEQFVHGNAATLNWYLHDSSDGKEVASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKH
        FEQFVHGNAATLNWYLHDSSDGKEVASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKH
Subjt:  FEQFVHGNAATLNWYLHDSSDGKEVASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKH

Query:  RTFYTETRRLEFKEILHLPIP
        RTFYTETRRLEFKEILHLPIP
Subjt:  RTFYTETRRLEFKEILHLPIP

A0A6J1FUY0 uncharacterized protein LOC111447081; e value: 4.30e-308; % identity: 81.89
Query:  MASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPLNISTKSSNLLDEESISSLLLNCVHDV
        MASCSF DVYSWI+ LPPLSQWKT+SIS +IC S+S +SSL ++AAK+LHSPTIT SI+A FSFPISLWTSKPL  ST SSNL DEE++S+LLLNCVHDV
Subjt:  MASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPLNISTKSSNLLDEESISSLLLNCVHDV

Query:  LYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWISELK
        LYYGSNQ+K+S+   LK ++  +S+E FNLAFLTL+FLICIYEAPTDLRS+CL TLKHHLAN  SRQISKVLMKLLGSNLEEQWMRS+NL+ITNW+ ELK
Subjt:  LYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWISELK

Query:  VNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLSER
         N RT+KTPSPL+SYSFST GLWKVQLYCPIIAMDN+ENSSNPS DERLQFSLNYHQLEGVLQFN++ VVR+KW D+RVHVDNIRCDI+RLVNETLLSER
Subjt:  VNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLSER

Query:  GAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEP--PNPALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE
        G GGSEKHFPSRISL++TPT  TNIMSVSVSKSS+NP+I++G E+TFEAGFEP  P P LKL++GET  +SLKPWKFEQFVHGNAATLNWYLHDSSDGKE
Subjt:  GAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEP--PNPALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE

Query:  VASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPIP
        VAST+PSKL LINPKAWFRDRYSSA+RPFNKQGGVIFAGDEYGE+VWWKID KARGK MEWEIRGWIWLTYWPNKH+TFYTET+RLEFKEILHL IP
Subjt:  VASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPIP

A0A6J1JLL6 uncharacterized protein LOC111486987; e value: 4.10e-306; % identity: 82.04
Query:  LHALMASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPLNISTKSSNLLDEESISSLLLNC
        LH  MASCSF DVY WI+NLPPLSQWKTTSIS +IC+SSSTNSSL ++AAK+LHSPTITFS+ A FSF ISLWTS+PL  STK+SNLL++ES+S+LLLNC
Subjt:  LHALMASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPLNISTKSSNLLDEESISSLLLNC

Query:  VHDVLYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWI
        V DVLYYGSN +++S+ N LK ++  + KE FN  FLTL+FLICIYEAP DLRS+CL TLKHHLANC SRQ SKVLMKLLGSNLE+QWMRS+NL+ITNWI
Subjt:  VHDVLYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWI

Query:  SELKVNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETL
         ELK   RT+KTPSPLFSYSFST GLWKVQLYCPIIAMDN+ENSSNPS DERLQFSLNYHQLEGVLQFN++AV REKW DLRVHVDNIRCDIIRLV+ETL
Subjt:  SELKVNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETL

Query:  LSERGAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEP--PNPALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSS
        LSERG GGSEKHFPSRISL++TPT  TNIMSVSVSKSS NP++D+G EKTFEAGFE   P P LKLA+GETV +SLKPWKFEQFVHGNAATLNWYLHDSS
Subjt:  LSERGAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEP--PNPALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSS

Query:  DGKEVASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPI
        DGKEVAST+PSKL LINPKAWFRDRYSSA RPFNKQGGVIFAGDEYGESVWWKID KARGK MEWEIRGWIWLTYWPNKH+TFYTETRRLEFKEIL+L I
Subjt:  DGKEVASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPI

Query:  P
        P
Subjt:  P

A0A6J1JZ56 uncharacterized protein LOC111491028; e value: 3.03e-308; % identity: 82.09
Query:  MASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPLNISTKSSNLLDEESISSLLLNCVHDV
        MASCSF DVYSWI+ LPPLSQWKT+SIS +IC S+S +SSL ++AAK+LHSPTIT SI+A FSFPISLWTSKPL  ST SSNL DEE++S+LLLNCVHDV
Subjt:  MASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPLNISTKSSNLLDEESISSLLLNCVHDV

Query:  LYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWISELK
        LYYGSNQ+K+S+   LK ++  +SK+ FNLAFLTL+FLICIYEAPTDLRS+CL TLKHHLAN  SRQISKVLMKLLGSNLEEQWMRS+NL+ITNW+ ELK
Subjt:  LYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWISELK

Query:  VNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLSER
         N RT+KTPSPL+SYSFST GLWKVQLYCPIIAMDN+ENSSNPS DERLQFSLNYHQLEGVLQFN++ VVR+KW D+RVHVDNIRCDIIRLVNETLLSER
Subjt:  VNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLSER

Query:  GAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEP--PNPALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE
        G GGSEKHFPSRISL++TPT  TNIMSVSVSKSS+NP+I+VG E+TFEAGFEP  P P LKL++GET  +SLKPWKFEQFVHGNAATLNWYLHDSSDGKE
Subjt:  GAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEP--PNPALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSSDGKE

Query:  VASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPIP
        VAST+PSKLALINPKAWFRDRYSSA+RPFNKQGGVIFAGDEYGE+VWWKID KARGK MEWEI+GWIWLTYWPNKH+TFYTET+RLEFKEI+HL IP
Subjt:  VASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPIP

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G15020.1 unknown protein; e value: 9.5e-42; % identity: 24.08
Query:  DVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSL----HSPTITFSIVA-GFSF--PISLWTSKPLNISTKSSNLLDEESISSLLLNCVHDV
        D +SWI  LP   ++  +        +     S+ + A ++L     S ++TF++VA GF+     ++W S    +S+       E+    L+L  + ++
Subjt:  DVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSL----HSPTITFSIVA-GFSF--PISLWTSKPLNISTKSSNLLDEESISSLLLNCVHDV

Query:  LYYGSNQQKSSALNF-------------LKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLAN---CRSRQISKVLMKLLGSNLEEQW
        +         +   F             + S+   +    FNL  LT +F +C+++AP+++ S     L     N   C+   + +  +  LG + E   
Subjt:  LYYGSNQQKSSALNF-------------LKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLAN---CRSRQISKVLMKLLGSNLEEQW

Query:  MRSLNLSITNWISELKV----------NSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSN---------PSIDER---LQFSLNYHQLEGVL
        +R+ + +++ W+   ++          +S  +   S  FSY+    GLW ++ Y PI++M+   NSSN         P ++ +   L+++L++ Q E ++
Subjt:  MRSLNLSITNWISELKV----------NSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSN---------PSIDER---LQFSLNYHQLEGVL

Query:  QFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLSERGAG---------GSEKHFPSRISLEITPTM-QTNIMSVSVSKSSDNPRIDVGNEKTFEAGFE
        QF +     E +  +   VDNIR  + +L       + G G           E++FPSR+ + + P +  +++  +S+ +S+ N   D+   +  +  F 
Subjt:  QFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLSERGAG---------GSEKHFPSRISLEITPTM-QTNIMSVSVSKSSDNPRIDVGNEKTFEAGFE

Query:  PPN--PALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSSDGKEVASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDA
             P +K          +K W+ EQ   GNAA  +  L+D   G+EV + +P            +         F K GG++F  DEYG+ V W++  
Subjt:  PPN--PALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSSDGKEVASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDA

Query:  KARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPI
        +  G +++W + G IWLTYWPNK  T + ETR +E+ + + LP+
Subjt:  KARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPI

AT2G40390.1 unknown protein; e value: 3.4e-164; % identity: 57.31
Query:  MASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPT-ITFSIVAGFSFPISLWTSKPL-NISTKSSNLLDEESISSLLLNCVH
        MASC   D ++W++ LPPLS WK   +S+ IC+ +S++ SL+    ++  SP   TFSIVA F  PI+L+ SK    IST S+  L+E  IS+LL+  V 
Subjt:  MASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPT-ITFSIVAGFSFPISLWTSKPL-NISTKSSNLLDEESISSLLLNCVH

Query:  DVLYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWISE
         VL Y + ++ + ++         N K+ FNLAF T VFLICIYEAPT LR+ CL T+K  L  CRSRQ SK+LM  LGSNLEEQWMRSLNL+ITNWI E
Subjt:  DVLYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWISE

Query:  LKVNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLS
        +K   + +K+PSPLFSY+FST GLWKV +YCP++AM+ +E+ ++   DERL FSLNYHQLEGV+Q NH+  VREKW ++ V++DN+RCDIIRLVNE LLS
Subjt:  LKVNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLS

Query:  ERGAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEPPNP--ALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSSDG
        ERG G  EKHFPSRISL++TPT Q+NI+ VSV KSS+NP  +   EK  EA  +PPN    LK++  ET + S+KPWKFE++VHG +A L W+LHD  DG
Subjt:  ERGAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEPPNP--ALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSSDG

Query:  KEVASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPIP
        +EV+S++PSK++++NP+AWF++RYSSA+RPF KQGGV+FAGD YG+SV WK+D  A GK+ME+E++G +WLTYWPNKH TFY++TR+LEFKE+L+L +P
Subjt:  KEVASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPIP

AT5G64190.1 unknown protein; e value: 3.4e-148; % identity: 54.22
Query:  FTDVYSWIKNLPPLSQWKTTSISITICASSS--TNSSLDVIAAKSLHSPTITFSIV--AGFSFPISLWTSK-PLNISTKSSNLLDEESISSLLLNCVHDV
        F DV++WI+N+P +++W+TTS+   IC S+S   NS+L++ A KS     +TFSI+  +    P+ LWT+K  L+I+  S N  DE +I SLL N V  +
Subjt:  FTDVYSWIKNLPPLSQWKTTSISITICASSS--TNSSLDVIAAKSLHSPTITFSIV--AGFSFPISLWTSK-PLNISTKSSNLLDEESISSLLLNCVHDV

Query:  LYYGSNQQKSSALNFLKSNVA--FNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWISE
        L Y SN    S +    S+ +     K+  N   LTL F++C+YEAP  LR +CL TLK+HL  C +R+ +  LMKLLGSNLEEQWMR++NL+ TNWI E
Subjt:  LYYGSNQQKSSALNFLKSNVA--FNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITNWISE

Query:  LKVNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLS
         + +  T  T +PLFSY+ S  GLWKVQLYCP+ AM+ VE SSNP+ D RL FSL ++QLEGV+QFNHK VVR+ W D+ V +DNIR D+I+LVNE L+S
Subjt:  LKVNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLS

Query:  ERGAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEPPNP-ALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSS-DG
         RGAG  EKHFPSRISL++TPT+QT+ +SVSVSKSS+NP  +   E++ E  F+PPN   L++A  E  +M++ PWK EQ V G  A LNW L+DSS  G
Subjt:  ERGAGGSEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEPPNP-ALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSS-DG

Query:  KEVASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPI
        +EV ST+PS+ ++++P++WF+DRY+ AYR F ++GGVIFAGDEYGESV WKI   A G  MEWEI+G+IWLTYWPNK++TFY ETRRLEF ++L+L I
Subjt:  KEVASTRPSKLALINPKAWFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPI


Sequences
CDS sequence
ATGGAGAAAACCTCCTTAGTTTCCTTGCGGAAAGAATATAAAGAGAGCTCAATCTTGTTATCTAACTTACATGCTCTCATGGCTTCTTGCAGCTTCACTGATGTTTATTC
CTGGATAAAGAACCTGCCACCACTTTCTCAATGGAAAACAACTTCCATTTCTATAACCATATGCGCTTCAAGCTCAACCAACTCCTCTCTTGATGTTATTGCAGCCAAAA
GCCTTCATTCCCCAACCATTACTTTCTCAATTGTTGCAGGTTTCAGCTTTCCCATCTCCCTTTGGACATCAAAACCCTTGAATATCAGCACCAAATCCTCAAATTTATTA
GATGAAGAAAGCATATCCAGTCTCTTGCTTAATTGTGTTCATGATGTTCTTTATTATGGCTCAAACCAACAAAAGAGTTCGGCCCTCAACTTTCTCAAATCCAACGTCGC
TTTCAACTCCAAAGAAAACTTCAATCTCGCGTTTCTTACCCTCGTATTCCTAATCTGCATCTACGAAGCTCCGACCGATCTCCGTTCGGATTGTCTGACGACTCTCAAGC
ATCATCTGGCAAATTGTCGGTCAAGGCAGATATCAAAGGTGCTTATGAAACTGTTGGGGTCTAATCTAGAAGAGCAATGGATGAGGTCCCTAAACCTTTCAATCACCAAC
TGGATATCGGAGCTCAAGGTTAACAGCCGTACTATAAAAACACCCTCACCTTTGTTCTCGTATTCTTTTTCAACGGATGGTCTATGGAAAGTTCAACTCTATTGCCCTAT
CATTGCAATGGATAATGTTGAGAACTCGAGCAATCCTTCAATTGATGAAAGGTTGCAATTCTCTTTAAATTATCACCAGCTTGAAGGGGTTCTGCAGTTCAATCACAAGG
CTGTGGTTCGAGAAAAGTGGACTGATCTGAGGGTACACGTTGATAACATAAGGTGTGACATCATCCGGCTTGTAAACGAGACTCTCTTGTCTGAACGTGGAGCTGGCGGA
TCAGAAAAGCATTTTCCTTCACGGATCTCACTGGAAATAACTCCAACTATGCAGACAAACATAATGAGTGTCTCAGTAAGCAAATCCTCAGACAACCCAAGAATAGATGT
CGGAAATGAAAAAACCTTTGAAGCTGGATTTGAACCTCCAAACCCTGCCCTCAAATTAGCAATAGGAGAGACTGTGTCAATGAGCTTGAAGCCATGGAAATTTGAGCAGT
TCGTCCATGGCAATGCCGCAACCCTTAACTGGTATCTCCATGACAGTTCGGATGGGAAAGAGGTCGCCTCCACCAGGCCATCGAAACTTGCACTTATAAACCCTAAAGCT
TGGTTTCGCGACCGTTACTCGAGCGCTTACAGGCCTTTCAACAAACAGGGAGGGGTAATATTTGCAGGAGACGAGTACGGAGAGAGTGTGTGGTGGAAGATTGATGCAAA
GGCCAGAGGGAAAATCATGGAGTGGGAAATTAGAGGTTGGATTTGGTTAACTTACTGGCCAAACAAACACAGAACGTTTTACACTGAAACCAGAAGGCTGGAATTCAAAG
AGATTCTCCATCTTCCAATTCCTTAG
mRNA sequence
GGACTCGTTCAGTAACGCTCTCGTTTCTTACTTCTCGTTTCCAATTTATTAAGAAATCAAAATCTTTGAAAAACTATTTAAATTTCTTATTTCTCGTTTTTTAGAAACAA
TCTCAAAATCGTATAGAAATTTTGGAAACAAAAACAAGTTTCTTGTTTTTTATGTCCGTTTCCAATTTGTTCTAACATAAATAAAAATTTTGTTTTATCATTAATGAATC
CATAAATGTTGGGGGCCATGCAATTCTATTTTTTTTATTAAAAATTGAAAAACGGAAACGTTACTGAATGACCCGAACAAGTTCTAATATTTCCTCCCTTTAAAAGGATT
CTTAATGGTGTTGTAGGCTGCTGCTAGAGATGCTTAATATTATGTAGACTTGGACTTTTTAATCAAATATTATAGCCTTTGAGCGAAGAGCTCCCCCAATCAAAAGGCAT
CTATCTATCTATCATCTCTTTTGCTGCACAACCATCCAATCTTATCAAGTTATTTACCATTTCTCCCTTCTTTCTTGATTCCAATGGTCTCAACAATCACTTTTCCAATT
TACTTTTCCTCCCTGGAGAATATTTGCTTTACTTTTGCTTCCATTCATGAAAGAGTTCATAAATATTTTTCTGACGACGGGGAGTCGGAGCTTTTTTCCCCTCTACCCAG
AACCACCTACACCCGCCCAAACCCAAGATAAATCAAAGATTTTTTGTATTAAACTCACCCGAAAGTTCAAACTTGAGATCTTTAAACCAGCATGATCAAGAGACTCCAAG
TCCTTGCCAACATGGTCACTCCATTGGAGAGAGTTTATAAATATGGAGAAAACCTCCTTAGTTTCCTTGCGGAAAGAATATAAAGAGAGCTCAATCTTGTTATCTAACTT
ACATGCTCTCATGGCTTCTTGCAGCTTCACTGATGTTTATTCCTGGATAAAGAACCTGCCACCACTTTCTCAATGGAAAACAACTTCCATTTCTATAACCATATGCGCTT
CAAGCTCAACCAACTCCTCTCTTGATGTTATTGCAGCCAAAAGCCTTCATTCCCCAACCATTACTTTCTCAATTGTTGCAGGTTTCAGCTTTCCCATCTCCCTTTGGACA
TCAAAACCCTTGAATATCAGCACCAAATCCTCAAATTTATTAGATGAAGAAAGCATATCCAGTCTCTTGCTTAATTGTGTTCATGATGTTCTTTATTATGGCTCAAACCA
ACAAAAGAGTTCGGCCCTCAACTTTCTCAAATCCAACGTCGCTTTCAACTCCAAAGAAAACTTCAATCTCGCGTTTCTTACCCTCGTATTCCTAATCTGCATCTACGAAG
CTCCGACCGATCTCCGTTCGGATTGTCTGACGACTCTCAAGCATCATCTGGCAAATTGTCGGTCAAGGCAGATATCAAAGGTGCTTATGAAACTGTTGGGGTCTAATCTA
GAAGAGCAATGGATGAGGTCCCTAAACCTTTCAATCACCAACTGGATATCGGAGCTCAAGGTTAACAGCCGTACTATAAAAACACCCTCACCTTTGTTCTCGTATTCTTT
TTCAACGGATGGTCTATGGAAAGTTCAACTCTATTGCCCTATCATTGCAATGGATAATGTTGAGAACTCGAGCAATCCTTCAATTGATGAAAGGTTGCAATTCTCTTTAA
ATTATCACCAGCTTGAAGGGGTTCTGCAGTTCAATCACAAGGCTGTGGTTCGAGAAAAGTGGACTGATCTGAGGGTACACGTTGATAACATAAGGTGTGACATCATCCGG
CTTGTAAACGAGACTCTCTTGTCTGAACGTGGAGCTGGCGGATCAGAAAAGCATTTTCCTTCACGGATCTCACTGGAAATAACTCCAACTATGCAGACAAACATAATGAG
TGTCTCAGTAAGCAAATCCTCAGACAACCCAAGAATAGATGTCGGAAATGAAAAAACCTTTGAAGCTGGATTTGAACCTCCAAACCCTGCCCTCAAATTAGCAATAGGAG
AGACTGTGTCAATGAGCTTGAAGCCATGGAAATTTGAGCAGTTCGTCCATGGCAATGCCGCAACCCTTAACTGGTATCTCCATGACAGTTCGGATGGGAAAGAGGTCGCC
TCCACCAGGCCATCGAAACTTGCACTTATAAACCCTAAAGCTTGGTTTCGCGACCGTTACTCGAGCGCTTACAGGCCTTTCAACAAACAGGGAGGGGTAATATTTGCAGG
AGACGAGTACGGAGAGAGTGTGTGGTGGAAGATTGATGCAAAGGCCAGAGGGAAAATCATGGAGTGGGAAATTAGAGGTTGGATTTGGTTAACTTACTGGCCAAACAAAC
ACAGAACGTTTTACACTGAAACCAGAAGGCTGGAATTCAAAGAGATTCTCCATCTTCCAATTCCTTAGCTCATAAGAAATCATTAGAAATGGAGGGAGAATAACAACTTG
GGATCTCCTTTTTTATTTATTCATTTTCTTTCTCAGTAACAAACAAGTCGATCTCTACTTTGTATAATTTGACTGCATATGGGAATTGCATCGTCCATATAGATTAATAT
ATAATATATATATATAGCTGCTTCAATAACTGTGTTTACTTGTCGCCAACGATTTAACTTGGTAAGTCAATTTTCACAATTTTCTTAAACAAGAAACAAAGAGAATTCAA
AAGTTACAAACCGCAACTTTCTACCACAAACATAAAAATAGAAAGAAAAAGAAAAACCAACTAAAAACTAACACTTACATCCTTTAAAATTATAAGATACTAATATTTTT
CTGATAAAAACTAAATCAGAAGCCCCCTAAAAATTTAATTAAATTAAAAAAAAAACTTTTGAAGTCTCAATTTCATCCATGGAACTCTAAACTTAAGTAAATAAGATGAT
TAATAACCCTTCAAAGCTATTTTAGAAAATTACTCCAGGACTAAACAATAAATTATACAAAATTTAACCAACAAAGCATTCAAAATCAAATTAAAAAAAAAAAACCTAAA
GTCAACCATAAAAATACCTGAACATAACCTATTTCTTACTATGGCATAATCTAAGTTCAAGTGGGAAATATACATGTGCACAAATCAACAAAGGCTTGCAATATTTTTAT
TAATTTTGATAATCTTTTATTACATCTTTCGAAGTTTGTTAATGTTTTCCTTCCTTTTTTGAGGATTCTTTTTGGTGTTGTAGGCTGCAGCCAGGGAACTGGATGCTTCA
TATTATATAGAGAGTTGGACTCTTGATCAAAGATTATACGCTTTATATGAAGACTTCTCCCCAAGTAAAATGTGAGATATCTATCATCTCTTTTCTCTGCACAACCATCC
AATCTCATTAAGTTCTTTAACTTTTCTCTCTTCTTTCTTGATGCCAATGATCTCAACAATCAGTTTTTCAATTGACTTTTGCTTCCTTAAGAATCTTTCCTTTACTTTCC
TTCCATTTATGAAAGAGTCTATAAATATGGAAAAAACATCATTATTTGTCCTTGCAGAGAGAAAATAAAGAGAGTTCAATCTTGTTAACTAATTTGCATGCTCCTATGAC
TTCTTGTTGCTTTCCTGATTTCTATTCCTGGATACAGAACCTACCACCACTTTCTCAATGGAAAGTAACTTCAACTTCTACATCCATATGCTCTTCAAGCTCAACCAACT
CCTCTCTGAATTTTGTTGCTACCAAAAACCTTCATTCTCCAACCCTTACTTTCTCAGTTATTGCAGATATCAGCTTTCCTATATCCCTTTGGACATCAAAGCCCTTGAAG
ATCAGCACCAAATCCACAAGTTTAATAGATGAAGAAAGCATATCCTGTCTCTTGCTTAACTTTGTTCATGATGTTCTTCATTATGGCTCAAACCAACAAAAGAATTTTAG
CCTTAATTTCCTTGAACTCAACATCACTTTCAACTCGAAAGAAATCTTCAATCTCGCATTTCTTACCCTCATATTTCTGATCTGCATTTACGAAGCTCCAACCAAACTCC
GTTCGGATTGTCTTACAACTCTCAAGCATCATTTGGCAAATTGCCAGTCCAGACAGACATCAAAAATGCTGATGAAACTGTTGGGATCTAATCTAGAGGAGCAATGGATG
AGGTCCGTAAACCTTGCAATCACCAACTGGATATTGGAGCTCAAGGCCAACAGCTGCACCCTGAAAACACCCTCACCATTGTTCTCTTATTCATTTTCAACACGTGGGTT
GTGGAAAGTTCAACTCTATTGCCCTCTCATTGCAATGGATAATATTGAGAACTCAAGCAATCCCTCAACTGATGAAAGATTGCAATTCTCTTTAAATTATCACCAGCTTG
AAGGGGTTCTGCAGTTCAATTACAAGGCCGAGGTTCGAGAAAAGTGGGTTGATCTGAGGGTTCACGTTGATAACATAAGGTATACTAACTCTTGAAGCTACTTTTTGGGC
TGCTAAGCACAAATTTCTGCTATCTCTCTCTCTCTATCTCTCATTCTCTCTCTCAAGTAACGATGAAAACAGAGACAATTTCTCGGAAAATTGATTCCAAACTGATCTTC
CTAGCCTAGCCACTAGAAGTAAATGATATTAAGATTCTCACGTACCTTTAACAATTGAGTTTCTTCTTCCTGATACTAAAAGTAACGACTAAAACAGAGACAGTTTCTAG
AAAAAATTGATTCCAAACTGATCTTTCAAGCTATTAGAAGCAATTTATATTAAAGATCCCACGTATCCTTGACAATTGAGTTTAAAACGCACCTCGATTCTCACAGGTGC
GACATCATCCAGCTTGTGAACGACACTCTCTTGTCCAAGCGAGGAGTCGGCAGATCAGAAAAGCACTTCCCATCACGAATCTCGCTGCAACTCACTCCAATTCTACAGAC
AAACATAATGAGCGTCTCAGTAAGCAAATCATCGGATAACCCTACAATAGACGTGGGAACCGAAAAAACCTTCGAAGCCGGATTCGAACCGGCAGCAGCTTACCCAGGCC
TGAAATTAGCAGTCGGAGAGAGCGTAACGGTGAGCCTGAAGCCATGGAAGTTCGAGCAGTTCGTGTATGGCAACACGGCGATCCTGAACTGGTACCTCCACGACAGTTCA
GACGGGAAAGAGGTGGCCTCCACCAAGCCATCGAAACTTGCGCTGATAAACCCAAGAGCTTGGTTCCGGGACCGATACTCGAGCGCTCACCGGCCGTTCAACAAACAGGG
AGGGGTCATATTCGCCGGAGATGAGTACGGAGAGAGTGTTTGGTGGAAGATTGAGGAAGATGCGAGAGGGAAAACCATGGAATGGGAGATCAGAGGTTGGATTTGGGTAA
CTTATTGGCCAAACAAACACAAAACATTCTACACTGAAACCAGGAGGCTGGAATTCAAGGAGACTCTCCATCTTTCAATTCCTTAGCTCAGAAGAAATCACTGGAAATGG
AGGCACAACAAACTGTTTTTTTTTTTCTTTTCTTTTTTTCATGTTTAATAAAAATTTGCGTCGATAAATCAAAATAATTAATTATTTTATTACCTAATATTCATTTCCCC
Protein sequence
MEKTSLVSLRKEYKESSILLSNLHALMASCSFTDVYSWIKNLPPLSQWKTTSISITICASSSTNSSLDVIAAKSLHSPTITFSIVAGFSFPISLWTSKPLNISTKSSNLL
DEESISSLLLNCVHDVLYYGSNQQKSSALNFLKSNVAFNSKENFNLAFLTLVFLICIYEAPTDLRSDCLTTLKHHLANCRSRQISKVLMKLLGSNLEEQWMRSLNLSITN
WISELKVNSRTIKTPSPLFSYSFSTDGLWKVQLYCPIIAMDNVENSSNPSIDERLQFSLNYHQLEGVLQFNHKAVVREKWTDLRVHVDNIRCDIIRLVNETLLSERGAGG
SEKHFPSRISLEITPTMQTNIMSVSVSKSSDNPRIDVGNEKTFEAGFEPPNPALKLAIGETVSMSLKPWKFEQFVHGNAATLNWYLHDSSDGKEVASTRPSKLALINPKA
WFRDRYSSAYRPFNKQGGVIFAGDEYGESVWWKIDAKARGKIMEWEIRGWIWLTYWPNKHRTFYTETRRLEFKEILHLPIP
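
The protein sequence corresponds to the translation of the CDS sequence. As a rough consistency check, the sketch below translates short stretches copied from the start and end of the CDS above using the standard genetic code. It is an illustration only, not the pipeline used by the database; `translate` and `CODON_TABLE` are hypothetical names, and the standard nuclear code is an assumption.

```python
# Standard genetic code, built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 bases and last 12 bases of the CDS listed above.
print(translate("ATGGAGAAAACCTCCTTAGTTTCC"))  # MEKTSLVS
print(translate("CCAATTCCTTAG"))              # PIP
```

Both fragments agree with the beginning (`MEKTSLVS`) and end (`PIP`) of the protein sequence. Passing the full CDS, with its line breaks, to `translate` would allow the same comparison over the whole sequence.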