CuGenDBv2

Lsi04G019920 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi04G019920
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Epstein-Barr nuclear antigen
Genome location: chr04:27024989..27030096
RNA-Seq Expression: Lsi04G019920
Synteny: Lsi04G019920
Gene Ontology terms: NA
InterPro domains: NA
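
The genome location field above follows a `chromosome:start..end` pattern. As a minimal sketch, the string can be split into its parts and the locus span computed. The function names `parse_location` and `span_length` are hypothetical, and the span assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr04:27024989..27030096")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption the locus spans 5108 bp, which is longer than the mRNA listed further down and would be consistent with the gene containing introns.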


Homology
GenBank top hits (e value, % identity, alignment)
KAG7024650.1 hypothetical protein SDJN02_13468 [Cucurbita argyrosperma subsp. argyrosperma]; e value 5.8e-247; % identity 87.65
Query:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN
        MSVERSFEAWEEVQRHGQDLADRLAQGFTGL+HSHIAPPSFPWPNPP+SKLFDLEFPGQSFGIKDYGLTAHN+GINGV+SIFDIG+RIGQAGADFGACLN
Subjt:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN

Query:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS
        GMVQQFFRQLPVPF QEEN+IASVRMDLDKSWQRDD+GVAVQG LGTLT+CL NSE  DKDAVSDGVVDDEASGFDLRAIGHLGRAQGT+NISSTYDSRS
Subjt:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS

Query:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRK------------------------------SSN
         DVESS+VARGDLWRVEASHGRTA GND SSLFLLQLGPVLFVRDSTLLLPVH+SKQHLLWYGYDRK                              SSN
Subjt:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRK------------------------------SSN

Query:  IISP-LPPSQSFVDLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQ
        I+ P LPPSQSFVDLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQ PGEMRFSFSCKNKWGTRITP+VQ+PDKSFTLDLAQSLAWKRSGLLVKPT+Q
Subjt:  IISP-LPPSQSFVDLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQ

Query:  CSVSPTFGGSNPGFRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIEC
        CS+S TFGGSNPGFRAEIVHSVKK LNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVD PLSNIRRTSFSVQINTGIEC
Subjt:  CSVSPTFGGSNPGFRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIEC

XP_008463765.1 PREDICTED: uncharacterized protein LOC103501831 isoform X1 [Cucumis melo]; e value 1.5e-250; % identity 90.27
Query:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN
        MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHI+ PSF WPNPP SKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN
Subjt:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN

Query:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS
        GMVQQFFRQLPVPFRQEENVIAS+RMD+DKSWQRDDMGVAVQGN GTL+ECLRNSEL DK  VSDG VDDEASGFDL+AIGHLGRAQGTINISSTYDSRS
Subjt:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS

Query:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSFV
        RDVESSLVARGDLWRVEASHGRTA GNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRK+                   +++   PP+ SFV
Subjt:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSFV

Query:  DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNPG
        DLQFPNGQLTYVSGEGLTTTAF+PFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITP+VQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCS+SPTFGGSNPG
Subjt:  DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNPG

Query:  FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIEC
        FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSG+VVRVDTPLSNIRRTSFSVQINTGIEC
Subjt:  FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIEC

XP_011654032.1 uncharacterized protein LOC101205592 [Cucumis sativus]; e value 4.9e-246; % identity 89.64
Query:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN
        MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHI+ PSF WPNPP SKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN
Subjt:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN

Query:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS
        GMVQQFFRQLPVPFRQEENVIAS+RMD+DKSWQRDDMGVAVQGN   + ECLRNSEL   D VSDGVVDDEASGFDL+AIGHLGRAQGTINISSTYDSRS
Subjt:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS

Query:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSFV
        RDVESSLVARGDLWRVEASHGRTA GNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRK+                   +++   PP+ SFV
Subjt:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSFV

Query:  DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNPG
        DLQFPNGQLTYVSGEGLTTTAF+PFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITP+VQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCS+SPTFGGSNPG
Subjt:  DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNPG

Query:  FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIEC
        FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSG+VVRVDTPLSNIRRTSFSVQINTGIEC
Subjt:  FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIEC

XP_022133983.1 uncharacterized protein LOC111006382 [Momordica charantia]; e value 8.4e-246; % identity 88.35
Query:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN
        MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPP+SKLFD+EFPGQSFGIKDYGLT HNSGINGVTSI DIGNRIGQAGADFGACLN
Subjt:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN

Query:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS
        GMVQQFFRQLPVPFRQEENVIAS+RMDLDKSWQRDDMGV VQGNLGTLTECLRNSEL D D VSDG+VDDE  GFDLRAIGHLGRAQGTINISSTYDSRS
Subjt:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS

Query:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSFV
        RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYD+K+                   +++   PP+ SFV
Subjt:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSFV

Query:  DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNPG
        DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQ PGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAW+RSG+LVKPT+QCS+SPTFGGSNPG
Subjt:  DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNPG

Query:  FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIE
        FRAEIVHSVKKHLN++CGCS IAHPSA+ASIS+GRSKWNGN+G+SGIVVR D PLSNIRRTSFSVQINTGIE
Subjt:  FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIE

XP_038898428.1 uncharacterized protein LOC120086070 [Benincasa hispida]; e value 2.9e-254; % identity 91.75
Query:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN
        MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSF WPNPP SKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN
Subjt:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN

Query:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS
        GMVQQFFRQLPVPFRQEENVIAS+RMDLDKSWQRDDMG+AVQGNLGTLTECLRNSEL DKD VSDG VDDEASGFDLRAIG+LGRAQGTINISSTYDSRS
Subjt:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS

Query:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSFV
        RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRK+                   +++   PP+ SFV
Subjt:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSFV

Query:  DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNPG
        DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEM+FSFSCKNKWGTRITP+VQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCS+SPTFGGSNPG
Subjt:  DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNPG

Query:  FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIEC
        FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIEC
Subjt:  FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIEC
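
The % identity values reported for each hit relate to the middle row of the alignment blocks, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. As a rough sketch, identity over one block can be estimated by counting letters in that row. The function name `percent_identity` is hypothetical, and the calculation assumes the conventional BLAST definition (identical columns divided by alignment length), which this page does not spell out.

```python
def percent_identity(query_row: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a residue letter.

    The midline is padded to the query row's length because trailing
    spaces are often lost when alignments are copied as text.
    """
    if not query_row:
        raise ValueError("empty alignment row")
    padded = midline.ljust(len(query_row))
    identical = sum(1 for ch in padded[: len(query_row)] if ch.isalpha())
    return 100.0 * identical / len(query_row)


# First ten columns of the AT1G53450.1 alignment shown on this page.
print(percent_identity("MSVERSFEAW", "MSVERS EAW"))
```

The reported figure for a hit covers the full alignment, so a single block will generally give a different number.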

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KZH1 Uncharacterized protein; e value 2.4e-246; % identity 89.64
Query:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN
        MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHI+ PSF WPNPP SKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN
Subjt:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN

Query:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS
        GMVQQFFRQLPVPFRQEENVIAS+RMD+DKSWQRDDMGVAVQGN   + ECLRNSEL   D VSDGVVDDEASGFDL+AIGHLGRAQGTINISSTYDSRS
Subjt:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS

Query:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSFV
        RDVESSLVARGDLWRVEASHGRTA GNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRK+                   +++   PP+ SFV
Subjt:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSFV

Query:  DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNPG
        DLQFPNGQLTYVSGEGLTTTAF+PFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITP+VQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCS+SPTFGGSNPG
Subjt:  DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNPG

Query:  FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIEC
        FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSG+VVRVDTPLSNIRRTSFSVQINTGIEC
Subjt:  FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIEC

A0A1S3CK01 uncharacterized protein LOC103501831 isoform X1; e value 7.1e-251; % identity 90.27
Query:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN
        MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHI+ PSF WPNPP SKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN
Subjt:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN

Query:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS
        GMVQQFFRQLPVPFRQEENVIAS+RMD+DKSWQRDDMGVAVQGN GTL+ECLRNSEL DK  VSDG VDDEASGFDL+AIGHLGRAQGTINISSTYDSRS
Subjt:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS

Query:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSFV
        RDVESSLVARGDLWRVEASHGRTA GNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRK+                   +++   PP+ SFV
Subjt:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSFV

Query:  DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNPG
        DLQFPNGQLTYVSGEGLTTTAF+PFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITP+VQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCS+SPTFGGSNPG
Subjt:  DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNPG

Query:  FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIEC
        FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSG+VVRVDTPLSNIRRTSFSVQINTGIEC
Subjt:  FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIEC

A0A5D3DW19 Uncharacterized protein; e value 7.1e-251; % identity 90.27
Query:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN
        MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHI+ PSF WPNPP SKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN
Subjt:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN

Query:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS
        GMVQQFFRQLPVPFRQEENVIAS+RMD+DKSWQRDDMGVAVQGN GTL+ECLRNSEL DK  VSDG VDDEASGFDL+AIGHLGRAQGTINISSTYDSRS
Subjt:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS

Query:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSFV
        RDVESSLVARGDLWRVEASHGRTA GNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRK+                   +++   PP+ SFV
Subjt:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSFV

Query:  DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNPG
        DLQFPNGQLTYVSGEGLTTTAF+PFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITP+VQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCS+SPTFGGSNPG
Subjt:  DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNPG

Query:  FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIEC
        FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSG+VVRVDTPLSNIRRTSFSVQINTGIEC
Subjt:  FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIEC

A0A6J1C0S0 uncharacterized protein LOC111006382; e value 4.1e-246; % identity 88.35
Query:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN
        MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPP+SKLFD+EFPGQSFGIKDYGLT HNSGINGVTSI DIGNRIGQAGADFGACLN
Subjt:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN

Query:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS
        GMVQQFFRQLPVPFRQEENVIAS+RMDLDKSWQRDDMGV VQGNLGTLTECLRNSEL D D VSDG+VDDE  GFDLRAIGHLGRAQGTINISSTYDSRS
Subjt:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS

Query:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSFV
        RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYD+K+                   +++   PP+ SFV
Subjt:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSFV

Query:  DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNPG
        DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQ PGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAW+RSG+LVKPT+QCS+SPTFGGSNPG
Subjt:  DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNPG

Query:  FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIE
        FRAEIVHSVKKHLN++CGCS IAHPSA+ASIS+GRSKWNGN+G+SGIVVR D PLSNIRRTSFSVQINTGIE
Subjt:  FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIE

A0A6J1IPL9 uncharacterized protein LOC111477055; e value 9.0e-246; % identity 88.79
Query:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN
        MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPP+SKLFDLEFPGQSFGIKDYGLTAHN+GINGV+SIFDIGNRIGQAGADFGACLN
Subjt:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN

Query:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS
        GMVQQFFRQLPVPF QEEN++ASVRMDLDKSWQRDD+GVAVQG LGTLT CL NSE  DKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS
Subjt:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRS

Query:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSFV
         DVESS+VARGDLWRVEASHGRTA GND SSLFLLQLGPVLFVRDSTLLLPVH+SKQHLLWYGYDRK+                   +++   PP+ SFV
Subjt:  RDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSFV

Query:  DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNPG
        DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQ PGEMRFSFSCKNKWGTRITP+VQ+PDKSFTLDLAQSLAWKRSGLLVKPT+QCS+S TFGGSNPG
Subjt:  DLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNPG

Query:  FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIEC
        FRAEIVHSVKK LNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVD PLSNIRRTSFSVQINTGIEC
Subjt:  FRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIEC

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G53450.1 unknown protein; e value 8.6e-156; % identity 58.99
Query:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN
        MSVERS EAWEEVQRHGQDLADRLAQGF GLI   I PPSFP      SKLFDLEF  Q FGI+D   + H   INGV++I DIGN+IGQAG DFG+ LN
Subjt:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN

Query:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVV-DDEASGFDLRAIGHLGRAQGTINISSTYDSR
         MVQQFFR+LPVPFR +ENV  S   D             V  +     +   NS     D  S G V +++ + FDLR IG   RA+GT+ +SS+Y++R
Subjt:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVV-DDEASGFDLRAIGHLGRAQGTINISSTYDSR

Query:  SRDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSF
        +  +E SL ARGDLWRVEAS   +   +D+SSLFLLQLGP+LF+RDSTLLLPVHLSKQHLLWYGYDRK                    +++   P   SF
Subjt:  SRDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSF

Query:  VDLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNP
        VDLQFPNGQLTYVSGEGLTT+ F+P CGGLLQAQGQ PG+MRFSFSCK+K GTRITPM+  PDKS  L ++Q+LAW+RSG+++KP +Q SV  TFGGSNP
Subjt:  VDLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNP

Query:  GFRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIE
        G + E++ S+  ++N++CGC+F AHPS FAS+S GRSKWNGN+G +GIVVR DTPL N+ R SFS+QIN   E
Subjt:  GFRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIE

AT1G53450.2 unknown protein; e value 8.6e-156; % identity 58.99
Query:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN
        MSVERS EAWEEVQRHGQDLADRLAQGF GLI   I PPSFP      SKLFDLEF  Q FGI+D   + H   INGV++I DIGN+IGQAG DFG+ LN
Subjt:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLN

Query:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVV-DDEASGFDLRAIGHLGRAQGTINISSTYDSR
         MVQQFFR+LPVPFR +ENV  S   D             V  +     +   NS     D  S G V +++ + FDLR IG   RA+GT+ +SS+Y++R
Subjt:  GMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVV-DDEASGFDLRAIGHLGRAQGTINISSTYDSR

Query:  SRDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSF
        +  +E SL ARGDLWRVEAS   +   +D+SSLFLLQLGP+LF+RDSTLLLPVHLSKQHLLWYGYDRK                    +++   P   SF
Subjt:  SRDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPPSQSF

Query:  VDLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNP
        VDLQFPNGQLTYVSGEGLTT+ F+P CGGLLQAQGQ PG+MRFSFSCK+K GTRITPM+  PDKS  L ++Q+LAW+RSG+++KP +Q SV  TFGGSNP
Subjt:  VDLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNP

Query:  GFRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIE
        G + E++ S+  ++N++CGC+F AHPS FAS+S GRSKWNGN+G +GIVVR DTPL N+ R SFS+QIN   E
Subjt:  GFRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIE

AT3G14830.1 unknown protein; e value 5.4e-166; % identity 62.47
Query:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWP----NPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFG
        MS+ERS EAWEEVQRHGQDLADRLAQGFTGLI  HI PPSFPWP    +  ++KLFDLEFP Q F +      + N  INGVT+I DIGN+IGQAG DFG
Subjt:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWP----NPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFG

Query:  ACLNGMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSEL-VDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISST
        A LN MVQQFFR+LP+PF  E+N    V +D DKS +     V  +G+LG  TE LR+S      D  S  + ++E +   LRA G LGR++GTI+ SS+
Subjt:  ACLNGMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSEL-VDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISST

Query:  YDSRSRDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPP
        YDSR+  +E SL ARGDLWRVEASH  +   + NSSLFLLQLGP+LF+RDSTLLLP+HLSKQHLLWYGYDRK                    +++S  P 
Subjt:  YDSRSRDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPP

Query:  SQSFVDLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFG
        + SF+DLQFPNGQLTYVSGEGLTT+AF+PFCGGLLQAQGQ PG+MRFS+SCKNK GTRITPMV  PDKSF LDL+Q LAW+RSGLL+KPT+Q SV PTFG
Subjt:  SQSFVDLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFG

Query:  GSNPGFRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIE
        GSNPG +AE++HS+   LNL+CG +  AHPSAFAS++ GRSKWNGN+G +GIVVR DTPL++I + SFS+Q+N   E
Subjt:  GSNPGFRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIE

AT3G14830.2 unknown protein; e value 5.4e-166; % identity 62.47
Query:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWP----NPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFG
        MS+ERS EAWEEVQRHGQDLADRLAQGFTGLI  HI PPSFPWP    +  ++KLFDLEFP Q F +      + N  INGVT+I DIGN+IGQAG DFG
Subjt:  MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWP----NPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFG

Query:  ACLNGMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSEL-VDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISST
        A LN MVQQFFR+LP+PF  E+N    V +D DKS +     V  +G+LG  TE LR+S      D  S  + ++E +   LRA G LGR++GTI+ SS+
Subjt:  ACLNGMVQQFFRQLPVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSEL-VDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISST

Query:  YDSRSRDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPP
        YDSR+  +E SL ARGDLWRVEASH  +   + NSSLFLLQLGP+LF+RDSTLLLP+HLSKQHLLWYGYDRK                    +++S  P 
Subjt:  YDSRSRDVESSLVARGDLWRVEASHGRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSS------------------NIISPLPP

Query:  SQSFVDLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFG
        + SF+DLQFPNGQLTYVSGEGLTT+AF+PFCGGLLQAQGQ PG+MRFS+SCKNK GTRITPMV  PDKSF LDL+Q LAW+RSGLL+KPT+Q SV PTFG
Subjt:  SQSFVDLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKNKWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFG

Query:  GSNPGFRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIE
        GSNPG +AE++HS+   LNL+CG +  AHPSAFAS++ GRSKWNGN+G +GIVVR DTPL++I + SFS+Q+N   E
Subjt:  GSNPGFRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNIRRTSFSVQINTGIE


Sequences
CDS sequence
ATGTCCGTAGAAAGATCCTTTGAAGCGTGGGAGGAGGTGCAGAGGCATGGTCAGGATTTGGCGGATCGTCTTGCTCAGGGTTTCACGGGACTCATTCATTCTCACATAGC
GCCTCCTTCTTTTCCCTGGCCCAATCCTCCCCAGTCTAAGCTCTTTGATCTTGAATTTCCGGGCCAAAGTTTTGGTATCAAGGATTATGGGTTGACTGCCCATAATTCTG
GAATTAATGGGGTTACATCCATTTTTGATATTGGTAATAGGATTGGACAAGCTGGCGCTGATTTTGGTGCTTGTTTGAATGGTATGGTACAACAATTTTTTAGACAGCTC
CCAGTTCCGTTTCGGCAAGAGGAGAATGTGATCGCATCGGTTAGGATGGATCTGGATAAGAGTTGGCAGAGGGATGATATGGGAGTCGCTGTTCAAGGGAATCTTGGAAC
ATTAACGGAGTGTTTGCGTAATTCTGAACTTGTTGACAAGGATGCTGTTTCAGATGGGGTGGTTGATGATGAAGCTTCTGGCTTTGATTTGAGAGCTATAGGACATCTGG
GTAGGGCACAGGGCACGATCAATATTTCTTCAACGTATGATAGTAGATCACGAGATGTGGAAAGTTCATTAGTTGCTAGAGGAGATTTATGGAGAGTAGAGGCATCACAT
GGCAGAACAGCAACTGGAAATGATAATTCATCTTTATTTCTGCTGCAGCTTGGGCCAGTACTATTTGTTCGTGATTCAACACTTCTTTTGCCTGTGCATTTATCAAAGCA
GCACTTGCTTTGGTACGGTTATGATAGGAAGTCTAGTAACATAATCTCTCCCCTCCCTCCCTCCCAGTCCTTTGTTGATTTGCAGTTCCCCAATGGGCAGTTGACTTATG
TTTCGGGTGAAGGTTTAACTACAACGGCCTTTTTGCCTTTTTGTGGAGGCCTCCTTCAAGCTCAAGGCCAATGTCCAGGAGAAATGAGATTCAGCTTCTCTTGCAAGAAT
AAGTGGGGAACACGAATAACACCAATGGTGCAATTGCCTGATAAATCATTTACTTTGGACCTTGCTCAATCATTGGCTTGGAAGAGATCAGGTCTTCTGGTGAAACCAAC
TCTCCAATGCAGTGTGAGTCCCACTTTTGGTGGAAGCAATCCTGGGTTTCGTGCTGAAATTGTTCATTCAGTGAAGAAACATCTCAATCTCATGTGCGGCTGTTCTTTCA
TTGCCCACCCTTCTGCATTTGCTTCAATTTCTATTGGCAGGTCGAAGTGGAACGGAAACGTAGGGAATTCAGGGATAGTTGTAAGAGTTGATACTCCACTCTCAAATATT
CGTAGAACTTCCTTCTCTGTTCAGATAAATACTGGGATTGAGTGTTGA
mRNA sequence
TATAAACAAACACACAGCTCCTTCAACCGCGTTGGCAAATTGGCGTTCAGAAGAAGAAGAACCCTTTCCCTTTCAAAATCCTCTCGTCTTCTTCATTTCTCATCTTTACC
TTCTTCACTTCCAACAATCCCTTTTTATGTTACAACCTGAAAATCAATTCTATTAGTCCACCCAATCCATCCACAACCTAATTTCATGCTTAATTTGTTTCTGAATTGAA
TTCTCGGACCCGCGATTCTCCTGCGTTCCTGTACCGATCTCGACCACCCTTCTTCAGTTCCAGTTTTGATTTGGTGCCTCTTTTGAATCTTATTGCTTTGAGGTTGTGGG
TGTGAGGTAAAGGTAGAAGGTTTGGATTCTTTTAACTGATGTCCGTAGAAAGATCCTTTGAAGCGTGGGAGGAGGTGCAGAGGCATGGTCAGGATTTGGCGGATCGTCTT
GCTCAGGGTTTCACGGGACTCATTCATTCTCACATAGCGCCTCCTTCTTTTCCCTGGCCCAATCCTCCCCAGTCTAAGCTCTTTGATCTTGAATTTCCGGGCCAAAGTTT
TGGTATCAAGGATTATGGGTTGACTGCCCATAATTCTGGAATTAATGGGGTTACATCCATTTTTGATATTGGTAATAGGATTGGACAAGCTGGCGCTGATTTTGGTGCTT
GTTTGAATGGTATGGTACAACAATTTTTTAGACAGCTCCCAGTTCCGTTTCGGCAAGAGGAGAATGTGATCGCATCGGTTAGGATGGATCTGGATAAGAGTTGGCAGAGG
GATGATATGGGAGTCGCTGTTCAAGGGAATCTTGGAACATTAACGGAGTGTTTGCGTAATTCTGAACTTGTTGACAAGGATGCTGTTTCAGATGGGGTGGTTGATGATGA
AGCTTCTGGCTTTGATTTGAGAGCTATAGGACATCTGGGTAGGGCACAGGGCACGATCAATATTTCTTCAACGTATGATAGTAGATCACGAGATGTGGAAAGTTCATTAG
TTGCTAGAGGAGATTTATGGAGAGTAGAGGCATCACATGGCAGAACAGCAACTGGAAATGATAATTCATCTTTATTTCTGCTGCAGCTTGGGCCAGTACTATTTGTTCGT
GATTCAACACTTCTTTTGCCTGTGCATTTATCAAAGCAGCACTTGCTTTGGTACGGTTATGATAGGAAGTCTAGTAACATAATCTCTCCCCTCCCTCCCTCCCAGTCCTT
TGTTGATTTGCAGTTCCCCAATGGGCAGTTGACTTATGTTTCGGGTGAAGGTTTAACTACAACGGCCTTTTTGCCTTTTTGTGGAGGCCTCCTTCAAGCTCAAGGCCAAT
GTCCAGGAGAAATGAGATTCAGCTTCTCTTGCAAGAATAAGTGGGGAACACGAATAACACCAATGGTGCAATTGCCTGATAAATCATTTACTTTGGACCTTGCTCAATCA
TTGGCTTGGAAGAGATCAGGTCTTCTGGTGAAACCAACTCTCCAATGCAGTGTGAGTCCCACTTTTGGTGGAAGCAATCCTGGGTTTCGTGCTGAAATTGTTCATTCAGT
GAAGAAACATCTCAATCTCATGTGCGGCTGTTCTTTCATTGCCCACCCTTCTGCATTTGCTTCAATTTCTATTGGCAGGTCGAAGTGGAACGGAAACGTAGGGAATTCAG
GGATAGTTGTAAGAGTTGATACTCCACTCTCAAATATTCGTAGAACTTCCTTCTCTGTTCAGATAAATACTGGGATTGAGTGTTGATCTTAAGAACCATGCGTTTTTGGT
AGTTTGCAAGTGTATATATAATCTCAAAACTGTTCTCTGTTACTTACACGATGGATTAATATTTATCAACTATGAATGAGGCCATCGTCTGTAAATTATGAGACGTCCTT
TCTCGTTTCTTCTCTTACTCTTGTTACAATATTCACTCTTAATTTCTATGTTTGATATTTAAAAAAGATGGATCCCATTAG
Protein sequence
MSVERSFEAWEEVQRHGQDLADRLAQGFTGLIHSHIAPPSFPWPNPPQSKLFDLEFPGQSFGIKDYGLTAHNSGINGVTSIFDIGNRIGQAGADFGACLNGMVQQFFRQL
PVPFRQEENVIASVRMDLDKSWQRDDMGVAVQGNLGTLTECLRNSELVDKDAVSDGVVDDEASGFDLRAIGHLGRAQGTINISSTYDSRSRDVESSLVARGDLWRVEASH
GRTATGNDNSSLFLLQLGPVLFVRDSTLLLPVHLSKQHLLWYGYDRKSSNIISPLPPSQSFVDLQFPNGQLTYVSGEGLTTTAFLPFCGGLLQAQGQCPGEMRFSFSCKN
KWGTRITPMVQLPDKSFTLDLAQSLAWKRSGLLVKPTLQCSVSPTFGGSNPGFRAEIVHSVKKHLNLMCGCSFIAHPSAFASISIGRSKWNGNVGNSGIVVRVDTPLSNI
RRTSFSVQINTGIEC
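
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal standard-library sketch, assuming the standard genetic code. The names `CODON_TABLE` and `translate` are hypothetical helpers and are not part of CuGenDB.

```python
BASES = "TCAG"
# Standard genetic code, with codons ordered T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 18 and last 18 nucleotides of the CDS listed above.
print(translate("ATGTCCGTAGAAAGATCC"))
print(translate("ACTGGGATTGAGTGTTGA"))
```

Passing the full CDS text (line breaks included) to `translate` should give a string that can be compared directly with the protein sequence listed above.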