; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005424 (gene) of Snake gourd v1 genome

Gene IDTan0005424
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG05:81461365..81465832
RNA-Seq ExpressionTan0005424
SyntenyTan0005424
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573113.1 hypothetical protein SDJN03_27000, partial [Cucurbita argyrosperma subsp. sororia]6.6e-27286.96Show/hide
Query:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSARRGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAARS
        MASSAFKSTTKRTPIGAS+ASNDDS STNRSSIHRRSRSLSRFSHPLPSSP+DKGFGEGSARRGRFVNTSRGS FPEISLDDLAV+FFGSGDRGRSAARS
Subjt:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSARRGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAARS

Query:  SESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVSH
        SESS A SA  +S RRGRSVSRHGS KT+GGGS+GKG+ADY VG GK+VPESNSRRRRS+SVVRYQISDSESDLD+SQ SGTR++EKS+G GNKQKP+SH
Subjt:  SESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVSH

Query:  KADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
        KAD+SN RP LRRSLSQNDFK HDGYSSQSSVLTDDEGKDAYF NN  EKTI++I ARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
Subjt:  KADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS

Query:  SVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIED
        SV T SDDLHSSDSGV Q TSPFKRNYSTKQEQS+KRRDSL KMVL+EQR QELPK  KN  PDL NVVAENSSR RKRSNDRSRMSKRL+EEAEKYIED
Subjt:  SVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIED

Query:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSASS
        FISNVEDTDISSLDGDRSDTSSSLGGKTKPNF+IPA SK VPPGMDGVLLPWLQWET NDATPYPRKNTI P +TPQ FPWDVNQEASNAQD  NHS SS
Subjt:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSASS

Query:  QGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESRESR--FDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL
        QGSWSPGV   LSGKVVEDLGSRFKKTG YQ+QS+LESRESR  FDIDEYLKRPSNE+FLLERWKQQHK+N  GLLLC+ VFL
Subjt:  QGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESRESR--FDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL

XP_022137536.1 uncharacterized protein LOC111008961 [Momordica charantia]5.4e-27486.25Show/hide
Query:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSARRGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAARS
        MASSAFKSTTKRTPIGA  AS DDS STNRSSIHRRSRSLSRFSHP+PSSP+DK F E  ARRGRFVNTSRGSGFPEISLDDLAV+FFGSGDRGRSAAR+
Subjt:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSARRGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAARS

Query:  SESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVSH
        SESSGA + A ASHRRGRSVSRHG  KT+  GS+GKGR +Y VG  KVVPE+NSRRRRS+SVVRYQISDSESDLDRSQ+SGTRVKEKSFG GNKQKP+ H
Subjt:  SESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVSH

Query:  KADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
        KADDSN RPTLRRSLSQNDFK HDGYSSQSSVLTDDEGKDAYFGNNG+EKTI++IYARK KQ NGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
Subjt:  KADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS

Query:  SVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIED
        SV TFSDDL+S+DSGV Q TSPF RNYSTKQEQSEKRRDSL KMVLEEQRGQ+LPKM KNLP DLKNVVAENS R RKRSNDR+RMSKRLSEEAEKYIED
Subjt:  SVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIED

Query:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSASS
        FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFK+PA SK+VPPGMDGVLLPWLQWET NDA+ YPRKNT EPPMTPQTFPWDVNQE++N QDQ NHS SS
Subjt:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSASS

Query:  QGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESR-ESRFDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL
        QGSWSPGV  G+ GKVVEDLGSRFKK GNYQ+QS+LE R +SRFDI+EYLKRPS+EDFLLERWKQQH+ NCSGLLLC+RVFL
Subjt:  QGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESR-ESRFDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL

XP_022994153.1 uncharacterized protein LOC111489977 [Cucurbita maxima]5.8e-27687.82Show/hide
Query:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSARRGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAARS
        MASSAFKSTTKRTPIGAS++SNDDS STNRSSIHRRSRSLSRFSHPLPSSP+DKGFGEGSARRGRFVNTSRGSGFPEISLDDLAV+FFGSGDRGRSAARS
Subjt:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSARRGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAARS

Query:  SESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVSH
        SESS A SA  +S RRGRS+SRHGS KT+GGGSEG+G+ADY VG GK+VPESNSRRRRS+SVVRYQISDSESDLD+SQ SGT ++EKS+GTGNKQKP+SH
Subjt:  SESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVSH

Query:  KADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
        KAD+SN RP LRRSLSQNDFK HDGYSSQSSVLTDDEGKDAYFGNN  EKTI++I ARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
Subjt:  KADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS

Query:  SVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIED
        SV T SDDLHSSDSGV QHTSPFKRNYSTKQEQS+KRRDSL KMVL+EQR QELPK  KN  PDL NVVAENSSR RKRSNDRSRMSKRL+EEAEKYIED
Subjt:  SVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIED

Query:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSASS
        FISNVEDTDISSLDGDRSDTSSSLGGKTKPNF+IPAASK+VPPGMDGVLLPWLQWET NDATPYPRKNTI P MTPQ FPWDVNQEASNAQD  NHS SS
Subjt:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSASS

Query:  QGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESRESR--FDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL
        QGSWSPGV   LSGKVVEDLGSRFKKTG YQ+QS+LESRESR  FDIDEYLKRPSNEDFLLERWKQQHKIN  GLLLC+ VFL
Subjt:  QGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESRESR--FDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL

XP_023542617.1 uncharacterized protein LOC111802468 [Cucurbita pepo subsp. pepo]1.9e-27186.62Show/hide
Query:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSARRGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAARS
        MASSAFKSTTKRTPIGAS+ASNDDS STNRSSIHRRSRSLSRFSHPLPSSP+DKGFGEGSARRGRFVNTSRGSGFPEISLDDLA++FFGSGDRGRSAARS
Subjt:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSARRGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAARS

Query:  SESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVSH
        SESS A SA  +S RRGRSVSRHGS KT+GGGS+GKG+ADY VG GK+VPESNSRRRRS+SVVRYQISDSESDLD+SQ SGTR++EKS GTGNKQKP+SH
Subjt:  SESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVSH

Query:  KADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
        KAD+SN RP LRRSLSQNDFK HDGYSSQSSVLTDDEGKDAYFGNN  EKTI++I ARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
Subjt:  KADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS

Query:  SVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIED
        SV T SDDLHSSDSGV QH+SPFKRNYSTKQEQS+KRRDSL KMVL+EQR QELPK  KN  PDL NVVAENSSR RKRSNDRSRMSKRL+EEAEKYIED
Subjt:  SVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIED

Query:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSASS
        +ISNVEDTDISSLDGDRSDTSSSLGGKTKPNF+IPA SK VPPGMDGVLLPWLQWET NDATPYPRKNTI P +TPQ+FPWD NQEASNAQD  NHS SS
Subjt:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSASS

Query:  QGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESRESR--FDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL
        QGSWSPGV   LSGK VED+GSRFKKTG YQ+QS+LESRESR  FDIDEYLKRPSNE+FLLERWKQQHK+N  GLLLC+ VFL
Subjt:  QGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESRESR--FDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL

XP_038893835.1 uncharacterized protein LOC120082652 isoform X1 [Benincasa hispida]8.4e-28390.36Show/hide
Query:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSARRGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAARS
        MASSAFKSTTKRTPIGAS+ SNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGE SA RGRFVNTSRGSGFPEISLDDLAV+FFGSGDRGRSAARS
Subjt:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSARRGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAARS

Query:  SESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVSH
         ESSGA +AA AS+RRGRSVSRHGS KTNGGGSEGKGRA  GV  GKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKE SFG GNKQKP+SH
Subjt:  SESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVSH

Query:  KADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
        KADDSN RPTLRRSLSQNDFK HDGYSS SSVLTDDEGKDAYFGN+ +EKTI+SIYARKAKQ NG VVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
Subjt:  KADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS

Query:  SVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIED
        SVETFSDDLHSSDSGVRQHTSPF RNYSTKQEQSEKRR+SL KMV+EEQRGQELPKM KNLPPD+KN VAENSSRTRKRSNDRSRMSKRLSEEAEKYIED
Subjt:  SVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIED

Query:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSASS
        FISNVEDTDISSLDGDRSDTSSSLGGK KPNFKI AAS+ VPPGMDGVLLPWLQWET NDATPYPRKN  EPP TPQTFPWDVNQ+ SN QD CNHS SS
Subjt:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSASS

Query:  QGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESRESRFDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL
        QGSWSPGVT G+S KVVED+GSRFKK GNYQ+QS LESRESRFDIDEYLKRPSNE+FLLERWKQQHKINCSGLLLC+RVFL
Subjt:  QGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESRESRFDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL

TrEMBL top hitse value%identityAlignment
A0A5D3BJY3 Uncharacterized protein1.1e-26985.91Show/hide
Query:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSAR-RGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAAR
        MASSAFKSTTKRTPIGAS+ SNDDS STNR S HRRSRSLSRFSHPLPSSPIDK FGE SA  RGRFVNTSRGSGFPEISLDDLAV+FFGS DRGRS  R
Subjt:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSAR-RGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAAR

Query:  SSESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVS
        SSE SGA++++ AS+RRGRSVSRHG  KT+GGGSE KGR    V  GKVVPESNSRRRRSLSVVRYQISDSESD DRSQSSGTRV+EKSFG GNKQKP+S
Subjt:  SSESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVS

Query:  HKADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRN
        HKADDSN RPTLRRSLSQNDFK HDGYSS SSVLTDDEGKDA+FGN+ IEKT++SIYARKAKQ NG VVD+GLYEAMRKELRHAVEEIRVELEQEMVNRN
Subjt:  HKADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRN

Query:  SSVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIE
        SSVETFSDDLHSSDSGV  HTSPF RNYS KQEQSEKRRDSL KMV+E+QRGQ+LPKM KNLPPDLKNVVA+NSSR+RKRS DRSRMSKRLSEEAEKYIE
Subjt:  SSVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIE

Query:  DFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSAS
        DFISNVEDTDISSLDGDRSDTSSSLGGK KPNFKIPAA ++VPPGMDGVLLPWLQWET NDATPYPRKN  EPP TPQT PWDVNQ+ SNA D CNHS S
Subjt:  DFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSAS

Query:  SQGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESRESRFDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL
        SQGSWSPGVT GLSGKVVED+GSRFK+ GN Q QS+ ESRESRFDIDEYLKRPSNEDFLLERWKQQHKI CSG+LLC+RVFL
Subjt:  SQGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESRESRFDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL

A0A6J1C6X2 uncharacterized protein LOC1110089612.6e-27486.25Show/hide
Query:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSARRGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAARS
        MASSAFKSTTKRTPIGA  AS DDS STNRSSIHRRSRSLSRFSHP+PSSP+DK F E  ARRGRFVNTSRGSGFPEISLDDLAV+FFGSGDRGRSAAR+
Subjt:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSARRGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAARS

Query:  SESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVSH
        SESSGA + A ASHRRGRSVSRHG  KT+  GS+GKGR +Y VG  KVVPE+NSRRRRS+SVVRYQISDSESDLDRSQ+SGTRVKEKSFG GNKQKP+ H
Subjt:  SESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVSH

Query:  KADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
        KADDSN RPTLRRSLSQNDFK HDGYSSQSSVLTDDEGKDAYFGNNG+EKTI++IYARK KQ NGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
Subjt:  KADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS

Query:  SVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIED
        SV TFSDDL+S+DSGV Q TSPF RNYSTKQEQSEKRRDSL KMVLEEQRGQ+LPKM KNLP DLKNVVAENS R RKRSNDR+RMSKRLSEEAEKYIED
Subjt:  SVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIED

Query:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSASS
        FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFK+PA SK+VPPGMDGVLLPWLQWET NDA+ YPRKNT EPPMTPQTFPWDVNQE++N QDQ NHS SS
Subjt:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSASS

Query:  QGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESR-ESRFDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL
        QGSWSPGV  G+ GKVVEDLGSRFKK GNYQ+QS+LE R +SRFDI+EYLKRPS+EDFLLERWKQQH+ NCSGLLLC+RVFL
Subjt:  QGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESR-ESRFDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL

A0A6J1GUF6 uncharacterized protein LOC1114570348.8e-27086.62Show/hide
Query:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSARRGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAARS
        MASSAFKSTTKRTPIGAS+ASN+DS STNRSSIHRRSRSLSRFSHPLPSSP+DKGFGEGSARRGRFVNTSRGS FPEISLDDLAV+FFGSGDRGRSAARS
Subjt:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSARRGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAARS

Query:  SESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVSH
        SESS A SA  +S RRGRSVSRHGS KT+GGGS+GKG+ADY VG GK+VPESNSRRRRS+SVVRYQISDSESDLD+SQ SGTR++EKS+G GNKQKP+SH
Subjt:  SESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVSH

Query:  KADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
        KAD+SN RP LRRSLSQNDFK HDGYSSQSSVLTDDEGKDAYF NN  EKTI++I ARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
Subjt:  KADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS

Query:  SVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIED
        SV T SDDLHSSDSGV Q TSPFKRNYSTKQEQS+KRRDSL KMVL+EQR QELPK  KN  PDL NVVAENSSR RKRSNDRSRMSKRL+EEAEKYIED
Subjt:  SVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIED

Query:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSASS
        FISNVEDTDISSLDGDRSDTSSSLGGKTKPNF+IPA SK VPPGMDGVLLPWLQWET NDATPYPRKNTI P +TPQ FPWDVNQEASNAQD  NHS SS
Subjt:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSASS

Query:  QGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESRESR--FDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL
        QGSWSPGV   LSGK VEDLGSRFKKTG YQ+QS+LESRESR  FDIDEYLKRPSNE+FLLERWKQQHK+N  GLLLC+ VFL
Subjt:  QGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESRESR--FDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL

A0A6J1K4C9 uncharacterized protein LOC1114899772.8e-27687.82Show/hide
Query:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSARRGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAARS
        MASSAFKSTTKRTPIGAS++SNDDS STNRSSIHRRSRSLSRFSHPLPSSP+DKGFGEGSARRGRFVNTSRGSGFPEISLDDLAV+FFGSGDRGRSAARS
Subjt:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSARRGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAARS

Query:  SESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVSH
        SESS A SA  +S RRGRS+SRHGS KT+GGGSEG+G+ADY VG GK+VPESNSRRRRS+SVVRYQISDSESDLD+SQ SGT ++EKS+GTGNKQKP+SH
Subjt:  SESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVSH

Query:  KADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
        KAD+SN RP LRRSLSQNDFK HDGYSSQSSVLTDDEGKDAYFGNN  EKTI++I ARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS
Subjt:  KADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNS

Query:  SVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIED
        SV T SDDLHSSDSGV QHTSPFKRNYSTKQEQS+KRRDSL KMVL+EQR QELPK  KN  PDL NVVAENSSR RKRSNDRSRMSKRL+EEAEKYIED
Subjt:  SVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIED

Query:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSASS
        FISNVEDTDISSLDGDRSDTSSSLGGKTKPNF+IPAASK+VPPGMDGVLLPWLQWET NDATPYPRKNTI P MTPQ FPWDVNQEASNAQD  NHS SS
Subjt:  FISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSASS

Query:  QGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESRESR--FDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL
        QGSWSPGV   LSGKVVEDLGSRFKKTG YQ+QS+LESRESR  FDIDEYLKRPSNEDFLLERWKQQHKIN  GLLLC+ VFL
Subjt:  QGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESRESR--FDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL

E5GBV7 Uncharacterized protein1.1e-26985.91Show/hide
Query:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSAR-RGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAAR
        MASSAFKSTTKRTPIGAS+ SNDDS STNR S HRRSRSLSRFSHPLPSSPIDK FGE SA  RGRFVNTSRGSGFPEISLDDLAV+FFGS DRGRS  R
Subjt:  MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSAR-RGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAAR

Query:  SSESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVS
        SSE SGA++++ AS+RRGRSVSRHG  KT+GGGSE KGR    V  GKVVPESNSRRRRSLSVVRYQISDSESD DRSQSSGTRV+EKSFG GNKQKP+S
Subjt:  SSESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVS

Query:  HKADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRN
        HKADDSN RPTLRRSLSQNDFK HDGYSS SSVLTDDEGKDA+FGN+ IEKT++SIYARKAKQ NG VVD+GLYEAMRKELRHAVEEIRVELEQEMVNRN
Subjt:  HKADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRN

Query:  SSVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIE
        SSVETFSDDLHSSDSGV  HTSPF RNYS KQEQSEKRRDSL KMV+E+QRGQ+LPKM KNLPPDLKNVVA+NSSR+RKRS DRSRMSKRLSEEAEKYIE
Subjt:  SSVETFSDDLHSSDSGVRQHTSPFKRNYSTKQEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIE

Query:  DFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSAS
        DFISNVEDTDISSLDGDRSDTSSSLGGK KPNFKIPAA ++VPPGMDGVLLPWLQWET NDATPYPRKN  EPP TPQT PWDVNQ+ SNA D CNHS S
Subjt:  DFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSAS

Query:  SQGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESRESRFDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL
        SQGSWSPGVT GLSGKVVED+GSRFK+ GN Q QS+ ESRESRFDIDEYLKRPSNEDFLLERWKQQHKI CSG+LLC+RVFL
Subjt:  SQGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESRESRFDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHRVFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G50350.1 unknown protein3.0e-8138.98Show/hide
Query:  MASSAFKSTTKR-TPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSARRGRFVNTSRGSGFPEISLDDLAVDFF---------GS
        MA+SAF ST KR T +  S  S DDS+ + R S  RR RSLSRFSH +P   I+        R+G+FVNT RGSGF EISLDDLAV+FF          S
Subjt:  MASSAFKSTTKR-TPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSARRGRFVNTSRGSGFPEISLDDLAVDFF---------GS

Query:  GDRGRSAARSSESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGV-------------------GAGKVVPES-------------------
        G+RGRS  R+S   G     T S RRGRSVSR GS    GGG+ G  R D                      G+ KV   S                   
Subjt:  GDRGRSAARSSESSGAMSAATASHRRGRSVSRHGSVKTNGGGSEGKGRADYGV-------------------GAGKVVPES-------------------

Query:  --------------NSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVSHKADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEG
                      NSRRRRSLSVVR +I +SESD+D+ Q S +    KSF +G  Q   S K+  S+ R  LRRS SQN  K+HDGYSSQSS +TDDEG
Subjt:  --------------NSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVSHKADDSNCRPTLRRSLSQNDFKFHDGYSSQSSVLTDDEG

Query:  KDAYFGNNGIEKTIQSIYAR-KAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNSSVETFSDDLHSSDSGVRQHTSPFKRNYSTK-QEQSEK
        KD+    +G E+ I+++YA+ KA     + + N  Y + RK L                NR  S  T                    + Y+TK QE  E+
Subjt:  KDAYFGNNGIEKTIQSIYAR-KAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNSSVETFSDDLHSSDSGVRQHTSPFKRNYSTK-QEQSEK

Query:  RRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRS-RMSKRLSEEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIP
        +R+ L +++LEEQRG+EL    K +  +  +   E   RTRKRS DRS RMS  L++EAE++I++FISN+EDTD SSL+ +RS++SSS G     + +  
Subjt:  RRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRS-RMSKRLSEEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIP

Query:  AASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTP--QTFPW--DVNQEASNAQDQCNHSASSQGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQ
        +A K     MDGV+LPWLQWET + +        I+ P TP  ++  W  D  Q+AS+ +     + SS+GSWSP                       Y 
Subjt:  AASKHVPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTP--QTFPW--DVNQEASNAQDQCNHSASSQGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQ

Query:  SQSH--LESRESRFDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHR
        S +   + ++  + D+ EYLKRP++ D L E WK +H+I+   L+LC R
Subjt:  SQSH--LESRESRFDIDEYLKRPSNEDFLLERWKQQHKINCSGLLLCHR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCCTCTGCTTTCAAATCGACGACCAAACGGACGCCGATCGGAGCATCGCTCGCTTCAAACGATGACTCCGCTTCCACCAATCGGAGTTCGATTCACCGCCGTTC
TCGAAGTCTGAGCCGATTTTCGCACCCTCTGCCGTCGTCTCCCATTGATAAGGGCTTTGGTGAGGGTTCAGCCCGGCGAGGTAGGTTTGTCAACACGTCGAGAGGCTCGG
GATTCCCTGAGATTAGTCTCGACGATCTAGCGGTTGATTTCTTCGGTTCTGGTGATCGAGGACGCTCCGCTGCGCGGAGCTCTGAGTCTAGTGGTGCTATGAGCGCCGCC
ACGGCTTCGCATAGACGAGGGAGGTCGGTGTCCAGACACGGTAGCGTTAAAACTAATGGCGGTGGTAGCGAGGGCAAAGGAAGAGCCGATTATGGCGTTGGTGCGGGAAA
AGTGGTTCCTGAAAGTAATTCGAGAAGGAGACGCTCTCTCTCGGTGGTTCGCTACCAGATTAGCGATTCGGAGAGTGATCTTGATCGATCTCAGAGTTCTGGAACTCGTG
TCAAAGAAAAGAGCTTTGGTACTGGAAATAAGCAGAAGCCAGTATCCCACAAGGCTGATGATTCAAACTGTAGGCCAACATTGCGCAGATCTCTTAGCCAGAATGATTTC
AAGTTCCATGATGGCTATTCAAGCCAGTCTTCTGTTCTAACCGATGATGAAGGGAAGGATGCTTACTTCGGTAATAATGGAATTGAGAAGACCATTCAATCAATTTATGC
TAGAAAGGCAAAGCAGCTCAATGGGGATGTTGTTGACAATGGCTTGTATGAAGCAATGCGGAAAGAACTTAGACATGCTGTGGAAGAGATAAGGGTGGAACTTGAGCAGG
AAATGGTGAACAGAAATTCGTCTGTTGAAACTTTCAGTGATGATTTGCATTCGAGTGATTCTGGTGTTCGTCAGCATACATCTCCATTTAAAAGAAATTATTCAACAAAA
CAAGAACAGTCGGAGAAGCGCAGAGATTCATTGACTAAGATGGTGCTGGAGGAGCAACGTGGTCAAGAACTTCCAAAGATGGCTAAAAATTTGCCTCCTGATCTGAAGAA
TGTTGTTGCAGAGAACTCCTCACGAACCAGAAAGAGGAGCAATGACCGGAGTAGGATGTCTAAAAGATTGAGTGAAGAGGCAGAAAAATACATCGAGGACTTCATTTCCA
ATGTTGAAGATACAGATATTTCATCTCTTGATGGCGATAGGAGTGACACCAGTTCATCTTTAGGGGGTAAAACAAAACCAAATTTCAAAATTCCAGCAGCCAGCAAACAT
GTTCCTCCTGGAATGGATGGTGTCCTACTTCCATGGTTGCAATGGGAAACCTGTAACGATGCTACTCCTTACCCTCGAAAGAATACGATTGAACCACCTATGACTCCACA
AACTTTTCCATGGGATGTAAATCAGGAAGCAAGCAATGCTCAAGATCAATGCAACCATTCTGCCAGCAGCCAAGGGAGTTGGAGCCCCGGAGTTACCACTGGCCTCTCTG
GGAAAGTTGTTGAAGATTTAGGAAGTAGATTCAAGAAAACTGGTAACTATCAGAGTCAGTCCCATTTGGAATCAAGAGAATCTCGGTTCGATATCGACGAATATCTAAAA
CGTCCAAGCAATGAAGATTTCCTCCTAGAAAGATGGAAGCAGCAACACAAAATCAACTGCAGTGGTCTTTTGCTCTGTCATCGTGTATTTTTATAG
mRNA sequenceShow/hide mRNA sequence
CATACATTGGCCGAGTTTAGGGTAAAAATTAATTTTTTGTCCGTTTTAATTAAATTCATGCACCACCGTGGACCATACGACAGCCGCACGCACCGGCACAGCTACACCAC
ATTCAAATTTCATATAATTTTCGATCTCTCCGCCAAATCAACTTACTTAGCTCTTCTCACTCTCAGACCAAAGACTTCCAATGGAAATTTCTCTGACTGCTTAAAATGCC
GAATTTCAAATTTCAAAATCTTCAACTGTTTCTTTGATTTTGATCACAAATGGCGTCCTCTGCTTTCAAATCGACGACCAAACGGACGCCGATCGGAGCATCGCTCGCTT
CAAACGATGACTCCGCTTCCACCAATCGGAGTTCGATTCACCGCCGTTCTCGAAGTCTGAGCCGATTTTCGCACCCTCTGCCGTCGTCTCCCATTGATAAGGGCTTTGGT
GAGGGTTCAGCCCGGCGAGGTAGGTTTGTCAACACGTCGAGAGGCTCGGGATTCCCTGAGATTAGTCTCGACGATCTAGCGGTTGATTTCTTCGGTTCTGGTGATCGAGG
ACGCTCCGCTGCGCGGAGCTCTGAGTCTAGTGGTGCTATGAGCGCCGCCACGGCTTCGCATAGACGAGGGAGGTCGGTGTCCAGACACGGTAGCGTTAAAACTAATGGCG
GTGGTAGCGAGGGCAAAGGAAGAGCCGATTATGGCGTTGGTGCGGGAAAAGTGGTTCCTGAAAGTAATTCGAGAAGGAGACGCTCTCTCTCGGTGGTTCGCTACCAGATT
AGCGATTCGGAGAGTGATCTTGATCGATCTCAGAGTTCTGGAACTCGTGTCAAAGAAAAGAGCTTTGGTACTGGAAATAAGCAGAAGCCAGTATCCCACAAGGCTGATGA
TTCAAACTGTAGGCCAACATTGCGCAGATCTCTTAGCCAGAATGATTTCAAGTTCCATGATGGCTATTCAAGCCAGTCTTCTGTTCTAACCGATGATGAAGGGAAGGATG
CTTACTTCGGTAATAATGGAATTGAGAAGACCATTCAATCAATTTATGCTAGAAAGGCAAAGCAGCTCAATGGGGATGTTGTTGACAATGGCTTGTATGAAGCAATGCGG
AAAGAACTTAGACATGCTGTGGAAGAGATAAGGGTGGAACTTGAGCAGGAAATGGTGAACAGAAATTCGTCTGTTGAAACTTTCAGTGATGATTTGCATTCGAGTGATTC
TGGTGTTCGTCAGCATACATCTCCATTTAAAAGAAATTATTCAACAAAACAAGAACAGTCGGAGAAGCGCAGAGATTCATTGACTAAGATGGTGCTGGAGGAGCAACGTG
GTCAAGAACTTCCAAAGATGGCTAAAAATTTGCCTCCTGATCTGAAGAATGTTGTTGCAGAGAACTCCTCACGAACCAGAAAGAGGAGCAATGACCGGAGTAGGATGTCT
AAAAGATTGAGTGAAGAGGCAGAAAAATACATCGAGGACTTCATTTCCAATGTTGAAGATACAGATATTTCATCTCTTGATGGCGATAGGAGTGACACCAGTTCATCTTT
AGGGGGTAAAACAAAACCAAATTTCAAAATTCCAGCAGCCAGCAAACATGTTCCTCCTGGAATGGATGGTGTCCTACTTCCATGGTTGCAATGGGAAACCTGTAACGATG
CTACTCCTTACCCTCGAAAGAATACGATTGAACCACCTATGACTCCACAAACTTTTCCATGGGATGTAAATCAGGAAGCAAGCAATGCTCAAGATCAATGCAACCATTCT
GCCAGCAGCCAAGGGAGTTGGAGCCCCGGAGTTACCACTGGCCTCTCTGGGAAAGTTGTTGAAGATTTAGGAAGTAGATTCAAGAAAACTGGTAACTATCAGAGTCAGTC
CCATTTGGAATCAAGAGAATCTCGGTTCGATATCGACGAATATCTAAAACGTCCAAGCAATGAAGATTTCCTCCTAGAAAGATGGAAGCAGCAACACAAAATCAACTGCA
GTGGTCTTTTGCTCTGTCATCGTGTATTTTTATAGCTTTTTCTTCTCTTTCTCATATTTGTACTGTCCTCTTGCCCTCTATTCTTTCTGTTCTTCTTCATCTCTCTCTTG
GGTCTTGATGGGTTGTAAATTGGAATTTAGTATTTGGGTAAATTATGAAAAGGGTTCACCACATGTAGAATTATAGAAATTTATCATTTTTGTAATGGACTTGGTATTTT
GTCATAATGAGGCTTGTATCCACATAACAATTATTATGAAGTAACTATGTTGTTGAGATTTCATTGAATAATTTGTCAATAAATTAGAAAAAAAATTGAGTGA
Protein sequenceShow/hide protein sequence
MASSAFKSTTKRTPIGASLASNDDSASTNRSSIHRRSRSLSRFSHPLPSSPIDKGFGEGSARRGRFVNTSRGSGFPEISLDDLAVDFFGSGDRGRSAARSSESSGAMSAA
TASHRRGRSVSRHGSVKTNGGGSEGKGRADYGVGAGKVVPESNSRRRRSLSVVRYQISDSESDLDRSQSSGTRVKEKSFGTGNKQKPVSHKADDSNCRPTLRRSLSQNDF
KFHDGYSSQSSVLTDDEGKDAYFGNNGIEKTIQSIYARKAKQLNGDVVDNGLYEAMRKELRHAVEEIRVELEQEMVNRNSSVETFSDDLHSSDSGVRQHTSPFKRNYSTK
QEQSEKRRDSLTKMVLEEQRGQELPKMAKNLPPDLKNVVAENSSRTRKRSNDRSRMSKRLSEEAEKYIEDFISNVEDTDISSLDGDRSDTSSSLGGKTKPNFKIPAASKH
VPPGMDGVLLPWLQWETCNDATPYPRKNTIEPPMTPQTFPWDVNQEASNAQDQCNHSASSQGSWSPGVTTGLSGKVVEDLGSRFKKTGNYQSQSHLESRESRFDIDEYLK
RPSNEDFLLERWKQQHKINCSGLLLCHRVFL