; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC01g0120 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC01g0120
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionprotein KOKOPELLI isoform X1
Genome locationMC01:6298165..6304602
RNA-Seq ExpressionMC01g0120
SyntenyMC01g0120
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154937.1 protein KOKOPELLI isoform X1 [Momordica charantia]0.096.08Show/hide
Query:  MEVNELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGN
        MEVNELYLDLLALRELYILLLKSCLRDANSEL LDERAQILLKHLLDDATAEIVQFHSK                  TKPVEEKVAEWMEYNQSTRKTGN
Subjt:  MEVNELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGN

Query:  VAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGT
        VAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGT
Subjt:  VAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGT

Query:  QVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSS
        QVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSS
Subjt:  QVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSS

Query:  SHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLK
        SHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLK
Subjt:  SHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLK

Query:  KTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
        KTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
Subjt:  KTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV

XP_022154939.1 protein KOKOPELLI isoform X2 [Momordica charantia]0.096.28Show/hide
Query:  MEVNELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGNV
        MEVNELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDATAEIVQFHSK                  TKPVEEKVAEWMEYNQSTRKTGNV
Subjt:  MEVNELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGNV

Query:  AANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQ
        AANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQ
Subjt:  AANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQ

Query:  VSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSS
        VSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSS
Subjt:  VSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSS

Query:  HQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKK
        HQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKK
Subjt:  HQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKK

Query:  TAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
        TAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
Subjt:  TAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV

XP_022154940.1 uncharacterized protein LOC111022084 isoform X3 [Momordica charantia]7.43e-30195.6Show/hide
Query:  SELLDERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQ
        S+ LDERAQILLKHLLDDATAEIVQFHSK                  TKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQ
Subjt:  SELLDERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQ

Query:  SRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRS
        SRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRS
Subjt:  SRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRS

Query:  VNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRIS
        VNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRIS
Subjt:  VNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRIS

Query:  HGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKP
        HGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKP
Subjt:  HGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKP

Query:  TAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
        TAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
Subjt:  TAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV

XP_022154941.1 protein KOKOPELLI isoform X4 [Momordica charantia]1.31e-277100Show/hide
Query:  MEYNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQD
        MEYNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQD
Subjt:  MEYNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQD

Query:  NVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQT
        NVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQT
Subjt:  NVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQT

Query:  SETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKK
        SETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKK
Subjt:  SETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKK

Query:  RVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
        RVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
Subjt:  RVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV

XP_022958322.1 uncharacterized protein LOC111459571 isoform X2 [Cucurbita moschata]3.11e-15557.37Show/hide
Query:  MEVNELYLDLLALRELYILLLKSCLRDANSELL-DERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTR----
        ME +ELYLDLLALR+LY+ LLK CLRDANSEL+   RA+IL KHLLDDAT  +++FHSK    T    +FL  +   TKP++EKVAEWME+NQ+ R    
Subjt:  MEVNELYLDLLALRELYILLLKSCLRDANSELL-DERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTR----

Query:  -------------KTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRV
                        NVAANDLS+GI  ALRRIE HILSLQ YT      RSHI+  KL+       +    ++H  +K  VA     HCS+FVHGFR+
Subjt:  -------------KTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRV

Query:  PLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVR---SVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSS
        PL+QD  EAMK          Q+++  P  L+DKS C  GSKAT R    +NRT I E+R +N  G ++MRPTL  H KT +  QQESE+TNSESES  S
Subjt:  PLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVR---SVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSS

Query:  SSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRY--RSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQL
        SS AT+QTSE+ETT   SS   Q   PATGSE SS+    SS IS +AF+ SHGKK SKKA+GRFK LRNKLGLIFHHHHHH+H    N HN+  MWKQ+
Subjt:  SSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRY--RSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQL

Query:  RKIFHGTDKKRVTSKG-RHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGK-KGVKKLHWWRMFCRRRGVKLPNKGRVKIGYV
        R++FH T KK +TSK  ++  L+KT IRSVSR NQVG+FQALAEGLRSHVWK  AMKKKE R    GK  G KKLHWW+M  RRRGVKLPNKGRVKIGYV
Subjt:  RKIFHGTDKKRVTSKG-RHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGK-KGVKKLHWWRMFCRRRGVKLPNKGRVKIGYV

Query:  NRKPQHKIV
        N+KP  KI+
Subjt:  NRKPQHKIV

TrEMBL top hitse value%identityAlignment
A0A6J1DL21 uncharacterized protein LOC111022084 isoform X33.60e-30195.6Show/hide
Query:  SELLDERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQ
        S+ LDERAQILLKHLLDDATAEIVQFHSK                  TKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQ
Subjt:  SELLDERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQ

Query:  SRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRS
        SRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRS
Subjt:  SRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRS

Query:  VNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRIS
        VNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRIS
Subjt:  VNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRIS

Query:  HGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKP
        HGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKP
Subjt:  HGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKP

Query:  TAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
        TAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
Subjt:  TAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV

A0A6J1DLN1 protein KOKOPELLI isoform X10.096.08Show/hide
Query:  MEVNELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGN
        MEVNELYLDLLALRELYILLLKSCLRDANSEL LDERAQILLKHLLDDATAEIVQFHSK                  TKPVEEKVAEWMEYNQSTRKTGN
Subjt:  MEVNELYLDLLALRELYILLLKSCLRDANSEL-LDERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGN

Query:  VAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGT
        VAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGT
Subjt:  VAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGT

Query:  QVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSS
        QVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSS
Subjt:  QVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSS

Query:  SHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLK
        SHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLK
Subjt:  SHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLK

Query:  KTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
        KTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
Subjt:  KTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV

A0A6J1DNR3 protein KOKOPELLI isoform X20.096.28Show/hide
Query:  MEVNELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGNV
        MEVNELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDATAEIVQFHSK                  TKPVEEKVAEWMEYNQSTRKTGNV
Subjt:  MEVNELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGNV

Query:  AANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQ
        AANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQ
Subjt:  AANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQ

Query:  VSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSS
        VSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSS
Subjt:  VSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSS

Query:  HQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKK
        HQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKK
Subjt:  HQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKK

Query:  TAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
        TAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
Subjt:  TAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV

A0A6J1DQ76 protein KOKOPELLI isoform X46.36e-278100Show/hide
Query:  MEYNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQD
        MEYNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQD
Subjt:  MEYNQSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQD

Query:  NVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQT
        NVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQT
Subjt:  NVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQT

Query:  SETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKK
        SETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKK
Subjt:  SETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKK

Query:  RVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
        RVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV
Subjt:  RVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV

A0A6J1H2T7 uncharacterized protein LOC111459571 isoform X21.51e-15557.37Show/hide
Query:  MEVNELYLDLLALRELYILLLKSCLRDANSELL-DERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTR----
        ME +ELYLDLLALR+LY+ LLK CLRDANSEL+   RA+IL KHLLDDAT  +++FHSK    T    +FL  +   TKP++EKVAEWME+NQ+ R    
Subjt:  MEVNELYLDLLALRELYILLLKSCLRDANSELL-DERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTR----

Query:  -------------KTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRV
                        NVAANDLS+GI  ALRRIE HILSLQ YT      RSHI+  KL+       +    ++H  +K  VA     HCS+FVHGFR+
Subjt:  -------------KTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRV

Query:  PLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVR---SVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSS
        PL+QD  EAMK          Q+++  P  L+DKS C  GSKAT R    +NRT I E+R +N  G ++MRPTL  H KT +  QQESE+TNSESES  S
Subjt:  PLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVR---SVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSS

Query:  SSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRY--RSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQL
        SS AT+QTSE+ETT   SS   Q   PATGSE SS+    SS IS +AF+ SHGKK SKKA+GRFK LRNKLGLIFHHHHHH+H    N HN+  MWKQ+
Subjt:  SSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRY--RSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQL

Query:  RKIFHGTDKKRVTSKG-RHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGK-KGVKKLHWWRMFCRRRGVKLPNKGRVKIGYV
        R++FH T KK +TSK  ++  L+KT IRSVSR NQVG+FQALAEGLRSHVWK  AMKKKE R    GK  G KKLHWW+M  RRRGVKLPNKGRVKIGYV
Subjt:  RKIFHGTDKKRVTSKG-RHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGK-KGVKKLHWWRMFCRRRGVKLPNKGRVKIGYV

Query:  NRKPQHKIV
        N+KP  KI+
Subjt:  NRKPQHKIV

SwissProt top hitse value%identityAlignment
Q9FFP2 Protein KOKOPELLI4.0e-1526.32Show/hide
Query:  MEVNELYLDLLALRELYILLLKSCLRDANSEL--LDERAQILLKHLLDDATAE-------IVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYN
        ME N +  +L +LR LY LL+ +   +   E   LD+  Q LLK LLD A+ E       ++    +V  KT    +  +N    ++     +   +  +
Subjt:  MEVNELYLDLLALRELYILLLKSCLRDANSEL--LDERAQILLKHLLDDATAE-------IVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYN

Query:  QSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQK-VQSRMDHS----NLKARVAEPINGHCSEFVHGFRVPLSQ
          TR +  + A          L+R +  +  ++   S+   T   ++  +  ++ L     V SR+D      ++K+ V  P    C +      +P  +
Subjt:  QSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQK-VQSRMDHS----NLKARVAEPINGHCSEFVHGFRVPLSQ

Query:  DNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHM-------KTRMPTQQESEFTNSESE----
         N   +   +V +         + ++ +      +  +A V    +     R  Q +P   IM+PTL++          +     Q    T SESE    
Subjt:  DNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHM-------KTRMPTQQESEFTNSESE----

Query:  -----------SVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVS-SRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHH
                   S S S W TQ  ++TE+    S SS+      + SEVS S   + R +S+        K  +  +GRFKR++NK+G IFHHHHHHHHHH
Subjt:  -----------SVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVS-SRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHH

Query:  HHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKT-AIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRR--
        HH+       W +L+  FH   K +  SK R   + ++  + +  +++Q G F AL EGL  H       +K   ++    K   KK  WW++  +R+  
Subjt:  HHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKT-AIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRR--

Query:  GVKLPNKGRVKIG
        GVK+P +GRVK+G
Subjt:  GVKLPNKGRVKIG

Arabidopsis top hitse value%identityAlignment
AT5G63720.1 kokopelli2.9e-1626.32Show/hide
Query:  MEVNELYLDLLALRELYILLLKSCLRDANSEL--LDERAQILLKHLLDDATAE-------IVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYN
        ME N +  +L +LR LY LL+ +   +   E   LD+  Q LLK LLD A+ E       ++    +V  KT    +  +N    ++     +   +  +
Subjt:  MEVNELYLDLLALRELYILLLKSCLRDANSEL--LDERAQILLKHLLDDATAE-------IVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYN

Query:  QSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQK-VQSRMDHS----NLKARVAEPINGHCSEFVHGFRVPLSQ
          TR +  + A          L+R +  +  ++   S+   T   ++  +  ++ L     V SR+D      ++K+ V  P    C +      +P  +
Subjt:  QSTRKTGNVAANDLSNGIGLALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQK-VQSRMDHS----NLKARVAEPINGHCSEFVHGFRVPLSQ

Query:  DNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHM-------KTRMPTQQESEFTNSESE----
         N   +   +V +         + ++ +      +  +A V    +     R  Q +P   IM+PTL++          +     Q    T SESE    
Subjt:  DNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCSVGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHM-------KTRMPTQQESEFTNSESE----

Query:  -----------SVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVS-SRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHH
                   S S S W TQ  ++TE+    S SS+      + SEVS S   + R +S+        K  +  +GRFKR++NK+G IFHHHHHHHHHH
Subjt:  -----------SVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVS-SRYRSSRISSKAFRISHGKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHH

Query:  HHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKT-AIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRR--
        HH+       W +L+  FH   K +  SK R   + ++  + +  +++Q G F AL EGL  H       +K   ++    K   KK  WW++  +R+  
Subjt:  HHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKT-AIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKPRLGKKGVKKLHWWRMFCRRR--

Query:  GVKLPNKGRVKIG
        GVK+P +GRVK+G
Subjt:  GVKLPNKGRVKIG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTTAATGAGTTATATCTTGATCTCCTTGCACTGAGGGAATTATACATCCTCCTCTTAAAGAGCTGTTTGCGAGATGCAAATTCAGAACTTCTGGATGAAAGGGC
ACAAATTTTGTTGAAGCATTTGCTCGATGATGCTACTGCTGAAATTGTCCAGTTTCACTCGAAGGTTTGCATGAAAACTTCTTTCATTCTTTCTTTTTTAACAAATGAAC
CATTTGGCACAAAGCCAGTGGAAGAGAAAGTTGCTGAATGGATGGAATACAATCAAAGTACAAGAAAAACGGGAAATGTTGCTGCTAATGACTTATCAAATGGTATTGGT
TTAGCACTCAGAAGAATTGAATTCCACATTTTATCTCTGCAACATTATACAAGTCAAAGTAGGAACACAAGAAGCCATATCAATGGAGCTAAATTATCCAACTCTCCATT
AGATCAGCAGAAAGTTCAGTCAAGAATGGATCACTCAAATTTGAAGGCCAGAGTTGCTGAGCCAATCAATGGCCATTGTTCCGAGTTCGTTCACGGGTTTAGAGTACCTC
TGTCTCAAGACAATGTTGAGGCCATGAAACCTCCAAACGTTGGAACCCAGGTATCTAAACAAAACAAAGTTATAAATCCAGTGATTCTGATAGATAAATCTCGATGTTCA
GTGGGATCCAAGGCTACTGTACGGTCCGTGAATCGAACTCAGATACACGAAAGGCGGTGCCAAAATTTGCCTGGTCATATGATCATGAGGCCAACTTTGCTGAATCATAT
GAAGACTCGAATGCCCACTCAGCAAGAATCAGAATTTACAAACTCAGAATCAGAATCAGTTTCTTCTTCAAGTTGGGCAACTCAGCAGACAAGTGAAACTGAAACCACTG
ATTACCCTTCTTCCTCAAGTCACCAAGAGGATCAACCGGCAACCGGATCGGAGGTGAGTAGCCGGTACAGAAGCAGCAGAATTTCCTCAAAAGCATTTAGAATCAGCCAT
GGGAAAAAGGGGTCGAAGAAAGCGATCGGACGTTTCAAGAGACTGAGAAACAAACTAGGCCTTATCTTCCACCACCATCACCACCACCATCACCATCACCACCACAACAG
CCATAATAACTTCTTCATGTGGAAGCAGCTAAGAAAGATCTTCCATGGCACAGACAAGAAAAGAGTAACAAGTAAAGGACGACACGAAACGCTAAAGAAAACAGCAATCC
GAAGCGTATCTCGGAAGAATCAAGTTGGAAGGTTTCAGGCTCTGGCTGAGGGGCTGAGGAGCCATGTTTGGAAACCGACAGCCATGAAGAAGAAAGAGCTTAGGAAGCCG
AGGTTGGGGAAGAAGGGTGTGAAGAAGTTGCACTGGTGGAGGATGTTTTGTCGCCGCCGTGGAGTGAAGTTGCCAAATAAAGGGCGTGTGAAAATAGGGTATGTAAATAG
AAAACCACAGCATAAGATAGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGTTAATGAGTTATATCTTGATCTCCTTGCACTGAGGGAATTATACATCCTCCTCTTAAAGAGCTGTTTGCGAGATGCAAATTCAGAACTTCTGGATGAAAGGGC
ACAAATTTTGTTGAAGCATTTGCTCGATGATGCTACTGCTGAAATTGTCCAGTTTCACTCGAAGGTTTGCATGAAAACTTCTTTCATTCTTTCTTTTTTAACAAATGAAC
CATTTGGCACAAAGCCAGTGGAAGAGAAAGTTGCTGAATGGATGGAATACAATCAAAGTACAAGAAAAACGGGAAATGTTGCTGCTAATGACTTATCAAATGGTATTGGT
TTAGCACTCAGAAGAATTGAATTCCACATTTTATCTCTGCAACATTATACAAGTCAAAGTAGGAACACAAGAAGCCATATCAATGGAGCTAAATTATCCAACTCTCCATT
AGATCAGCAGAAAGTTCAGTCAAGAATGGATCACTCAAATTTGAAGGCCAGAGTTGCTGAGCCAATCAATGGCCATTGTTCCGAGTTCGTTCACGGGTTTAGAGTACCTC
TGTCTCAAGACAATGTTGAGGCCATGAAACCTCCAAACGTTGGAACCCAGGTATCTAAACAAAACAAAGTTATAAATCCAGTGATTCTGATAGATAAATCTCGATGTTCA
GTGGGATCCAAGGCTACTGTACGGTCCGTGAATCGAACTCAGATACACGAAAGGCGGTGCCAAAATTTGCCTGGTCATATGATCATGAGGCCAACTTTGCTGAATCATAT
GAAGACTCGAATGCCCACTCAGCAAGAATCAGAATTTACAAACTCAGAATCAGAATCAGTTTCTTCTTCAAGTTGGGCAACTCAGCAGACAAGTGAAACTGAAACCACTG
ATTACCCTTCTTCCTCAAGTCACCAAGAGGATCAACCGGCAACCGGATCGGAGGTGAGTAGCCGGTACAGAAGCAGCAGAATTTCCTCAAAAGCATTTAGAATCAGCCAT
GGGAAAAAGGGGTCGAAGAAAGCGATCGGACGTTTCAAGAGACTGAGAAACAAACTAGGCCTTATCTTCCACCACCATCACCACCACCATCACCATCACCACCACAACAG
CCATAATAACTTCTTCATGTGGAAGCAGCTAAGAAAGATCTTCCATGGCACAGACAAGAAAAGAGTAACAAGTAAAGGACGACACGAAACGCTAAAGAAAACAGCAATCC
GAAGCGTATCTCGGAAGAATCAAGTTGGAAGGTTTCAGGCTCTGGCTGAGGGGCTGAGGAGCCATGTTTGGAAACCGACAGCCATGAAGAAGAAAGAGCTTAGGAAGCCG
AGGTTGGGGAAGAAGGGTGTGAAGAAGTTGCACTGGTGGAGGATGTTTTGTCGCCGCCGTGGAGTGAAGTTGCCAAATAAAGGGCGTGTGAAAATAGGGTATGTAAATAG
AAAACCACAGCATAAGATAGTTTAGTTTGGTGGAACAATTTTCAATCTTCTGCAGCCAATTTGGGCTTTTTATCCTCTCCTATGTTGAGCTGCTATGAGAGTTGAACATT
TTGGTAGACATATGTTGTACGTCTCTAGAGAATGTAAAAGGTTTGTATTTTTAATGTTAGTGTATTCCTATTGATTACAAACAATGTTTCTTAAGCATGAGTTCAAAGCG
CTAAACACTATGAGAAGAGCGGGCCTTTTTTTAGACCCGCTTTCCCAAACATTGATTATAAATTTGATTTAAATGAGTCGACTCCATGTAGGTCAACCACGAACTAGATT
ATGCGCCAAGTGAGATGCAAAAGGGCGAACGATAAAATTAAACATGCCATCCTCAAAGTCAAAAAAATATATACACCCTCAACGTTCATAAGCTAAGCTACCACCAAGGG
TCAAAGAAACTACACAAGCTATCATGGTCTTTTTTTTTTTAATATAAGAAGTTAAGCAACCTATAAATTTTAAATCCATGCTACATAACAGTAAGGAAAATAATTACAAA
TAAAGGGTGAAAGGATACTGCTCTAAATGTTGTGACTCCAATAATCAAGTTCTTCCTCATCTTCTCCCTCTTCTTCGTGTCGAGGATCGTTTGAAAATTCGGGTGGTTTG
GGGAATTGAATTGGATTTGCTCCTTGATTGAATCCACACTGTGCAAACATCCGAACGAGGCTACGGTTGAACTCCACCTGATACGACATAAATCTCCTCCACTCTTGATG
TTGGTGGTTGAGTCTTCGCTCTAGTCGATCTAATCGCTCCTCCATAGATCCTGATTCTTGTGCACGAGGTCTCGACGCTCGAGCTTGCTCATTCGAGCAAGCCTGCAATG
GCTCTATAGCCATTGCGTCTTCAGCAGGGACCCAGCCCTTCAATCCAAGAATAAAACTTTTGTCCATGACACTCCTTGGGTTAAGCAACTCCTCATGAGGTCCCCATACA
ACCCCAGCCTGTCTACATAGAGCAGTGATCAACGAAGGATGGGCTAGACCGCCCGTGGTTGACGCTCGTCTACAATGTCTAATGGACTGTCCGATGAGCCGCCCTACGTC
TATTGATTTACCAGTGACGATAGTATAGACCAAGATCGCCCTTTCCCTAGTAACATCACTGATATGTGTCACTGGCAATAACTTTGCACAGATAAAGCTGTGCCAGACCT
TATTGGTAACGCTCAACTCCGAGGTTTTGAAGCTCAATGCTTCATGTCCTCGGAATTTCCATTGAGTTCCTGGTCTACATAGTTTCAAAATCACTTGTTCTAAGTCCAAT
TCCTCCCTAGCATAGGTGCTATACTTGTCGTGTTCGATGTCGGGCAAATGGTACAACCTATTGATCGTAGTACGATCGAACGGCACCATTTTCCCTCTCACAAACACGCT
TATGTCGTCTTCTTTCATATTCGCATAGAACTCCCTCACCACCGAGACGACTGCAGCCTCGGGTTGTTTAACCAGCTCTTCCCAACCCCTTTGCTCAATGTTCAAATGGA
TGTCCCTTTGGTCCCCACTACATGGTTGTAGCCCCCTTTCTGTAATAACACTACGTTTTGCTACGGAACTGATAAATTTGTCTGAGGATTCTAAGTCTATGAACTTCCTT
CTATCAAAAGATGCCAAAGACGATCCACTAGTCTTGGCACGTTTTGGAGCCATAAACAAATTGATGCACAGATTAAGAGAAGTTTACAGCAGAAAATCTATCCCAAAACT
TGATTGTCTTTTTGGTTTTTTGTTTCTCAAATCAGCACCAAATCAATCGAAGTTCAAGTACATAGTTCACTTCCACAAGAAACAGCAAGTTCCAGCAAAATCAAGCTCCA
AAATAGAGATTTAGAGGAAAAACTTACTTGGAGAGGAATCTAGCGAAAAACCCACGAAAAATCTGGGTAAATTCGCCGGAGTGGAAAGAGGAATCTGCAGATAATAATAG
ATTTAGGGAAGAACTCGTGATTTGAGGTATGGGAAAGAAGAAATTTGAGAGAAGAGAGAAGTTATTGGAGACGGGGAAATTAGGGCAAGAAGTGACTGAAGCTGTTGTGA
GCGTCTTTTTAATTTTTCTGCTATAATAACAGCGCCTAGGCTCGGGATTCCCGCGCGGGGGCGCTATCGCTCTGTGCTTGCGTTTCATTCTGTCTCCCTATGGGAAGAAA
TTATAGAGTTGAA
Protein sequenceShow/hide protein sequence
MEVNELYLDLLALRELYILLLKSCLRDANSELLDERAQILLKHLLDDATAEIVQFHSKVCMKTSFILSFLTNEPFGTKPVEEKVAEWMEYNQSTRKTGNVAANDLSNGIG
LALRRIEFHILSLQHYTSQSRNTRSHINGAKLSNSPLDQQKVQSRMDHSNLKARVAEPINGHCSEFVHGFRVPLSQDNVEAMKPPNVGTQVSKQNKVINPVILIDKSRCS
VGSKATVRSVNRTQIHERRCQNLPGHMIMRPTLLNHMKTRMPTQQESEFTNSESESVSSSSWATQQTSETETTDYPSSSSHQEDQPATGSEVSSRYRSSRISSKAFRISH
GKKGSKKAIGRFKRLRNKLGLIFHHHHHHHHHHHHNSHNNFFMWKQLRKIFHGTDKKRVTSKGRHETLKKTAIRSVSRKNQVGRFQALAEGLRSHVWKPTAMKKKELRKP
RLGKKGVKKLHWWRMFCRRRGVKLPNKGRVKIGYVNRKPQHKIV