CuGenDBv2

ClCG05G007465 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G007465
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Protein of unknown function (DUF1442)
Genome location: CG_Chr05:7880124..7881073
RNA-Seq Expression: ClCG05G007465
Synteny: ClCG05G007465
Gene Ontology terms: NA
InterPro domains: IPR009902 - Protein of unknown function DUF1442


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0067688.1 uncharacterized protein E6C27_scaffold70G00580 [Cucumis melo var. makuwa] (e-value: 7.7e-107, identity: 89.61%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV
        MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCII RQEDLHVSQAI+G+ S+DH IEFVV
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV

Query:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDN-QSATVVVGFNAMSKRCGGGA-GWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRR
        GEAEKLIKTQY EVDFVLIDCNL+ HVAVLEAVRSRR N Q AT+VVGFNAMSKRC GGA GWS G TTHLLPIGKG+MVT+V AEVS +GD GRRMRRR
Subjt:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDN-QSATVVVGFNAMSKRCGGGA-GWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRR

Query:  SQSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
         QSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  SQSQWVVKVDKCTGEEHVFRVRLPQGKVIQA

XP_004148160.1 uncharacterized protein LOC101217454 [Cucumis sativus] (e-value: 2.1e-104, identity: 88.31%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV
        MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCII RQEDLHVSQAI+G+ S+DH IEFVV
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV

Query:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDN-QSATVVVGFNAMSKRCGGGA-GWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRR
        GEAEKLIKTQY EVDFVL+DCNL  H+AVLEAVRSRR N Q AT+VVGFNAMSKRC GGA GWS GSTTHLLPIG G+MVTKV AE S +G+ GRRMRRR
Subjt:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDN-QSATVVVGFNAMSKRCGGGA-GWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRR

Query:  SQSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
         QSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  SQSQWVVKVDKCTGEEHVFRVRLPQGKVIQA

XP_008439110.1 PREDICTED: uncharacterized protein LOC103484000 [Cucumis melo] (e-value: 2.9e-106, identity: 89.18%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV
        MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCII RQEDLHVSQAI+G+ S+DH IEFVV
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV

Query:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDN-QSATVVVGFNAMSKRCGGGA-GWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRR
        GEAEKLIKTQY EVDFVLIDCNL+ H AVLEAVRSRR N Q AT+VVGFNAMSKRC GGA GWS G TTHLLPIGKG+MVT+V AEVS +GD GRRMRRR
Subjt:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDN-QSATVVVGFNAMSKRCGGGA-GWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRR

Query:  SQSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
         QSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  SQSQWVVKVDKCTGEEHVFRVRLPQGKVIQA

XP_022985506.1 uncharacterized protein LOC111483494 [Cucurbita maxima] (e-value: 6.6e-98, identity: 84.72%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV
        MASWSAENATEAFLNTLKMGQK NEPDV EFISAMAAGNNAQLMVVAYE SADHKILALAAAA QTGGRV+C+I RQEDL +SQ I+GVESY H IEFVV
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV

Query:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDNQSATVVVGFNAMSKRCGGGAGWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRRSQ
        GEAEK+I+T YREVDFVLIDCNL+ HVAVL+ VRSR++NQ ATVVVGFNAMSKR  G  GWSGGSTTHLLPIGKGLMVTKV AEVS SG  GRR RR   
Subjt:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDNQSATVVVGFNAMSKRCGGGAGWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRRSQ

Query:  SQWVVKVDKCTGEEHVFRVRLPQGKVIQA
        SQWVVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  SQWVVKVDKCTGEEHVFRVRLPQGKVIQA

XP_038905944.1 uncharacterized protein LOC120091864 [Benincasa hispida] (e-value: 3.1e-108, identity: 90%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV
        MA+WSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYE SADHK+LALAAAA QTGGRVVCII RQED+HVSQAI+GV+S+DHRIEF+V
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV

Query:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRR-DNQSATVVVGFNAMSKRCGGGAGWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRRS
        GEAEKLIKTQYRE DFVLIDCNLEG+VAV+EAVRSRR +N+ ATVVVGFNAMSKRCGGG GW GGSTTHLLPIGKGL+VTKVAAE+S SGD GRRMRRRS
Subjt:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRR-DNQSATVVVGFNAMSKRCGGGAGWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRRS

Query:  QSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
        QSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  QSQWVVKVDKCTGEEHVFRVRLPQGKVIQA

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0L8E9 Uncharacterized protein (e-value: 1.0e-104, identity: 88.31%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV
        MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCII RQEDLHVSQAI+G+ S+DH IEFVV
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV

Query:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDN-QSATVVVGFNAMSKRCGGGA-GWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRR
        GEAEKLIKTQY EVDFVL+DCNL  H+AVLEAVRSRR N Q AT+VVGFNAMSKRC GGA GWS GSTTHLLPIG G+MVTKV AE S +G+ GRRMRRR
Subjt:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDN-QSATVVVGFNAMSKRCGGGA-GWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRR

Query:  SQSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
         QSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  SQSQWVVKVDKCTGEEHVFRVRLPQGKVIQA

A0A1S3AY08 uncharacterized protein LOC103484000 (e-value: 1.4e-106, identity: 89.18%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV
        MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCII RQEDLHVSQAI+G+ S+DH IEFVV
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV

Query:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDN-QSATVVVGFNAMSKRCGGGA-GWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRR
        GEAEKLIKTQY EVDFVLIDCNL+ H AVLEAVRSRR N Q AT+VVGFNAMSKRC GGA GWS G TTHLLPIGKG+MVT+V AEVS +GD GRRMRRR
Subjt:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDN-QSATVVVGFNAMSKRCGGGA-GWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRR

Query:  SQSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
         QSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  SQSQWVVKVDKCTGEEHVFRVRLPQGKVIQA

A0A5D3DJF3 Uncharacterized protein (e-value: 3.7e-107, identity: 89.61%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV
        MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCII RQEDLHVSQAI+G+ S+DH IEFVV
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV

Query:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDN-QSATVVVGFNAMSKRCGGGA-GWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRR
        GEAEKLIKTQY EVDFVLIDCNL+ HVAVLEAVRSRR N Q AT+VVGFNAMSKRC GGA GWS G TTHLLPIGKG+MVT+V AEVS +GD GRRMRRR
Subjt:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDN-QSATVVVGFNAMSKRCGGGA-GWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRR

Query:  SQSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
         QSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  SQSQWVVKVDKCTGEEHVFRVRLPQGKVIQA

A0A6J1E9C9 uncharacterized protein LOC111430589 (e-value: 1.0e-96, identity: 83.84%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV
        MASWSAENATEAFLNTLKMGQK NEPDV EFISAMAAGNNAQLMVVAYE SADHKILALAAAA QTGGRVVC+I RQEDL +SQ I+GVESY H IEFVV
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV

Query:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDNQSATVVVGFNAMSKRCGGGAGWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRRSQ
        GEAEK+I+T YREVDFVLIDCNL+ HVAVL+ VRSR+++Q ATVVVGFNAMSKR  G  GWSGGSTTHLLPIGKGL+VTKV AEVS SG  GRR R    
Subjt:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDNQSATVVVGFNAMSKRCGGGAGWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRRSQ

Query:  SQWVVKVDKCTGEEHVFRVRLPQGKVIQA
        SQWVVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  SQWVVKVDKCTGEEHVFRVRLPQGKVIQA

A0A6J1J8E3 uncharacterized protein LOC111483494 (e-value: 3.2e-98, identity: 84.72%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV
        MASWSAENATEAFLNTLKMGQK NEPDV EFISAMAAGNNAQLMVVAYE SADHKILALAAAA QTGGRV+C+I RQEDL +SQ I+GVESY H IEFVV
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV

Query:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDNQSATVVVGFNAMSKRCGGGAGWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRRSQ
        GEAEK+I+T YREVDFVLIDCNL+ HVAVL+ VRSR++NQ ATVVVGFNAMSKR  G  GWSGGSTTHLLPIGKGLMVTKV AEVS SG  GRR RR   
Subjt:  GEAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDNQSATVVVGFNAMSKRCGGGAGWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRRSQ

Query:  SQWVVKVDKCTGEEHVFRVRLPQGKVIQA
        SQWVVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  SQWVVKVDKCTGEEHVFRVRLPQGKVIQA

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G12320.1 Protein of unknown function (DUF1442) (e-value: 6.3e-14, identity: 31.08%)
Query:  WSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAY-ERSADHKILALAAAAGQTGGRVVCII--GRQEDLHVSQAIIGVESYDHRIEFVV
        WS E A++A+++T+K  +    PD  E I+AMAAG N +L+V  + E  A    + L  A+     + +CI+   R E  ++ QAI    S  +  E +V
Subjt:  WSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAY-ERSADHKILALAAAAGQTGGRVVCII--GRQEDLHVSQAIIGVESYDHRIEFVV

Query:  GEAEKLIKTQYREVDFVLIDC-NLEGHVAVLEAVRSRRDNQSATVVV--GFNAMSKRCGGGAGWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRR
         E       + + VDF+++D  N E   A L+   +   N+ A VV   G++++ +             T  LP+  G+ +  VAA   NSG  G   RR
Subjt:  GEAEKLIKTQYREVDFVLIDC-NLEGHVAVLEAVRSRRDNQSATVVV--GFNAMSKRCGGGAGWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRR

Query:  RSQSQWVVKVDKCTGEEHVFRV
             W+  VD+ +GEEHVF +
Subjt:  RSQSQWVVKVDKCTGEEHVFRV

AT1G62840.1 Protein of unknown function (DUF1442) (e-value: 1.3e-11, identity: 26.67%)
Query:  WSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKI-LALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV--
        WS E A++A+++T+K  +    P   E ++AMAAG NA L+V  +       I + L  A+  T GR +CI+            +  +S  +  E ++  
Subjt:  WSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKI-LALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVV--

Query:  GEAEKLIKTQ--YREVDFVLIDCNLEGHVAVLEAVRSRRDNQSATVVV---GFNAMSKRCGGGAGWSGGST--THLLPIGKGLMVTKVAAEVSNSGDYGR
         E E+L  T    + +DF+++D + +   A    +R+        VVV   G+   +        +S  +   T  LP+  GL +  VAA  S+    G+
Subjt:  GEAEKLIKTQ--YREVDFVLIDCNLEGHVAVLEAVRSRRDNQSATVVV---GFNAMSKRCGGGAGWSGGST--THLLPIGKGLMVTKVAAEVSNSGDYGR

Query:  RMRRRSQSQWVVKVDKCTGEEHVFR
             ++ +W+   D+ +GEEHV R
Subjt:  RMRRRSQSQWVVKVDKCTGEEHVFR

AT2G45360.1 Protein of unknown function (DUF1442) (e-value: 7.2e-18, identity: 31.84%)
Query:  WSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSAD-HKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVVGE
        WS E A++A+++T+K  +   E  V EF+SA AAG NA+L+V  + R       + LA AA  TGGR VCI+   ++    + ++ +  +      VVGE
Subjt:  WSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSAD-HKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVVGE

Query:  AEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDNQSATVVVGFNAMSKRCGGGAGW-------SGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRM
        + +    ++  VDF+++D      V  L   R  + +    V+V  NAM  R   G  W       +    +  LP+G GL +  V A        GR  
Subjt:  AEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDNQSATVVVGFNAMSKRCGGGAGW-------SGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRM

Query:  RRRSQSQWVVKVDKCTGEEHVFR
         R  +S+W+  VD  +GEEH+FR
Subjt:  RRRSQSQWVVKVDKCTGEEHVFR

AT3G60780.1 Protein of unknown function (DUF1442) (e-value: 1.1e-15, identity: 29.91%)
Query:  WSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSAD-HKILALAAAAGQTGGRVVCIIGRQEDLHVSQAII-GVESYDHRIEFVVG
        WS E A+ A+++T++  +   +  V EF+SA AAG N +L+V  + R       + LA AA  T GR VCI+  +E     +A++ G  + D     V+ 
Subjt:  WSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSAD-HKILALAAAAGQTGGRVVCIIGRQEDLHVSQAII-GVESYDHRIEFVVG

Query:  EAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDNQSATVVVGFNAMSKRCGGGAGWSG----GS---TTHLLPIGKGLMVTKVAAEVSNSGDYGRR
         AE +++ +   VDF+++D      V  L   ++   ++   V+V  NA  K    G  W G    G+    +  LP+G+GL +  V A    +G     
Subjt:  EAEKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDNQSATVVVGFNAMSKRCGGGAGWSG----GS---TTHLLPIGKGLMVTKVAAEVSNSGDYGRR

Query:  MRRRSQSQWVVKVDKCTGEEHVFR
          R+  S+W+  +D  +GEEH+F+
Subjt:  MRRRSQSQWVVKVDKCTGEEHVFR

AT5G62280.1 Protein of unknown function (DUF1442) (e-value: 1.0e-48, identity: 47.68%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKIL-ALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFV
        MA WSAENAT+A+L+TLK  Q+  EP+V EFISA+AAGN+A+ + VA   +A+  IL AL AAA QT G+VVC++   E+L +SQ ++   S  H+I+FV
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKIL-ALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFV

Query:  VGEA--EKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDNQSAT-------VVVGFNAMSKRCGGGAGWSGGSTTHLLPIGKGLMVTKVAAEVSNSGD
        VGE+  + LI   + E DFVL+DCNLE H  ++  + +  +  + T       VVVG+NA S+   G   +S G  T  LPIG+GL+VT+V         
Subjt:  VGEA--EKLIKTQYREVDFVLIDCNLEGHVAVLEAVRSRRDNQSAT-------VVVGFNAMSKRCGGGAGWSGGSTTHLLPIGKGLMVTKVAAEVSNSGD

Query:  YGRRMRRRSQSQWVVKVDKCTGEEHVFRVRLPQGKVI
           R +   +S+WVVKVDKCTGEEHVFRVR+P+G+ I
Subjt:  YGRRMRRRSQSQWVVKVDKCTGEEHVFRVRLPQGKVI


Sequences
CDS sequence
ATGGCTAGCTGGTCTGCTGAGAATGCCACTGAAGCCTTCCTCAACACCCTCAAAATGGGCCAAAAAGCGAACGAACCCGATGTAGGGGAGTTCATTTCAGCCATGGCAGC
CGGGAACAACGCGCAGCTAATGGTGGTGGCATACGAAAGGTCTGCTGACCACAAGATCCTAGCACTGGCGGCGGCTGCCGGGCAAACCGGCGGTCGGGTTGTCTGCATAA
TTGGACGGCAAGAAGATCTTCATGTATCACAAGCAATTATCGGGGTGGAATCATATGATCACCGAATAGAGTTCGTGGTGGGAGAAGCCGAGAAGCTTATAAAAACTCAA
TACAGAGAAGTAGATTTTGTTCTGATAGACTGCAATCTGGAGGGCCACGTGGCGGTGCTGGAGGCCGTTAGATCGAGAAGGGACAACCAAAGCGCCACCGTGGTGGTGGG
TTTTAACGCGATGAGCAAAAGATGTGGAGGAGGAGCAGGATGGTCGGGGGGATCGACCACGCATCTTTTGCCAATCGGAAAAGGGTTGATGGTGACGAAAGTGGCGGCGG
AGGTGTCAAATTCCGGCGATTATGGGAGGAGGATGAGGAGGAGGAGCCAAAGCCAGTGGGTTGTGAAGGTGGATAAATGCACCGGAGAAGAACATGTTTTTAGGGTTAGG
CTTCCACAGGGGAAAGTCATTCAAGCCTGA
mRNA sequence
ATGGCTAGCTGGTCTGCTGAGAATGCCACTGAAGCCTTCCTCAACACCCTCAAAATGGGCCAAAAAGCGAACGAACCCGATGTAGGGGAGTTCATTTCAGCCATGGCAGC
CGGGAACAACGCGCAGCTAATGGTGGTGGCATACGAAAGGTCTGCTGACCACAAGATCCTAGCACTGGCGGCGGCTGCCGGGCAAACCGGCGGTCGGGTTGTCTGCATAA
TTGGACGGCAAGAAGATCTTCATGTATCACAAGCAATTATCGGGGTGGAATCATATGATCACCGAATAGAGTTCGTGGTGGGAGAAGCCGAGAAGCTTATAAAAACTCAA
TACAGAGAAGTAGATTTTGTTCTGATAGACTGCAATCTGGAGGGCCACGTGGCGGTGCTGGAGGCCGTTAGATCGAGAAGGGACAACCAAAGCGCCACCGTGGTGGTGGG
TTTTAACGCGATGAGCAAAAGATGTGGAGGAGGAGCAGGATGGTCGGGGGGATCGACCACGCATCTTTTGCCAATCGGAAAAGGGTTGATGGTGACGAAAGTGGCGGCGG
AGGTGTCAAATTCCGGCGATTATGGGAGGAGGATGAGGAGGAGGAGCCAAAGCCAGTGGGTTGTGAAGGTGGATAAATGCACCGGAGAAGAACATGTTTTTAGGGTTAGG
CTTCCACAGGGGAAAGTCATTCAAGCCTGA
Protein sequence
MASWSAENATEAFLNTLKMGQKANEPDVGEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIGRQEDLHVSQAIIGVESYDHRIEFVVGEAEKLIKTQ
YREVDFVLIDCNLEGHVAVLEAVRSRRDNQSATVVVGFNAMSKRCGGGAGWSGGSTTHLLPIGKGLMVTKVAAEVSNSGDYGRRMRRRSQSQWVVKVDKCTGEEHVFRVR
LPQGKVIQA