; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr030040 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr030040
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF1442)
Genome locationtig00153554:2432331..2433105
RNA-Seq ExpressionSgr030040
SyntenySgr030040
Gene Ontology termsNA
InterPro domainsIPR009902 - Protein of unknown function DUF1442


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576788.1 40S ribosomal protein S20-2, partial [Cucurbita argyrosperma subsp. sororia]8.0e-8577.48Show/hide
Query:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVESYR-IEFVVG
        MASWSAENATEAFLNTLKMGQK  EPDVAEFISAMAAGNNA+LMVVAYE SADHKILALAAAA QT G VVC++P  E+LRLSQ +LGVESY  IEFVVG
Subjt:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVESYR-IEFVVG

Query:  EAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRRN-QRGTVVVGFNALSNRS--GWSGGSRTHLLPIGKGLLVTKVAADSSKNGDGTIRSGRRSQWVV
        EAEK+++THYRE DFVLIDCNLD H A+L  VRSR N QR TVVVGFNA+S RS  GWSGGS THLLPIGKGLLVTKV A+ SK+G G  R  RRSQWVV
Subjt:  EAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRRN-QRGTVVVGFNALSNRS--GWSGGSRTHLLPIGKGLLVTKVAADSSKNGDGTIRSGRRSQWVV

Query:  KVDKCTGEEHVFRVRLPQGKVI
        KVDKCTGEEHVFR  L +  ++
Subjt:  KVDKCTGEEHVFRVRLPQGKVI

XP_022922645.1 uncharacterized protein LOC111430589 [Cucurbita moschata]2.2e-9080.8Show/hide
Query:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVESYR-IEFVVG
        MASWSAENATEAFLNTLKMGQK  EPDVAEFISAMAAGNNA+LMVVAYE SADHKILALAAAA QT G VVC++P  E+LRLSQ +LGVESY  IEFVVG
Subjt:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVESYR-IEFVVG

Query:  EAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRRN-QRGTVVVGFNALSNRS--GWSGGSRTHLLPIGKGLLVTKVAADSSKNGDGTIRSGRRSQWVV
        EAEK+++THYRE DFVLIDCNLD H A+L  VRSR+N QR TVVVGFNA+S RS  GWSGGS THLLPIGKGLLVTKV A+ SK+G    R  RRSQWVV
Subjt:  EAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRRN-QRGTVVVGFNALSNRS--GWSGGSRTHLLPIGKGLLVTKVAADSSKNGDGTIRSGRRSQWVV

Query:  KVDKCTGEEHVFRVRLPQGKVIQA
        KVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  KVDKCTGEEHVFRVRLPQGKVIQA

XP_022985506.1 uncharacterized protein LOC111483494 [Cucurbita maxima]2.2e-9080.44Show/hide
Query:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVESYR-IEFVVG
        MASWSAENATEAFLNTLKMGQK  EPDVAEFISAMAAGNNA+LMVVAYE SADHKILALAAAA QT G V+C++P  E+LRLSQ +LGVESY  IEFVVG
Subjt:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVESYR-IEFVVG

Query:  EAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRR-NQRGTVVVGFNALSNRS--GWSGGSRTHLLPIGKGLLVTKVAADSSKNG-DGTIRSGRRSQWV
        EAEK+++THYRE DFVLIDCNLD H A+L  VRSR+ NQR TVVVGFNA+S RS  GWSGGS THLLPIGKGL+VTKV A+ SK+G DG  R  RRSQWV
Subjt:  EAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRR-NQRGTVVVGFNALSNRS--GWSGGSRTHLLPIGKGLLVTKVAADSSKNG-DGTIRSGRRSQWV

Query:  VKVDKCTGEEHVFRVRLPQGKVIQA
        VKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  VKVDKCTGEEHVFRVRLPQGKVIQA

XP_023522325.1 uncharacterized protein LOC111786241 [Cucurbita pepo subsp. pepo]2.2e-9080.36Show/hide
Query:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVESYR-IEFVVG
        MASWSAENATEAFLNTLKMGQK  EPDVAEFISAMAAGNNA+LMVVAYE SADHKILALAAAA QT G VVC++P  E+LRLSQ +LGVESY  IEFVVG
Subjt:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVESYR-IEFVVG

Query:  EAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRRN-QRGTVVVGFNALSNRS--GWSGGSRTHLLPIGKGLLVTKVAADSSKNGDGTIRSGRRSQWVV
        EAE++++THYRE DFVLIDCNLD H A+L  VRSR+N QR TVVVGFNA+S RS  GWSGGS THLLPIGKGL+VTKV A+ SK+G G  R  RRSQWVV
Subjt:  EAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRRN-QRGTVVVGFNALSNRS--GWSGGSRTHLLPIGKGLLVTKVAADSSKNGDGTIRSGRRSQWVV

Query:  KVDKCTGEEHVFRVRLPQGKVIQA
        KVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  KVDKCTGEEHVFRVRLPQGKVIQA

XP_038905944.1 uncharacterized protein LOC120091864 [Benincasa hispida]6.1e-8575.55Show/hide
Query:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVES--YRIEFVV
        MA+WSAENATEAFLNTLKMGQKA EPDV EFISAMAAGNNA+LMVVAYE SADHK+LALAAAA QT G VVCI+   E++ +SQA++GV+S  +RIEF+V
Subjt:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVES--YRIEFVV

Query:  GEAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRR--NQRGTVVVGFNALSNR---SGWSGGSRTHLLPIGKGLLVTKVAADSSKNGDGTIRSGRR--
        GEAEKL+KT YREADFVLIDCNL+G+ A++ AVRSRR  N+  TVVVGFNA+S R    GW GGS THLLPIGKGL+VTKVAA+ SK+GD   R  RR  
Subjt:  GEAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRR--NQRGTVVVGFNALSNR---SGWSGGSRTHLLPIGKGLLVTKVAADSSKNGDGTIRSGRR--

Query:  SQWVVKVDKCTGEEHVFRVRLPQGKVIQA
        SQWVVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  SQWVVKVDKCTGEEHVFRVRLPQGKVIQA

TrEMBL top hitse value%identityAlignment
A0A0A0L8E9 Uncharacterized protein2.6e-8174.03Show/hide
Query:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVES--YRIEFVV
        MASWSAENATEAFLNTLKMGQKA EPDV EFISAMAAGNNA+LMVVAYERSADHKILALAAAA QT G VVCI+   E+L +SQA+LG+ S  + IEFVV
Subjt:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVES--YRIEFVV

Query:  GEAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRR--NQRGTVVVGFNALSNR-----SGWSGGSRTHLLPIGKGLLVTKVAADSSKNG-DG-TIRSG
        GEAEKL+KT Y E DFVL+DCNL  H A+L AVRSRR  +Q  T+VVGFNA+S R     +GWS GS THLLPIG G++VTKV A+ SK G DG  +R  
Subjt:  GEAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRR--NQRGTVVVGFNALSNR-----SGWSGGSRTHLLPIGKGLLVTKVAADSSKNG-DG-TIRSG

Query:  RRSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
        R+SQWVVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  RRSQWVVKVDKCTGEEHVFRVRLPQGKVIQA

A0A1S3AY08 uncharacterized protein LOC1034840008.1e-8374.03Show/hide
Query:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVES--YRIEFVV
        MASWSAENATEAFLNTLKMGQKA EPDV EFISAMAAGNNA+LMVVAYERSADHKILALAAAA QT G VVCI+   E+L +SQA+LG+ S  + IEFVV
Subjt:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVES--YRIEFVV

Query:  GEAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRR--NQRGTVVVGFNALSNR-----SGWSGGSRTHLLPIGKGLLVTKVAADSSKNGDG--TIRSG
        GEAEKL+KT Y E DFVLIDCNLD H A+L AVRSRR  +Q  T+VVGFNA+S R     +GWS G  THLLPIGKG++VT+V A+ SK GD    +R  
Subjt:  GEAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRR--NQRGTVVVGFNALSNR-----SGWSGGSRTHLLPIGKGLLVTKVAADSSKNGDG--TIRSG

Query:  RRSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
        R+SQWVVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  RRSQWVVKVDKCTGEEHVFRVRLPQGKVIQA

A0A5D3DJF3 Uncharacterized protein1.4e-8274.46Show/hide
Query:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVES--YRIEFVV
        MASWSAENATEAFLNTLKMGQKA EPDV EFISAMAAGNNA+LMVVAYERSADHKILALAAAA QT G VVCI+   E+L +SQA+LG+ S  + IEFVV
Subjt:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVES--YRIEFVV

Query:  GEAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRR--NQRGTVVVGFNALSNR-----SGWSGGSRTHLLPIGKGLLVTKVAADSSKNGDGTIRSGRR
        GEAEKL+KT Y E DFVLIDCNLD H A+L AVRSRR  +Q  T+VVGFNA+S R     +GWS G  THLLPIGKG++VT+V A+ SK GD   R  RR
Subjt:  GEAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRR--NQRGTVVVGFNALSNR-----SGWSGGSRTHLLPIGKGLLVTKVAADSSKNGDGTIRSGRR

Query:  --SQWVVKVDKCTGEEHVFRVRLPQGKVIQA
          SQWVVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  --SQWVVKVDKCTGEEHVFRVRLPQGKVIQA

A0A6J1E9C9 uncharacterized protein LOC1114305891.1e-9080.8Show/hide
Query:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVESYR-IEFVVG
        MASWSAENATEAFLNTLKMGQK  EPDVAEFISAMAAGNNA+LMVVAYE SADHKILALAAAA QT G VVC++P  E+LRLSQ +LGVESY  IEFVVG
Subjt:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVESYR-IEFVVG

Query:  EAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRRN-QRGTVVVGFNALSNRS--GWSGGSRTHLLPIGKGLLVTKVAADSSKNGDGTIRSGRRSQWVV
        EAEK+++THYRE DFVLIDCNLD H A+L  VRSR+N QR TVVVGFNA+S RS  GWSGGS THLLPIGKGLLVTKV A+ SK+G    R  RRSQWVV
Subjt:  EAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRRN-QRGTVVVGFNALSNRS--GWSGGSRTHLLPIGKGLLVTKVAADSSKNGDGTIRSGRRSQWVV

Query:  KVDKCTGEEHVFRVRLPQGKVIQA
        KVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  KVDKCTGEEHVFRVRLPQGKVIQA

A0A6J1J8E3 uncharacterized protein LOC1114834941.1e-9080.44Show/hide
Query:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVESYR-IEFVVG
        MASWSAENATEAFLNTLKMGQK  EPDVAEFISAMAAGNNA+LMVVAYE SADHKILALAAAA QT G V+C++P  E+LRLSQ +LGVESY  IEFVVG
Subjt:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVESYR-IEFVVG

Query:  EAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRR-NQRGTVVVGFNALSNRS--GWSGGSRTHLLPIGKGLLVTKVAADSSKNG-DGTIRSGRRSQWV
        EAEK+++THYRE DFVLIDCNLD H A+L  VRSR+ NQR TVVVGFNA+S RS  GWSGGS THLLPIGKGL+VTKV A+ SK+G DG  R  RRSQWV
Subjt:  EAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRR-NQRGTVVVGFNALSNRS--GWSGGSRTHLLPIGKGLLVTKVAADSSKNG-DGTIRSGRRSQWV

Query:  VKVDKCTGEEHVFRVRLPQGKVIQA
        VKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  VKVDKCTGEEHVFRVRLPQGKVIQA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G12320.1 Protein of unknown function (DUF1442)9.3e-1529.63Show/hide
Query:  WSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAY-ERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVESYRIEF---VVG
        WS E A++A+++T+K  +  + PD AE I+AMAAG N KL+V  + E  A    + L  A+   +   +CIV            +   S  + F   +V 
Subjt:  WSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAY-ERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVESYRIEF---VVG

Query:  EAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRR-NQRGTVVV---GFNALSNRSGWSGGSRTHLLPIGKGLLVTKVAA-DSSKNGDGTIRSGRRSQW
        E         +  DF+++D      E    A+++     RG VVV   G+++L          RT  LP+  G+ +  VAA +S K+G+       + +W
Subjt:  EAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRR-NQRGTVVV---GFNALSNRSGWSGGSRTHLLPIGKGLLVTKVAA-DSSKNGDGTIRSGRRSQW

Query:  VVKVDKCTGEEHVFRV
        +  VD+ +GEEHVF +
Subjt:  VVKVDKCTGEEHVFRV

AT1G62840.1 Protein of unknown function (DUF1442)5.1e-1329.15Show/hide
Query:  WSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKI-LALAAAAHQTSGCVVCIVPGPE------ELRLSQAVLGVESYRIEF
        WS E A++A+++T+K  +    P  AE ++AMAAG NA L+V  +       I + L  A+  T+G  +CIVP         +    Q+   +    I  
Subjt:  WSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKI-LALAAAAHQTSGCVVCIVPGPE------ELRLSQAVLGVESYRIEF

Query:  VVGEAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSR-RNQRGTVVV---GFNALSNRSGWSGG------SRTHLLPIGKGLLVTKVAADSSKNGDGTI
          GE  +      +  DF+++D   D  +     +R+     RG VVV   G+   ++   W+         RT  LP+  GL +  VAA  S +G    
Subjt:  VVGEAEKLLKTHYREADFVLIDCNLDGHEAILGAVRSR-RNQRGTVVV---GFNALSNRSGWSGG------SRTHLLPIGKGLLVTKVAADSSKNGDGTI

Query:  RSGRRSQWVVKVDKCTGEEHVFR
         S +R +W+   D+ +GEEHV R
Subjt:  RSGRRSQWVVKVDKCTGEEHVFR

AT2G45360.1 Protein of unknown function (DUF1442)8.1e-1933.94Show/hide
Query:  WSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSAD-HKILALAAAAHQTSGCVVCIVPGPE-ELRLSQAVLGVESYRIEFVVGEA
        WS E A++A+++T+K  +  KE  VAEF+SA AAG NA+L+V  + R       + LA AA  T G  VCIVP  + +L    A+ G  +  +  VVGE+
Subjt:  WSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSAD-HKILALAAAAHQTSGCVVCIVPGPE-ELRLSQAVLGVESYRIEFVVGEA

Query:  EKLLKTHYREADFVLIDCNLDGHEAILGAVRSRRNQRGTVVVGFNALSNR-SGWSGGS---------RTHLLPIGKGLLVTKVAADSSKNGDGTIRSGRR
         +     +   DF+++D      E +     ++ + +G V+V  NA+    SG+             R+  LP+G GL +  V A     G G  R+  R
Subjt:  EKLLKTHYREADFVLIDCNLDGHEAILGAVRSRRNQRGTVVVGFNALSNR-SGWSGGS---------RTHLLPIGKGLLVTKVAADSSKNGDGTIRSGRR

Query:  SQWVVKVDKCTGEEHVFR
        S+W+  VD  +GEEH+FR
Subjt:  SQWVVKVDKCTGEEHVFR

AT3G60780.1 Protein of unknown function (DUF1442)5.3e-1832.42Show/hide
Query:  WSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSAD-HKILALAAAAHQTSGCVVCIVPGPEELRLSQAVL--GVESYRIEFVVGE
        WS E A+ A+++T++  +  ++  VAEF+SA AAG N +L+V  + R       + LA AA  T G  VCIVP  E     +AV+   V S   E +V +
Subjt:  WSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSAD-HKILALAAAAHQTSGCVVCIVPGPEELRLSQAVL--GVESYRIEFVVGE

Query:  AEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRRNQRGTVVVGFNA-LSNRSG--WSG----GS---RTHLLPIGKGLLVTKVAADSSKNGDGTIRSGR
        + + +       DF+++D     HE +     ++ ++ G V+V  NA L +  G  W G    G+   R+  LP+G+GL +  V A    NG   I    
Subjt:  AEKLLKTHYREADFVLIDCNLDGHEAILGAVRSRRNQRGTVVVGFNA-LSNRSG--WSG----GS---RTHLLPIGKGLLVTKVAADSSKNGDGTIRSGR

Query:  RSQWVVKVDKCTGEEHVFR
         S+W+  +D  +GEEH+F+
Subjt:  RSQWVVKVDKCTGEEHVFR

AT5G62280.1 Protein of unknown function (DUF1442)1.3e-5351.71Show/hide
Query:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKIL-ALAAAAHQTSGCVVCIVPGPEELRLSQAVL-GVESYRIEFVV
        MA WSAENAT+A+L+TLK  Q+ KEP+VAEFISA+AAGN+A+ + VA   +A+  IL AL AAA+QT G VVC++ G EEL +SQ +L   E ++I+FVV
Subjt:  MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKIL-ALAAAAHQTSGCVVCIVPGPEELRLSQAVL-GVESYRIEFVV

Query:  GEA--EKLLKTHYREADFVLIDCNLDGHEAILGAVRSRRNQRG--------TVVVGFNALSNRSGW--SGGSRTHLLPIGKGLLVTKVAADSSKNGDGTI
        GE+  + L+  H+ EADFVL+DCNL+ H+ I+G + +   +           VVVG+NA S R  W  S G +T  LPIG+GLLVT+V  +         
Subjt:  GEA--EKLLKTHYREADFVLIDCNLDGHEAILGAVRSRRNQRG--------TVVVGFNALSNRSGW--SGGSRTHLLPIGKGLLVTKVAADSSKNGDGTI

Query:  RSG--RRSQWVVKVDKCTGEEHVFRVRLPQGKVI
        R    R+S+WVVKVDKCTGEEHVFRVR+P+G+ I
Subjt:  RSG--RRSQWVVKVDKCTGEEHVFRVRLPQGKVI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAGCTGGTCTGCTGAGAATGCCACAGAAGCCTTTCTCAACACCCTCAAAATGGGCCAGAAAGCGAAAGAGCCCGACGTTGCAGAGTTCATTTCAGCCATGGCTGC
CGGAAACAACGCGAAGCTAATGGTTGTAGCCTACGAAAGATCTGCAGACCACAAGATTCTAGCGCTGGCCGCCGCCGCCCACCAGACTAGCGGCTGCGTCGTCTGCATCG
TTCCAGGGCCCGAAGAGCTTCGTCTTTCGCAGGCGGTTCTCGGAGTTGAATCATATCGCATCGAGTTCGTCGTCGGGGAAGCCGAAAAGCTTCTGAAAACCCATTACAGA
GAAGCTGATTTCGTGCTGATCGACTGCAATCTCGACGGCCATGAGGCGATCCTCGGAGCTGTGAGATCGAGAAGGAACCAACGCGGCACCGTGGTAGTGGGTTTTAATGC
ACTGAGCAACAGATCTGGGTGGTCCGGCGGGTCGAGAACTCATCTTCTGCCGATCGGAAAAGGGTTGCTGGTGACGAAAGTGGCGGCGGATTCTTCAAAAAATGGCGACG
GAACGATCAGATCTGGAAGGAGGAGCCAGTGGGTTGTGAAAGTTGATAAATGCACTGGGGAAGAACATGTTTTTAGGGTTAGACTTCCACAGGGGAAAGTGATTCAAGCT
TGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTAGCTGGTCTGCTGAGAATGCCACAGAAGCCTTTCTCAACACCCTCAAAATGGGCCAGAAAGCGAAAGAGCCCGACGTTGCAGAGTTCATTTCAGCCATGGCTGC
CGGAAACAACGCGAAGCTAATGGTTGTAGCCTACGAAAGATCTGCAGACCACAAGATTCTAGCGCTGGCCGCCGCCGCCCACCAGACTAGCGGCTGCGTCGTCTGCATCG
TTCCAGGGCCCGAAGAGCTTCGTCTTTCGCAGGCGGTTCTCGGAGTTGAATCATATCGCATCGAGTTCGTCGTCGGGGAAGCCGAAAAGCTTCTGAAAACCCATTACAGA
GAAGCTGATTTCGTGCTGATCGACTGCAATCTCGACGGCCATGAGGCGATCCTCGGAGCTGTGAGATCGAGAAGGAACCAACGCGGCACCGTGGTAGTGGGTTTTAATGC
ACTGAGCAACAGATCTGGGTGGTCCGGCGGGTCGAGAACTCATCTTCTGCCGATCGGAAAAGGGTTGCTGGTGACGAAAGTGGCGGCGGATTCTTCAAAAAATGGCGACG
GAACGATCAGATCTGGAAGGAGGAGCCAGTGGGTTGTGAAAGTTGATAAATGCACTGGGGAAGAACATGTTTTTAGGGTTAGACTTCCACAGGGGAAAGTGATTCAAGCT
TGA
Protein sequenceShow/hide protein sequence
MASWSAENATEAFLNTLKMGQKAKEPDVAEFISAMAAGNNAKLMVVAYERSADHKILALAAAAHQTSGCVVCIVPGPEELRLSQAVLGVESYRIEFVVGEAEKLLKTHYR
EADFVLIDCNLDGHEAILGAVRSRRNQRGTVVVGFNALSNRSGWSGGSRTHLLPIGKGLLVTKVAADSSKNGDGTIRSGRRSQWVVKVDKCTGEEHVFRVRLPQGKVIQA