CuGenDBv2

Lag0019929 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0019929
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Protein of unknown function (DUF1442)
Genome location: chr5:46780626..46781422
RNA-Seq Expression: Lag0019929
Synteny: Lag0019929
Gene Ontology terms: NA
InterPro domains: IPR009902 - Protein of unknown function DUF1442
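The genome location field can be split into a chromosome name and two coordinates. The following is a minimal sketch: the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr5:46780626..46781422")
# Assumption: 1-based, inclusive coordinates, so both end points are counted.
span = end - start + 1
print(chrom, start, end, span)  # chr5 46780626 46781422 797
```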


Homology
GenBank top hits (e value, % identity, alignment)
XP_008439110.1 PREDICTED: uncharacterized protein LOC103484000 [Cucumis melo] (e value: 1.1e-94; identity: 80.95%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESH-HRIEFVV
        MASWSAENATEAFLNTLKMGQKANEPDV EFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCII ++EDLH+SQAILG+ SH H IEFVV
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESH-HRIEFVV

Query:  GEAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRR-NDRRATVVVGFNAMSKR----SAGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSG
        GEAEKL+++ Y E DFVLIDCNLD H AVLEAVRSRR ND+ AT+VVGFNAMSKR    + GWS G  THLLPIGKG+MVT+V AE+ K+G DG      
Subjt:  GEAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRR-NDRRATVVVGFNAMSKR----SAGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSG

Query:  RRSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
        R+SQWVVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  RRSQWVVKVDKCTGEEHVFRVRLPQGKVIQA

XP_022922645.1 uncharacterized protein LOC111430589 [Cucurbita moschata] (e value: 5.3e-100; identity: 84.96%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESHHRIEFVVG
        MASWSAENATEAFLNTLKMGQK NEPDVAEFISAMAAGNNAQLMVVAYE SADHKILALAAAA QTGGRVVC+IP++EDL LSQ ILG+ES+H IEFVVG
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESHHRIEFVVG

Query:  EAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRATVVVGFNAMSKRS-AGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSGRRSQW
        EAEK++R+HYRE DFVLIDCNLD HVAVL+ VRSR+N +RATVVVGFNAMSKRS  GWSGGS THLLPIGKGL+VTKV AE+ KSGGDG      RRSQW
Subjt:  EAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRATVVVGFNAMSKRS-AGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSGRRSQW

Query:  VVKVDKCTGEEHVFRVRLPQGKVIQA
        VVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  VVKVDKCTGEEHVFRVRLPQGKVIQA

XP_022985506.1 uncharacterized protein LOC111483494 [Cucurbita maxima] (e value: 2.4e-100; identity: 84.96%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESHHRIEFVVG
        MASWSAENATEAFLNTLKMGQK NEPDVAEFISAMAAGNNAQLMVVAYE SADHKILALAAAA QTGGRV+C+IP++EDL LSQ ILG+ES+H IEFVVG
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESHHRIEFVVG

Query:  EAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRATVVVGFNAMSKRS-AGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSGRRSQW
        EAEK++R+HYRE DFVLIDCNLD HVAVL+ VRSR+N++RATVVVGFNAMSKRS  GWSGGS THLLPIGKGLMVTKV AE+ KSG DG  R   RRSQW
Subjt:  EAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRATVVVGFNAMSKRS-AGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSGRRSQW

Query:  VVKVDKCTGEEHVFRVRLPQGKVIQA
        VVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  VVKVDKCTGEEHVFRVRLPQGKVIQA

XP_023522325.1 uncharacterized protein LOC111786241 [Cucurbita pepo subsp. pepo] (e value: 1.1e-100; identity: 85.4%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESHHRIEFVVG
        MASWSAENATEAFLNTLKMGQK NEPDVAEFISAMAAGNNAQLMVVAYE SADHKILALAAAA QTGGRVVC+IP++EDL LSQ ILG+ES+H IEFVVG
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESHHRIEFVVG

Query:  EAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRATVVVGFNAMSKRS-AGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSGRRSQW
        EAE+++R+HYRE DFVLIDCNLD HVAVL+ VRSR+N +RATVVVGFNAMSKRS  GWSGGS THLLPIGKGLMVTKV AE+ KSGGDG  R   RRSQW
Subjt:  EAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRATVVVGFNAMSKRS-AGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSGRRSQW

Query:  VVKVDKCTGEEHVFRVRLPQGKVIQA
        VVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  VVKVDKCTGEEHVFRVRLPQGKVIQA

XP_038905944.1 uncharacterized protein LOC120091864 [Benincasa hispida] (e value: 1.1e-94; identity: 81.22%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESH-HRIEFVV
        MA+WSAENATEAFLNTLKMGQKANEPDV EFISAMAAGNNAQLMVVAYE SADHK+LALAAAA QTGGRVVCII ++ED+H+SQAI+G++SH HRIEF+V
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESH-HRIEFVV

Query:  GEAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRR-NDRRATVVVGFNAMSKR--SAGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSGRR
        GEAEKL+++ YREADFVLIDCNL+G+VAV+EAVRSRR N+  ATVVVGFNAMSKR    GW GGS THLLPIGKGL+VTKVAAEL KSG DG       +
Subjt:  GEAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRR-NDRRATVVVGFNAMSKR--SAGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSGRR

Query:  SQWVVKVDKCTGEEHVFRVRLPQGKVIQA
        SQWVVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  SQWVVKVDKCTGEEHVFRVRLPQGKVIQA

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L8E9 Uncharacterized protein (e value: 2.3e-93; identity: 80.52%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESH-HRIEFVV
        MASWSAENATEAFLNTLKMGQKANEPDV EFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCII ++EDLH+SQAILG+ SH H IEFVV
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESH-HRIEFVV

Query:  GEAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRR-NDRRATVVVGFNAMSKR----SAGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSG
        GEAEKL+++ Y E DFVL+DCNL  H+AVLEAVRSRR ND+ AT+VVGFNAMSKR    + GWS GS THLLPIG G+MVTKV AE  K+G DG      
Subjt:  GEAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRR-NDRRATVVVGFNAMSKR----SAGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSG

Query:  RRSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
        R+SQWVVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  RRSQWVVKVDKCTGEEHVFRVRLPQGKVIQA

A0A1S3AY08 uncharacterized protein LOC103484000 (e value: 5.5e-95; identity: 80.95%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESH-HRIEFVV
        MASWSAENATEAFLNTLKMGQKANEPDV EFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCII ++EDLH+SQAILG+ SH H IEFVV
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESH-HRIEFVV

Query:  GEAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRR-NDRRATVVVGFNAMSKR----SAGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSG
        GEAEKL+++ Y E DFVLIDCNLD H AVLEAVRSRR ND+ AT+VVGFNAMSKR    + GWS G  THLLPIGKG+MVT+V AE+ K+G DG      
Subjt:  GEAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRR-NDRRATVVVGFNAMSKR----SAGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSG

Query:  RRSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
        R+SQWVVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  RRSQWVVKVDKCTGEEHVFRVRLPQGKVIQA

A0A5D3DJF3 Uncharacterized protein (e value: 1.2e-94; identity: 80.95%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESH-HRIEFVV
        MASWSAENATEAFLNTLKMGQKANEPDV EFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCII ++EDLH+SQAILG+ SH H IEFVV
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESH-HRIEFVV

Query:  GEAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRR-NDRRATVVVGFNAMSKR----SAGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSG
        GEAEKL+++ Y E DFVLIDCNLD HVAVLEAVRSRR ND+ AT+VVGFNAMSKR    + GWS G  THLLPIGKG+MVT+V AE+ K+G DG      
Subjt:  GEAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRR-NDRRATVVVGFNAMSKR----SAGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSG

Query:  RRSQWVVKVDKCTGEEHVFRVRLPQGKVIQA
         +SQWVVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  RRSQWVVKVDKCTGEEHVFRVRLPQGKVIQA

A0A6J1E9C9 uncharacterized protein LOC111430589 (e value: 2.6e-100; identity: 84.96%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESHHRIEFVVG
        MASWSAENATEAFLNTLKMGQK NEPDVAEFISAMAAGNNAQLMVVAYE SADHKILALAAAA QTGGRVVC+IP++EDL LSQ ILG+ES+H IEFVVG
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESHHRIEFVVG

Query:  EAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRATVVVGFNAMSKRS-AGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSGRRSQW
        EAEK++R+HYRE DFVLIDCNLD HVAVL+ VRSR+N +RATVVVGFNAMSKRS  GWSGGS THLLPIGKGL+VTKV AE+ KSGGDG      RRSQW
Subjt:  EAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRATVVVGFNAMSKRS-AGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSGRRSQW

Query:  VVKVDKCTGEEHVFRVRLPQGKVIQA
        VVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  VVKVDKCTGEEHVFRVRLPQGKVIQA

A0A6J1J8E3 uncharacterized protein LOC111483494 (e value: 1.1e-100; identity: 84.96%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESHHRIEFVVG
        MASWSAENATEAFLNTLKMGQK NEPDVAEFISAMAAGNNAQLMVVAYE SADHKILALAAAA QTGGRV+C+IP++EDL LSQ ILG+ES+H IEFVVG
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESHHRIEFVVG

Query:  EAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRATVVVGFNAMSKRS-AGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSGRRSQW
        EAEK++R+HYRE DFVLIDCNLD HVAVL+ VRSR+N++RATVVVGFNAMSKRS  GWSGGS THLLPIGKGLMVTKV AE+ KSG DG  R   RRSQW
Subjt:  EAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRATVVVGFNAMSKRS-AGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSGRRSQW

Query:  VVKVDKCTGEEHVFRVRLPQGKVIQA
        VVKVDKCTGEEHVFRVRLPQGKVIQA
Subjt:  VVKVDKCTGEEHVFRVRLPQGKVIQA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G12320.1 Protein of unknown function (DUF1442) (e value: 3.6e-14; identity: 30.73%)
Query:  WSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAY-ERSADHKILALAAAAGQTGGRVVCII--PQEEDLHLSQAILGLESHHRI-EFVV
        WS E A++A+++T+K  +    PD AE I+AMAAG N +L+V  + E  A    + L  A+     + +CI+   + E  +L QAI    S     E +V
Subjt:  WSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAY-ERSADHKILALAAAAGQTGGRVVCII--PQEEDLHLSQAILGLESHHRI-EFVV

Query:  GEAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRATVVVGFNAMS--KRSAGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSGRRS
         E         +  DF+++D       A   A+++     R  VVV  N  S  +R        RT  LP+  G+ +  VAA    SG     +S   + 
Subjt:  GEAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRATVVVGFNAMS--KRSAGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSGRRS

Query:  QWVVKVDKCTGEEHVFRV
        +W+  VD+ +GEEHVF +
Subjt:  QWVVKVDKCTGEEHVFRV

AT1G62840.1 Protein of unknown function (DUF1442) (e value: 3.1e-13; identity: 28.44%)
Query:  WSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKI-LALAAAAGQTGGRVVCIIPQ-EEDLHLSQAILGLESHHRIEFVV--
        WS E A++A+++T+K  +    P  AE ++AMAAG NA L+V  +       I + L  A+  T GR +CI+P         QA+      +  E ++  
Subjt:  WSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKI-LALAAAAGQTGGRVVCIIPQ-EEDLHLSQAILGLESHHRIEFVV--

Query:  --GEAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRATVVVGFNAMSKRSAGWSGGS--------RTHLLPIGKGLMVTKVAAELWKSGGDGT
          GE  +      +  DF+++D +     A    +R+     R  VVV  +   + ++ +S           RT  LP+  GL +  VAA   +S G   
Subjt:  --GEAEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRATVVVGFNAMSKRSAGWSGGS--------RTHLLPIGKGLMVTKVAAELWKSGGDGT

Query:  MRSSGRRSQWVVKVDKCTGEEHVFR
          S+ R+  W+   D+ +GEEHV R
Subjt:  MRSSGRRSQWVVKVDKCTGEEHVFR

AT2G45360.1 Protein of unknown function (DUF1442) (e value: 1.8e-21; identity: 33.78%)
Query:  WSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSAD-HKILALAAAAGQTGGRVVCIIPQEED-LHLSQAILGLESHHRIEFVVGE
        WS E A++A+++T+K  +   E  VAEF+SA AAG NA+L+V  + R       + LA AA  TGGR VCI+P E+  L    A+ G  +   +  VVGE
Subjt:  WSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSAD-HKILALAAAAGQTGGRVVCIIPQEED-LHLSQAILGLESHHRIEFVVGE

Query:  AEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRATVVVGFNAMSKRSAGWSGGS---------RTHLLPIGKGLMVTKVAAELWKSGGDGTMRS
        + +     +   DF+++D      V  L   R  +   +  V+V  NAM +  +G+             R+  LP+G GL +  V       G  G   S
Subjt:  AEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRATVVVGFNAMSKRSAGWSGGS---------RTHLLPIGKGLMVTKVAAELWKSGGDGTMRS

Query:  SGRRSQWVVKVDKCTGEEHVFR
           RS+W+  VD  +GEEH+FR
Subjt:  SGRRSQWVVKVDKCTGEEHVFR

AT3G60780.1 Protein of unknown function (DUF1442) (e value: 5.4e-18; identity: 31.53%)
Query:  WSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSAD-HKILALAAAAGQTGGRVVCIIPQEEDLHLSQAIL-GLESHHRIEFVVGE
        WS E A+ A+++T++  +   +  VAEF+SA AAG N +L+V  + R       + LA AA  T GR VCI+P EE     +A++ G  +    E +V +
Subjt:  WSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSAD-HKILALAAAAGQTGGRVVCIIPQEEDLHLSQAIL-GLESHHRIEFVVGE

Query:  AEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRATVVVGFNAMSKRSAG--WSG----GS---RTHLLPIGKGLMVTKVAAELWKSGGDGTMRS
        + + +       DF+++D      V    A+   +  +   V+V  NA  K   G  W G    G+   R+  LP+G+GL +  V A    SGG   +R 
Subjt:  AEKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRATVVVGFNAMSKRSAG--WSG----GS---RTHLLPIGKGLMVTKVAAELWKSGGDGTMRS

Query:  SGRRSQWVVKVDKCTGEEHVFR
            S+W+  +D  +GEEH+F+
Subjt:  SGRRSQWVVKVDKCTGEEHVFR

AT5G62280.1 Protein of unknown function (DUF1442) (e value: 8.8e-53; identity: 48.93%)
Query:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKIL-ALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESHHRIEFVV
        MA WSAENAT+A+L+TLK  Q+  EP+VAEFISA+AAGN+A+ + VA   +A+  IL AL AAA QT G+VVC++   E+L +SQ +L     H+I+FVV
Subjt:  MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKIL-ALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESHHRIEFVV

Query:  GEA--EKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRAT-------VVVGFNAMSKRSAGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTM
        GE+  + L+ +H+ EADFVL+DCNL+ H  ++  + +   +   T       VVVG+NA S+ S  +S G +T  LPIG+GL+VT+V         +   
Subjt:  GEA--EKLLRSHYREADFVLIDCNLDGHVAVLEAVRSRRNDRRAT-------VVVGFNAMSKRSAGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTM

Query:  RSSGRRSQWVVKVDKCTGEEHVFRVRLPQGKVI
            R+S+WVVKVDKCTGEEHVFRVR+P+G+ I
Subjt:  RSSGRRSQWVVKVDKCTGEEHVFRVRLPQGKVI


Sequences
CDS sequence
ATGGCTAGCTGGTCTGCTGAGAATGCCACTGAAGCCTTCCTCAACACCCTCAAAATGGGCCAAAAAGCGAACGAACCCGACGTAGCGGAGTTCATATCAGCCATGGCAGC
CGGAAACAACGCACAGCTAATGGTGGTGGCATACGAGAGATCCGCCGACCACAAGATTCTAGCCCTCGCCGCCGCGGCCGGCCAAACCGGCGGCCGAGTCGTCTGCATAA
TTCCACAGGAAGAAGATCTTCATCTTTCACAAGCAATTCTCGGACTGGAATCACATCATAGAATCGAGTTCGTCGTCGGAGAAGCCGAGAAGCTTCTCAGAAGCCATTAC
AGAGAAGCGGATTTTGTTCTGATCGACTGCAATCTGGACGGCCACGTGGCGGTGCTGGAGGCCGTCAGATCGAGAAGGAACGACCGACGCGCCACCGTGGTGGTGGGGTT
TAACGCGATGAGCAAAAGATCTGCAGGGTGGTCCGGCGGATCGAGAACGCATCTTTTGCCGATCGGAAAAGGGCTGATGGTGACGAAAGTGGCGGCGGAGTTGTGGAAAT
CCGGCGGCGATGGGACGATGAGATCATCTGGGAGGAGGAGCCAGTGGGTTGTGAAGGTGGATAAATGCACTGGAGAAGAACATGTTTTTAGGGTTAGGCTTCCACAGGGA
AAAGTCATTCAAGCTTGA
mRNA sequence
ATGGCTAGCTGGTCTGCTGAGAATGCCACTGAAGCCTTCCTCAACACCCTCAAAATGGGCCAAAAAGCGAACGAACCCGACGTAGCGGAGTTCATATCAGCCATGGCAGC
CGGAAACAACGCACAGCTAATGGTGGTGGCATACGAGAGATCCGCCGACCACAAGATTCTAGCCCTCGCCGCCGCGGCCGGCCAAACCGGCGGCCGAGTCGTCTGCATAA
TTCCACAGGAAGAAGATCTTCATCTTTCACAAGCAATTCTCGGACTGGAATCACATCATAGAATCGAGTTCGTCGTCGGAGAAGCCGAGAAGCTTCTCAGAAGCCATTAC
AGAGAAGCGGATTTTGTTCTGATCGACTGCAATCTGGACGGCCACGTGGCGGTGCTGGAGGCCGTCAGATCGAGAAGGAACGACCGACGCGCCACCGTGGTGGTGGGGTT
TAACGCGATGAGCAAAAGATCTGCAGGGTGGTCCGGCGGATCGAGAACGCATCTTTTGCCGATCGGAAAAGGGCTGATGGTGACGAAAGTGGCGGCGGAGTTGTGGAAAT
CCGGCGGCGATGGGACGATGAGATCATCTGGGAGGAGGAGCCAGTGGGTTGTGAAGGTGGATAAATGCACTGGAGAAGAACATGTTTTTAGGGTTAGGCTTCCACAGGGA
AAAGTCATTCAAGCTTGA
Protein sequence
MASWSAENATEAFLNTLKMGQKANEPDVAEFISAMAAGNNAQLMVVAYERSADHKILALAAAAGQTGGRVVCIIPQEEDLHLSQAILGLESHHRIEFVVGEAEKLLRSHY
READFVLIDCNLDGHVAVLEAVRSRRNDRRATVVVGFNAMSKRSAGWSGGSRTHLLPIGKGLMVTKVAAELWKSGGDGTMRSSGRRSQWVVKVDKCTGEEHVFRVRLPQG
KVIQA
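The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, assuming the standard nuclear code applies to this gene; the `translate` helper is hypothetical, and the two inputs are the first 27 and the last 9 nucleotides of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGCTAGCTGGTCTGCTGAGAATGCC"))  # MASWSAENA
print(translate("CAAGCTTGA"))                    # QA
```

Passing the full CDS, with or without its line breaks, should give a string that can be compared directly with the protein sequence once that sequence's line breaks are removed.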