; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021247 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021247
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr7:5874643..5876452
RNA-Seq ExpressionLag0021247
SyntenyLag0021247
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa]9.0e-5445.19Show/hide
Query:  PYYPSSFPRPQFFVPQSIPQSYPQSSAAIPNSLSPNSYPTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWG
        P  P+S P P   +P   PQ    +++ +PN       P++ QPLAVKL+D+N+++WK QLLN V+ANGL+ FLDGS   PP+FLD QQQQ+N EF +W 
Subjt:  PYYPSSFPRPQFFVPQSIPQSYPQSSAAIPNSLSPNSYPTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWG

Query:  RYNRFIMCWMYSSLSEEKIGEIVSLETAYEIWNSLKRAYDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGS
        RYNR +M W+Y+S++E  +G+IV   +A +IW +L+R Y + + A +  L+T L  IKK+G +   Y+ + + + +  ++IGEP++Y DHL + L GLG 
Subjt:  RYNRFIMCWMYSSLSEEKIGEIVSLETAYEIWNSLKRAYDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGS

Query:  EYNAFVTTIQNRSDNPALEDVRSLLLAYEARLEKQNTVE
        +YN FVT+IQ+++  P++E+V SLLL+Y+ARLE+Q+  +
Subjt:  EYNAFVTTIQNRSDNPALEDVRSLLLAYEARLEKQNTVE

GFZ12741.1 UBX domain-containing protein [Actinidia rufa]8.4e-5238.76Show/hide
Query:  PYYPSSFPRPQFFVPQSIPQSYPQSSAAIPNSLSPNSYPTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWG
        P  P+S P P   +P+  PQ        I N+    + P++ QPLAVKL+D+N+++WK QLLN V+ANGL+ FLDGS   PP+FLD QQQQ+N EF +W 
Subjt:  PYYPSSFPRPQFFVPQSIPQSYPQSSAAIPNSLSPNSYPTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWG

Query:  RYNRFIMCWMYSSLSEEKIGEIVSLETAYEIWNSLKRAYDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGS
        RYNR +M W+Y+S++E  +G+IV   +A +IW +L+R Y + + A +  L+T L  IKK+G +   Y+ + + + +  ++IGEP++Y DHL + L GLG 
Subjt:  RYNRFIMCWMYSSLSEEKIGEIVSLETAYEIWNSLKRAYDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGS

Query:  EYNAFVTTIQNRSDNPALEDVRSLLLAYEARLEKQNTVEQLNLAQVNLNSLQLSHNSRRSSPRSPSNQFFRPPF-NPSL--FPSHYSPSQQVAPSLLGKP
        +YN FVT+IQ+++  P++E+                                         P SP++   +P F NPS   FP+  S S        G+ 
Subjt:  EYNAFVTTIQNRSDNPALEDVRSLLLAYEARLEKQNTVEQLNLAQVNLNSLQLSHNSRRSSPRSPSNQFFRPPF-NPSL--FPSHYSPSQQVAPSLLGKP

Query:  QPPPTQKWPSRPNPNRPPCQICGKFGHTALICHHRTNL
        + P     PS P P RP CQIC K GHTA  C+H TNL
Subjt:  QPPPTQKWPSRPNPNRPPCQICGKFGHTALICHHRTNL

PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale]5.6e-5642.46Show/hide
Query:  PQSIPQSYPQSSAAIPNSLSPNSYPTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWGRYNRFIMCWMYSSL
        P +IP    Q   A P        P++ QP  +KL+ +N+L+WKNQLLN ++ANGL+ F+DGS P PP+F D  +Q  N E++ W R+NR IM W+Y+SL
Subjt:  PQSIPQSYPQSSAAIPNSLSPNSYPTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWGRYNRFIMCWMYSSL

Query:  SEEKIGEIVSLETAYEIWNSLKRAYDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSD
        ++  +G+IV   +A+EIW +L + Y S + A+I  L+ +L  ++KDG +  +Y+ + K + +  +A+GEP+S +DHL ++  GL  EYNAFVT+I  R D
Subjt:  SEEKIGEIVSLETAYEIWNSLKRAYDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSD

Query:  NPALEDVRSLLLAYEARLEKQNTVEQLNLAQVNLNSLQLSHNSRRSSPRSPSNQFFRPPFN-PSLFPSHYSPSQQVAPSLLGKPQ
        N  LE++ SLLL+YE RLE QN   QL+  Q NL  L ++    R +  +P   F +   N    F SH   S Q  PS+LGKPQ
Subjt:  NPALEDVRSLLLAYEARLEKQNTVEQLNLAQVNLNSLQLSHNSRRSSPRSPSNQFFRPPFN-PSLFPSHYSPSQQVAPSLLGKPQ

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]3.4e-5341.75Show/hide
Query:  PTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWGRYNRFIMCWMYSSLSEEKIGEIVSLETAYEIWNSLKRA
        P+L Q L++KL++ N LL K+QLLN ++ANGL+ F+D    +PPK+LD   +Q N EF+ W R N+ +M W+YSSL+   +G+IV   TA +IW SL   
Subjt:  PTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWGRYNRFIMCWMYSSLSEEKIGEIVSLETAYEIWNSLKRA

Query:  YDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPALEDVRSLLLAYEARLEKQNTV
        Y+S + A +M L +QL +IKK    +S+YLS++K V DEF+ IGEP+SYRD L  IL+GL  EY+ FVT+I NRSD P+L++V SLL  YE RL +++  
Subjt:  YDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPALEDVRSLLLAYEARLEKQNTV

Query:  EQLNLAQVNLNSLQLSHNSRRSSPRSPSNQFFRPPFNPSLFPSHYSPSQQVAPSLLGKPQPPPTQKWPSRPNPNRPPCQICGKFGHTALICHHRTNL
        + LN  Q N              PR P        +N S+                                   P CQICGK GH AL  +HRTNL
Subjt:  EQLNLAQVNLNSLQLSHNSRRSSPRSPSNQFFRPPFNPSLFPSHYSPSQQVAPSLLGKPQPPPTQKWPSRPNPNRPPCQICGKFGHTALICHHRTNL

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]5.7e-10962.15Show/hide
Query:  PQSYPQSSAAIPNSLSPNSYPTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWGRYNRFIMCWMYSSLSEEK
        P   P   A  PN  S N +PTLPQPL VKLNDNNFLLWKNQLLNAV+ANGL+G+LDG+I  PP+FLD  Q Q N  +  W RYNR +MCW+YSSLSEEK
Subjt:  PQSYPQSSAAIPNSLSPNSYPTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWGRYNRFIMCWMYSSLSEEK

Query:  IGEIVSLETAYEIWNSLKRAYDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPAL
        +GE+VSLET ++IW+SL R YDSKTTARIMGLKT+L  ++KDG SVSQYL++IKE+ D+F+A+GEP+SYRDHLAH+LDGLGSEYNAFVT+I NR+D+P+L
Subjt:  IGEIVSLETAYEIWNSLKRAYDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPAL

Query:  EDVRSLLLAYEARLEKQNTVEQLNLAQVNLNSLQLSHNSRRSSPRSPSNQFFRPPFNPSLFPSHYSPSQQVAP-------SLLGKPQPPPTQKWPSRPNP
        EDVRSLLLAYEARL+KQNTV+QLN+AQ NL +L L HNS+R  P+               FP+HY  S   +P       S+LGKPQ     KWP +P+ 
Subjt:  EDVRSLLLAYEARLEKQNTVEQLNLAQVNLNSLQLSHNSRRSSPRSPSNQFFRPPFNPSLFPSHYSPSQQVAP-------SLLGKPQPPPTQKWPSRPNP

Query:  NRPPCQICGKFGHTALICHHRTNLA
        ++  CQICGK GH+A +C+HRTN+A
Subjt:  NRPPCQICGKFGHTALICHHRTNLA

TrEMBL top hitse value%identityAlignment
A0A2P5BGF8 Uncharacterized protein2.7e-5642.46Show/hide
Query:  PQSIPQSYPQSSAAIPNSLSPNSYPTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWGRYNRFIMCWMYSSL
        P +IP    Q   A P        P++ QP  +KL+ +N+L+WKNQLLN ++ANGL+ F+DGS P PP+F D  +Q  N E++ W R+NR IM W+Y+SL
Subjt:  PQSIPQSYPQSSAAIPNSLSPNSYPTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWGRYNRFIMCWMYSSL

Query:  SEEKIGEIVSLETAYEIWNSLKRAYDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSD
        ++  +G+IV   +A+EIW +L + Y S + A+I  L+ +L  ++KDG +  +Y+ + K + +  +A+GEP+S +DHL ++  GL  EYNAFVT+I  R D
Subjt:  SEEKIGEIVSLETAYEIWNSLKRAYDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSD

Query:  NPALEDVRSLLLAYEARLEKQNTVEQLNLAQVNLNSLQLSHNSRRSSPRSPSNQFFRPPFN-PSLFPSHYSPSQQVAPSLLGKPQ
        N  LE++ SLLL+YE RLE QN   QL+  Q NL  L ++    R +  +P   F +   N    F SH   S Q  PS+LGKPQ
Subjt:  NPALEDVRSLLLAYEARLEKQNTVEQLNLAQVNLNSLQLSHNSRRSSPRSPSNQFFRPPFN-PSLFPSHYSPSQQVAPSLLGKPQ

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE11.7e-5341.75Show/hide
Query:  PTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWGRYNRFIMCWMYSSLSEEKIGEIVSLETAYEIWNSLKRA
        P+L Q L++KL++ N LL K+QLLN ++ANGL+ F+D    +PPK+LD   +Q N EF+ W R N+ +M W+YSSL+   +G+IV   TA +IW SL   
Subjt:  PTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWGRYNRFIMCWMYSSLSEEKIGEIVSLETAYEIWNSLKRA

Query:  YDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPALEDVRSLLLAYEARLEKQNTV
        Y+S + A +M L +QL +IKK    +S+YLS++K V DEF+ IGEP+SYRD L  IL+GL  EY+ FVT+I NRSD P+L++V SLL  YE RL +++  
Subjt:  YDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPALEDVRSLLLAYEARLEKQNTV

Query:  EQLNLAQVNLNSLQLSHNSRRSSPRSPSNQFFRPPFNPSLFPSHYSPSQQVAPSLLGKPQPPPTQKWPSRPNPNRPPCQICGKFGHTALICHHRTNL
        + LN  Q N              PR P        +N S+                                   P CQICGK GH AL  +HRTNL
Subjt:  EQLNLAQVNLNSLQLSHNSRRSSPRSPSNQFFRPPFNPSLFPSHYSPSQQVAPSLLGKPQPPPTQKWPSRPNPNRPPCQICGKFGHTALICHHRTNL

A0A6J1DQX7 uncharacterized protein LOC1110223152.8e-10962.15Show/hide
Query:  PQSYPQSSAAIPNSLSPNSYPTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWGRYNRFIMCWMYSSLSEEK
        P   P   A  PN  S N +PTLPQPL VKLNDNNFLLWKNQLLNAV+ANGL+G+LDG+I  PP+FLD  Q Q N  +  W RYNR +MCW+YSSLSEEK
Subjt:  PQSYPQSSAAIPNSLSPNSYPTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWGRYNRFIMCWMYSSLSEEK

Query:  IGEIVSLETAYEIWNSLKRAYDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPAL
        +GE+VSLET ++IW+SL R YDSKTTARIMGLKT+L  ++KDG SVSQYL++IKE+ D+F+A+GEP+SYRDHLAH+LDGLGSEYNAFVT+I NR+D+P+L
Subjt:  IGEIVSLETAYEIWNSLKRAYDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPAL

Query:  EDVRSLLLAYEARLEKQNTVEQLNLAQVNLNSLQLSHNSRRSSPRSPSNQFFRPPFNPSLFPSHYSPSQQVAP-------SLLGKPQPPPTQKWPSRPNP
        EDVRSLLLAYEARL+KQNTV+QLN+AQ NL +L L HNS+R  P+               FP+HY  S   +P       S+LGKPQ     KWP +P+ 
Subjt:  EDVRSLLLAYEARLEKQNTVEQLNLAQVNLNSLQLSHNSRRSSPRSPSNQFFRPPFNPSLFPSHYSPSQQVAP-------SLLGKPQPPPTQKWPSRPNP

Query:  NRPPCQICGKFGHTALICHHRTNLA
        ++  CQICGK GH+A +C+HRTN+A
Subjt:  NRPPCQICGKFGHTALICHHRTNLA

A0A7J0EGI5 Uncharacterized protein4.4e-5445.19Show/hide
Query:  PYYPSSFPRPQFFVPQSIPQSYPQSSAAIPNSLSPNSYPTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWG
        P  P+S P P   +P   PQ    +++ +PN       P++ QPLAVKL+D+N+++WK QLLN V+ANGL+ FLDGS   PP+FLD QQQQ+N EF +W 
Subjt:  PYYPSSFPRPQFFVPQSIPQSYPQSSAAIPNSLSPNSYPTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWG

Query:  RYNRFIMCWMYSSLSEEKIGEIVSLETAYEIWNSLKRAYDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGS
        RYNR +M W+Y+S++E  +G+IV   +A +IW +L+R Y + + A +  L+T L  IKK+G +   Y+ + + + +  ++IGEP++Y DHL + L GLG 
Subjt:  RYNRFIMCWMYSSLSEEKIGEIVSLETAYEIWNSLKRAYDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGS

Query:  EYNAFVTTIQNRSDNPALEDVRSLLLAYEARLEKQNTVE
        +YN FVT+IQ+++  P++E+V SLLL+Y+ARLE+Q+  +
Subjt:  EYNAFVTTIQNRSDNPALEDVRSLLLAYEARLEKQNTVE

A0A7J0GPN0 UBX domain-containing protein4.1e-5238.76Show/hide
Query:  PYYPSSFPRPQFFVPQSIPQSYPQSSAAIPNSLSPNSYPTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWG
        P  P+S P P   +P+  PQ        I N+    + P++ QPLAVKL+D+N+++WK QLLN V+ANGL+ FLDGS   PP+FLD QQQQ+N EF +W 
Subjt:  PYYPSSFPRPQFFVPQSIPQSYPQSSAAIPNSLSPNSYPTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWG

Query:  RYNRFIMCWMYSSLSEEKIGEIVSLETAYEIWNSLKRAYDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGS
        RYNR +M W+Y+S++E  +G+IV   +A +IW +L+R Y + + A +  L+T L  IKK+G +   Y+ + + + +  ++IGEP++Y DHL + L GLG 
Subjt:  RYNRFIMCWMYSSLSEEKIGEIVSLETAYEIWNSLKRAYDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGS

Query:  EYNAFVTTIQNRSDNPALEDVRSLLLAYEARLEKQNTVEQLNLAQVNLNSLQLSHNSRRSSPRSPSNQFFRPPF-NPSL--FPSHYSPSQQVAPSLLGKP
        +YN FVT+IQ+++  P++E+                                         P SP++   +P F NPS   FP+  S S        G+ 
Subjt:  EYNAFVTTIQNRSDNPALEDVRSLLLAYEARLEKQNTVEQLNLAQVNLNSLQLSHNSRRSSPRSPSNQFFRPPF-NPSL--FPSHYSPSQQVAPSLLGKP

Query:  QPPPTQKWPSRPNPNRPPCQICGKFGHTALICHHRTNL
        + P     PS P P RP CQIC K GHTA  C+H TNL
Subjt:  QPPPTQKWPSRPNPNRPPCQICGKFGHTALICHHRTNL

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.1e-1722.87Show/hide
Query:  KLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFL-DEQQQQTNLEFLTWGRYNRFIMCWMYSSLSEEKIGEIVSLETAYEIWNSLKRAYDSKTTAR
        KL   N+L+W  Q+        L GFLDGS   PP  +  +   + N ++  W R ++ I   +  ++S      +    TA +IW +L++ Y + +   
Subjt:  KLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFL-DEQQQQTNLEFLTWGRYNRFIMCWMYSSLSEEKIGEIVSLETAYEIWNSLKRAYDSKTTAR

Query:  IMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPALEDVRSLLLAYEARL----------EKQN
        +  L+TQL +  K  +++  Y+  +    D+ + +G+P+ + + +  +L+ L  EY   +  I  +   P L ++   LL +E+++             N
Subjt:  IMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPALEDVRSLLLAYEARL----------EKQN

Query:  TVEQLNLAQVNLNSLQLSHNSRRSSPRSPSNQFFRPPFNPSLFPSHYSPSQQVAPSLLGKPQPPPTQKWPSRPNPNRPPCQICGKFGHTALIC
         V   N    N N    ++ +R +   + +N     P+  S   +++ P+   +   LGK                   CQICG  GH+A  C
Subjt:  TVEQLNLAQVNLNSLQLSHNSRRSSPRSPSNQFFRPPFNPSLFPSHYSPSQQVAPSLLGKPQPPPTQKWPSRPNPNRPPCQICGKFGHTALIC

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).1.7e-1023.64Show/hide
Query:  QSIPQSYPQSSAAIPNSLSPN-SYPTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWGRYNRFIMCWMYSSL
        ++I    P S    P  L P+  +P+      +  +++N++ WK +  + +      GF+DG++P P  F    Q         W + N  +M W+ +S+
Subjt:  QSIPQSYPQSSAAIPNSLSPN-SYPTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNLEFLTWGRYNRFIMCWMYSSL

Query:  SEEKIGEIVSLETAYEIWNSLKRAYDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFS
        +++ +  ++  ETA+++W  L+R +      +I  L+ +L  +++ G SV +Y  ++ +V  E S
Subjt:  SEEKIGEIVSLETAYEIWNSLKRAYDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGATAATGACGGGAGAATTCATTATCTTCATCTTCCTCAAACCAAGTTTGGGCTTGAGATGGAAGAAAGTAGAAGAGCCATAAGGCTCCTTCTTCTCTTCTTCAT
TAATGGCATCAGAAACCGAGTCTTCTTCCTCCTCTTTAGGGCCTGTGGTTTCCTCAATAGCACCTATAACTACTCCAGTAGTTTCTCCATCTACCCCAATCACTACCCCC
ATCGTTTCTCCCATTACACAAACAGTCCAACCTCCCTTTCGCCCACCAAGACAAGCTCAACCCTATTTTTCGTCACCAACACAGCCTCGCCTCCACAAAATCAACCTTCG
GTAAATCTCTATCAGCAACCACAGTCGTTCTATCCTTCCTATTTTCAACCATATTATCCATCATCCTTTCCGAGACCTCAGTTCTTTGTTCCTCAGTCCATCCCTCAGTC
TTATCCCCAATCTTCAGCTGCTATTCCGAATTCTCTTTCCCCCAATTCATATCCTACTCTACCTCAGCCCCTTGCCGTCAAACTCAACGACAACAATTTTCTCTTATGGA
AAAATCAATTGTTGAATGCGGTTCTTGCCAATGGTCTTCAGGGCTTTCTTGATGGCTCGATTCCTGCCCCTCCCAAATTTCTTGATGAACAGCAACAACAAACAAATCTA
GAGTTCCTCACATGGGGAAGGTATAATCGGTTTATTATGTGTTGGATGTACTCATCCTTGTCTGAGGAGAAAATTGGTGAAATAGTTAGCTTAGAGACTGCATATGAAAT
CTGGAATTCTCTGAAACGTGCTTACGACTCTAAAACAACAGCTAGGATTATGGGGTTAAAAACTCAGTTGCACAAGATTAAGAAAGATGGTCAGTCAGTCAGTCAATACT
TATCTCAGATTAAAGAGGTTGTTGATGAATTTTCTGCCATAGGTGAGCCTATTTCTTATAGAGATCATTTAGCTCATATTTTGGATGGTCTCGGTAGTGAATATAATGCT
TTTGTGACTACTATTCAGAACCGTTCTGATAACCCTGCTTTAGAGGATGTTCGCAGCTTATTATTAGCTTATGAAGCTCGCCTTGAGAAACAAAATACTGTAGAGCAGCT
TAATCTTGCCCAAGTAAATCTTAATAGCCTTCAACTTTCACATAACAGTCGTCGTTCCTCACCCCGATCTCCCTCCAATCAATTTTTTAGACCTCCCTTCAATCCATCTT
TGTTTCCTTCTCATTACTCCCCTTCACAGCAAGTTGCTCCTAGCCTATTAGGAAAACCTCAACCTCCACCTACTCAAAAATGGCCTTCTAGACCTAATCCAAATCGCCCT
CCATGTCAGATTTGTGGCAAATTTGGGCACACTGCTCTTATTTGTCACCATAGAACAAACTTAGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGATAATGACGGGAGAATTCATTATCTTCATCTTCCTCAAACCAAGTTTGGGCTTGAGATGGAAGAAAGTAGAAGAGCCATAAGGCTCCTTCTTCTCTTCTTCAT
TAATGGCATCAGAAACCGAGTCTTCTTCCTCCTCTTTAGGGCCTGTGGTTTCCTCAATAGCACCTATAACTACTCCAGTAGTTTCTCCATCTACCCCAATCACTACCCCC
ATCGTTTCTCCCATTACACAAACAGTCCAACCTCCCTTTCGCCCACCAAGACAAGCTCAACCCTATTTTTCGTCACCAACACAGCCTCGCCTCCACAAAATCAACCTTCG
GTAAATCTCTATCAGCAACCACAGTCGTTCTATCCTTCCTATTTTCAACCATATTATCCATCATCCTTTCCGAGACCTCAGTTCTTTGTTCCTCAGTCCATCCCTCAGTC
TTATCCCCAATCTTCAGCTGCTATTCCGAATTCTCTTTCCCCCAATTCATATCCTACTCTACCTCAGCCCCTTGCCGTCAAACTCAACGACAACAATTTTCTCTTATGGA
AAAATCAATTGTTGAATGCGGTTCTTGCCAATGGTCTTCAGGGCTTTCTTGATGGCTCGATTCCTGCCCCTCCCAAATTTCTTGATGAACAGCAACAACAAACAAATCTA
GAGTTCCTCACATGGGGAAGGTATAATCGGTTTATTATGTGTTGGATGTACTCATCCTTGTCTGAGGAGAAAATTGGTGAAATAGTTAGCTTAGAGACTGCATATGAAAT
CTGGAATTCTCTGAAACGTGCTTACGACTCTAAAACAACAGCTAGGATTATGGGGTTAAAAACTCAGTTGCACAAGATTAAGAAAGATGGTCAGTCAGTCAGTCAATACT
TATCTCAGATTAAAGAGGTTGTTGATGAATTTTCTGCCATAGGTGAGCCTATTTCTTATAGAGATCATTTAGCTCATATTTTGGATGGTCTCGGTAGTGAATATAATGCT
TTTGTGACTACTATTCAGAACCGTTCTGATAACCCTGCTTTAGAGGATGTTCGCAGCTTATTATTAGCTTATGAAGCTCGCCTTGAGAAACAAAATACTGTAGAGCAGCT
TAATCTTGCCCAAGTAAATCTTAATAGCCTTCAACTTTCACATAACAGTCGTCGTTCCTCACCCCGATCTCCCTCCAATCAATTTTTTAGACCTCCCTTCAATCCATCTT
TGTTTCCTTCTCATTACTCCCCTTCACAGCAAGTTGCTCCTAGCCTATTAGGAAAACCTCAACCTCCACCTACTCAAAAATGGCCTTCTAGACCTAATCCAAATCGCCCT
CCATGTCAGATTTGTGGCAAATTTGGGCACACTGCTCTTATTTGTCACCATAGAACAAACTTAGCTTAA
Protein sequenceShow/hide protein sequence
MRDNDGRIHYLHLPQTKFGLEMEESRRAIRLLLLFFINGIRNRVFFLLFRACGFLNSTYNYSSSFSIYPNHYPHRFSHYTNSPTSLSPTKTSSTLFFVTNTASPPQNQPS
VNLYQQPQSFYPSYFQPYYPSSFPRPQFFVPQSIPQSYPQSSAAIPNSLSPNSYPTLPQPLAVKLNDNNFLLWKNQLLNAVLANGLQGFLDGSIPAPPKFLDEQQQQTNL
EFLTWGRYNRFIMCWMYSSLSEEKIGEIVSLETAYEIWNSLKRAYDSKTTARIMGLKTQLHKIKKDGQSVSQYLSQIKEVVDEFSAIGEPISYRDHLAHILDGLGSEYNA
FVTTIQNRSDNPALEDVRSLLLAYEARLEKQNTVEQLNLAQVNLNSLQLSHNSRRSSPRSPSNQFFRPPFNPSLFPSHYSPSQQVAPSLLGKPQPPPTQKWPSRPNPNRP
PCQICGKFGHTALICHHRTNLA