; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000815 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000815
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr4:17283159..17284310
RNA-Seq ExpressionLag0000815
SyntenyLag0000815
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa]6.1e-5646.78Show/hide
Query:  PLFPAPQNQPFP------PNPNFFTLNPYPTLP---QPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFI
        P  P P + P P      PNP     +P P +P   QPL VKL+D+N+++WK QLLN V+AN L  FLDGS   PP+FLD  Q Q NP++  W+RYNR +
Subjt:  PLFPAPQNQPFP------PNPNFFTLNPYPTLP---QPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFI

Query:  MCWIYSSLSEEKMGEIVSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFV
        M WIY+S++E  +G+IV   SA++IW +L   Y + + A +  L+T LQ IKK+G +   Y+ K + + +   +IGEP++Y DHL + L GLG +YN FV
Subjt:  MCWIYSSLSEEKMGEIVSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFV

Query:  TTIQNRSDNPSLEDVHSLLLAYEARLEKQTSVD
        T+IQ+++  PS+E+VHSLLL+Y+ARLE+Q++ D
Subjt:  TTIQNRSDNPSLEDVHSLLLAYEARLEKQTSVD

GFZ12741.1 UBX domain-containing protein [Actinidia rufa]8.8e-4744.39Show/hide
Query:  PLFPAPQNQPFP------PNPNFFTLNP---YPTLPQPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFI
        P  P P + P P      PNP     +P    P++ QPL VKL+D+N+++WK QLLN V+AN L  FLDGS   PP+FLD  Q Q NP++  W+RYNR +
Subjt:  PLFPAPQNQPFP------PNPNFFTLNP---YPTLPQPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFI

Query:  MCWIYSSLSEEKMGEIVSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFV
        M WIY+S++E  +G+IV   SA++IW +L   Y + + A +  L+T LQ IKK+G +   Y+ K + + +   +IGEP++Y DHL + L GLG +YN FV
Subjt:  MCWIYSSLSEEKMGEIVSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFV

Query:  TTIQNRSDNPSLED
        T+IQ+++  PS+E+
Subjt:  TTIQNRSDNPSLED

PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale]1.7e-5042.16Show/hide
Query:  PRPLFPAPQNQPFPPNPNFFTLNPYPTLPQPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFIMCWIYSS
        P    P  Q+Q     P    L   P++ QP T+KL+ +N+L+WKNQLLN ++AN L  F+DGS   PP+F D  +   N +Y+ W+R+NR IM WIY+S
Subjt:  PRPLFPAPQNQPFPPNPNFFTLNPYPTLPQPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFIMCWIYSS

Query:  LSEEKMGEIVSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRS
        L++  MG+IV   SA EIW +L   Y S++ A+I  L+ +LQ ++KDG +  +Y+ K K I +   A+GEP+S +DHL ++  GL  EYNAFVT+I  R 
Subjt:  LSEEKMGEIVSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRS

Query:  DNPSLEDVHSLLLAYEARLEKQTSVDQLNLNLAQVHLSTLNSHHSHRRSNAQTPPSLNSFPKPNFTL------PSQSPVFPSILGKP
        DN  LE+++SLLL+YE RLE Q +  Q  L+  Q +L+ LN +    R N   P    +    N T       P+ +   PSILGKP
Subjt:  DNPSLEDVHSLLLAYEARLEKQTSVDQLNLNLAQVHLSTLNSHHSHRRSNAQTPPSLNSFPKPNFTL------PSQSPVFPSILGKP

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]8.0e-4844.07Show/hide
Query:  NQPFPPNPNFFTLNPYPTLPQPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFIMCWIYSSLSEEKMGEI
        NQ   P     TL P P+L Q L++KL++ N LL K+QLLN ++AN L  F+D   ++PP++LD    Q NP+++ W+R N+ +M WIYSSL+   +G+I
Subjt:  NQPFPPNPNFFTLNPYPTLPQPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFIMCWIYSSLSEEKMGEI

Query:  VSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVH
        V   +A +IW+SL   Y+S + A +M L +QLQ+IKK    +++YL ++K + D+F  IGEP+SYRD L  IL+GL  EY+ FVT+I NRSD PSL++VH
Subjt:  VSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVH

Query:  SLLLAYEARLEKQTSVDQLNLNLAQVH---LSTLNSHHSHRRSNAQTPPSLNSFPKPNFTLPSQSPVFPS
        SLL  YE RL  Q S+DQ NLN  Q +       NS    +        +LN + + N T     PVFP+
Subjt:  SLLLAYEARLEKQTSVDQLNLNLAQVH---LSTLNSHHSHRRSNAQTPPSLNSFPKPNFTLPSQSPVFPS

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]1.3e-9367.14Show/hide
Query:  PAPQNQPFPPNPNFFTLNPYPTLPQPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFIMCWIYSSLSEEK
        P P     PPNP  F+ NP+PTLPQPL VKLNDNNFLLWKNQLLNAV+AN L G+LDG+I  PPQFLD +Q Q NP Y  WERYNR +MCWIYSSLSEEK
Subjt:  PAPQNQPFPPNPNFFTLNPYPTLPQPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFIMCWIYSSLSEEK

Query:  MGEIVSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSL
        MGE+VSL++  +IWSSLT  YDS TTARIMGLKT+LQ ++KDG SV+QYL KIKEIADKF A+GEP+SYRDHLAH+LDGLGSEYNAFVT+I NR+D+PSL
Subjt:  MGEIVSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSL

Query:  EDVHSLLLAYEARLEKQTSVDQLNLNLAQVHLSTLNSHHSHRRSNAQTPPSLNSFP---KPNF-TLPSQSPVFPSILGKP
        EDV SLLLAYEARL+KQ +VDQ  LN+AQ +L  L+  H     N++ PP   SFP   K +F   P  +    SILGKP
Subjt:  EDVHSLLLAYEARLEKQTSVDQLNLNLAQVHLSTLNSHHSHRRSNAQTPPSLNSFP---KPNF-TLPSQSPVFPSILGKP

TrEMBL top hitse value%identityAlignment
A0A2P5BGF8 Uncharacterized protein8.3e-5142.16Show/hide
Query:  PRPLFPAPQNQPFPPNPNFFTLNPYPTLPQPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFIMCWIYSS
        P    P  Q+Q     P    L   P++ QP T+KL+ +N+L+WKNQLLN ++AN L  F+DGS   PP+F D  +   N +Y+ W+R+NR IM WIY+S
Subjt:  PRPLFPAPQNQPFPPNPNFFTLNPYPTLPQPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFIMCWIYSS

Query:  LSEEKMGEIVSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRS
        L++  MG+IV   SA EIW +L   Y S++ A+I  L+ +LQ ++KDG +  +Y+ K K I +   A+GEP+S +DHL ++  GL  EYNAFVT+I  R 
Subjt:  LSEEKMGEIVSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRS

Query:  DNPSLEDVHSLLLAYEARLEKQTSVDQLNLNLAQVHLSTLNSHHSHRRSNAQTPPSLNSFPKPNFTL------PSQSPVFPSILGKP
        DN  LE+++SLLL+YE RLE Q +  Q  L+  Q +L+ LN +    R N   P    +    N T       P+ +   PSILGKP
Subjt:  DNPSLEDVHSLLLAYEARLEKQTSVDQLNLNLAQVHLSTLNSHHSHRRSNAQTPPSLNSFPKPNFTL------PSQSPVFPSILGKP

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE13.9e-4844.07Show/hide
Query:  NQPFPPNPNFFTLNPYPTLPQPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFIMCWIYSSLSEEKMGEI
        NQ   P     TL P P+L Q L++KL++ N LL K+QLLN ++AN L  F+D   ++PP++LD    Q NP+++ W+R N+ +M WIYSSL+   +G+I
Subjt:  NQPFPPNPNFFTLNPYPTLPQPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFIMCWIYSSLSEEKMGEI

Query:  VSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVH
        V   +A +IW+SL   Y+S + A +M L +QLQ+IKK    +++YL ++K + D+F  IGEP+SYRD L  IL+GL  EY+ FVT+I NRSD PSL++VH
Subjt:  VSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVH

Query:  SLLLAYEARLEKQTSVDQLNLNLAQVH---LSTLNSHHSHRRSNAQTPPSLNSFPKPNFTLPSQSPVFPS
        SLL  YE RL  Q S+DQ NLN  Q +       NS    +        +LN + + N T     PVFP+
Subjt:  SLLLAYEARLEKQTSVDQLNLNLAQVH---LSTLNSHHSHRRSNAQTPPSLNSFPKPNFTLPSQSPVFPS

A0A6J1DQX7 uncharacterized protein LOC1110223156.1e-9467.14Show/hide
Query:  PAPQNQPFPPNPNFFTLNPYPTLPQPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFIMCWIYSSLSEEK
        P P     PPNP  F+ NP+PTLPQPL VKLNDNNFLLWKNQLLNAV+AN L G+LDG+I  PPQFLD +Q Q NP Y  WERYNR +MCWIYSSLSEEK
Subjt:  PAPQNQPFPPNPNFFTLNPYPTLPQPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFIMCWIYSSLSEEK

Query:  MGEIVSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSL
        MGE+VSL++  +IWSSLT  YDS TTARIMGLKT+LQ ++KDG SV+QYL KIKEIADKF A+GEP+SYRDHLAH+LDGLGSEYNAFVT+I NR+D+PSL
Subjt:  MGEIVSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSL

Query:  EDVHSLLLAYEARLEKQTSVDQLNLNLAQVHLSTLNSHHSHRRSNAQTPPSLNSFP---KPNF-TLPSQSPVFPSILGKP
        EDV SLLLAYEARL+KQ +VDQ  LN+AQ +L  L+  H     N++ PP   SFP   K +F   P  +    SILGKP
Subjt:  EDVHSLLLAYEARLEKQTSVDQLNLNLAQVHLSTLNSHHSHRRSNAQTPPSLNSFP---KPNF-TLPSQSPVFPSILGKP

A0A7J0EGI5 Uncharacterized protein3.0e-5646.78Show/hide
Query:  PLFPAPQNQPFP------PNPNFFTLNPYPTLP---QPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFI
        P  P P + P P      PNP     +P P +P   QPL VKL+D+N+++WK QLLN V+AN L  FLDGS   PP+FLD  Q Q NP++  W+RYNR +
Subjt:  PLFPAPQNQPFP------PNPNFFTLNPYPTLP---QPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFI

Query:  MCWIYSSLSEEKMGEIVSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFV
        M WIY+S++E  +G+IV   SA++IW +L   Y + + A +  L+T LQ IKK+G +   Y+ K + + +   +IGEP++Y DHL + L GLG +YN FV
Subjt:  MCWIYSSLSEEKMGEIVSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFV

Query:  TTIQNRSDNPSLEDVHSLLLAYEARLEKQTSVD
        T+IQ+++  PS+E+VHSLLL+Y+ARLE+Q++ D
Subjt:  TTIQNRSDNPSLEDVHSLLLAYEARLEKQTSVD

A0A7J0GPN0 UBX domain-containing protein4.3e-4744.39Show/hide
Query:  PLFPAPQNQPFP------PNPNFFTLNP---YPTLPQPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFI
        P  P P + P P      PNP     +P    P++ QPL VKL+D+N+++WK QLLN V+AN L  FLDGS   PP+FLD  Q Q NP++  W+RYNR +
Subjt:  PLFPAPQNQPFP------PNPNFFTLNP---YPTLPQPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFI

Query:  MCWIYSSLSEEKMGEIVSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFV
        M WIY+S++E  +G+IV   SA++IW +L   Y + + A +  L+T LQ IKK+G +   Y+ K + + +   +IGEP++Y DHL + L GLG +YN FV
Subjt:  MCWIYSSLSEEKMGEIVSLKSAAEIWSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFV

Query:  TTIQNRSDNPSLED
        T+IQ+++  PS+E+
Subjt:  TTIQNRSDNPSLED

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.2e-1925.32Show/hide
Query:  KLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQ-NPDYLGWERYNRFIMCWIYSSLSEEKMGEIVSLKSAAEIWSSLTCSYDSNTTAR
        KL   N+L+W  Q+       EL GFLDGS   PP  +  +   + NPDY  W+R ++ I   +  ++S      +    +AA+IW +L   Y + +   
Subjt:  KLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQ-NPDYLGWERYNRFIMCWIYSSLSEEKMGEIVSLKSAAEIWSSLTCSYDSNTTAR

Query:  IMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVHSLLLAYEARLEKQTSVDQLNLNLA
        +  L+TQL++  K   ++  Y+  +    D+   +G+P+ + + +  +L+ L  EY   +  I  +   P+L ++H  LL +E+++   +S   + +   
Subjt:  IMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVHSLLLAYEARLEKQTSVDQLNLNLA

Query:  QV---HLSTLNSHHSHRRSNAQTPPSLNSFPKP
         V   + +T N++++  R+N     + N+  KP
Subjt:  QV---HLSTLNSHHSHRRSNAQTPPSLNSFPKP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.8e-1427.19Show/hide
Query:  KLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQ-NPDYLGWERYNRFIMCWIYSSLSEEKMGEIVSLKSAAEIWSSLTCSYDSNTTAR
        KL   N+L+W  Q+       EL GFLDGS   PP  +  +   + NPDY  W R ++ I   I  ++S      +    +AA+IW +L   Y + +   
Subjt:  KLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQ-NPDYLGWERYNRFIMCWIYSSLSEEKMGEIVSLKSAAEIWSSLTCSYDSNTTAR

Query:  IMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVHSLLLAYEARLEKQTSVDQLNLNLA
        +    TQL+ I +                D+   +G+P+ + + +  +L+ L  +Y   +  I  +   PSL ++H  L+  E++L        L LN A
Subjt:  IMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVHSLLLAYEARLEKQTSVDQLNLNLA

Query:  QVHLSTLNSHHSHRRSN
        +V   T N   +HR +N
Subjt:  QVHLSTLNSHHSHRRSN

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)3.6e-1421.35Show/hide
Query:  PLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFIMCWIYSSLSEEK-MGEIVSLKSAAEIWSSLTCSYDSN
        P+ + + ++N+  W+   L   L+ ++ G +DG++              N + + W++ +  +   +Y +L+ ++  G  V+  ++ +IW  +   + +N
Subjt:  PLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFIMCWIYSSLSEEK-MGEIVSLKSAAEIWSSLTCSYDSN

Query:  TTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVHSLLLAYEARLEK
          AR + L ++L+        V  Y  K+K++AD    +  P++ R+ + ++L+GL  +++  +  I++R   PS +D  ++L   E RL++
Subjt:  TTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVHSLLLAYEARLEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGCCGATGAAAGTTCTTCCTCTAGGATCGTCTCCGATCAACAAAGCAACCCAGTGATTACACCAAATACTCCTAATACCACTCCCATTATCACTCCCATTCATAA
TCCTCCGATAGTTTCGACTCCTGTTGCCACTCCAGTAGCCCGACCGCCAGTTCAAAATCGAAATCAGCCTAATCCTACTCAACCTGCTTTTGCTCCCTTCAATCCAAATC
CCTATCCTCAAGCTCAACCATACTATCCCTATGCTCAGCAACCATACTTCCCACCTCAGCAACTCCCTCAGAACCAACCATTTTACCCTCGTCCTCTGTTTCCAGCTCCA
CAAAATCAACCATTCCCTCCTAACCCTAATTTCTTCACCCTAAATCCCTATCCCACACTTCCTCAACCACTGACTGTCAAGCTCAATGACAACAACTTTCTCTTGTGGAA
AAATCAGCTGCTCAACGCTGTTCTCGCGAATGAATTGTACGGTTTCCTCGACGGTTCGATCGCTGCTCCTCCTCAATTTCTAGATCAAAATCAGACTCAACAGAACCCTG
ACTATCTTGGATGGGAGAGGTACAACCGTTTTATTATGTGTTGGATATACTCGTCTTTGTCTGAAGAGAAAATGGGAGAAATAGTAAGTTTGAAATCTGCTGCTGAAATC
TGGTCTTCCCTTACTTGTTCTTACGATTCTAATACTACTGCTCGAATCATGGGTCTAAAAACTCAACTGCAAAAGATAAAAAAAGATGGTTTCTCTGTAACTCAGTACCT
AGGAAAGATTAAGGAAATTGCTGACAAATTTGTTGCTATTGGGGAACCGATTTCTTATCGTGATCACTTAGCACATATTTTGGATGGTTTAGGTAGTGAATACAATGCTT
TTGTCACTACAATTCAAAATAGGTCTGATAATCCTTCTTTAGAAGATGTTCACAGTTTGTTATTAGCCTATGAAGCTCGATTGGAGAAACAAACATCTGTTGACCAACTT
AATCTTAATCTAGCTCAAGTGCATCTTAGCACCCTTAACAGTCATCATTCTCATCGCCGTTCTAATGCTCAAACACCTCCTTCTCTTAATTCCTTTCCCAAGCCAAATTT
TACTCTGCCCTCCCAATCCCCTGTTTTTCCCAGCATACTTGGCAAACCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAACGCCGATGAAAGTTCTTCCTCTAGGATCGTCTCCGATCAACAAAGCAACCCAGTGATTACACCAAATACTCCTAATACCACTCCCATTATCACTCCCATTCATAA
TCCTCCGATAGTTTCGACTCCTGTTGCCACTCCAGTAGCCCGACCGCCAGTTCAAAATCGAAATCAGCCTAATCCTACTCAACCTGCTTTTGCTCCCTTCAATCCAAATC
CCTATCCTCAAGCTCAACCATACTATCCCTATGCTCAGCAACCATACTTCCCACCTCAGCAACTCCCTCAGAACCAACCATTTTACCCTCGTCCTCTGTTTCCAGCTCCA
CAAAATCAACCATTCCCTCCTAACCCTAATTTCTTCACCCTAAATCCCTATCCCACACTTCCTCAACCACTGACTGTCAAGCTCAATGACAACAACTTTCTCTTGTGGAA
AAATCAGCTGCTCAACGCTGTTCTCGCGAATGAATTGTACGGTTTCCTCGACGGTTCGATCGCTGCTCCTCCTCAATTTCTAGATCAAAATCAGACTCAACAGAACCCTG
ACTATCTTGGATGGGAGAGGTACAACCGTTTTATTATGTGTTGGATATACTCGTCTTTGTCTGAAGAGAAAATGGGAGAAATAGTAAGTTTGAAATCTGCTGCTGAAATC
TGGTCTTCCCTTACTTGTTCTTACGATTCTAATACTACTGCTCGAATCATGGGTCTAAAAACTCAACTGCAAAAGATAAAAAAAGATGGTTTCTCTGTAACTCAGTACCT
AGGAAAGATTAAGGAAATTGCTGACAAATTTGTTGCTATTGGGGAACCGATTTCTTATCGTGATCACTTAGCACATATTTTGGATGGTTTAGGTAGTGAATACAATGCTT
TTGTCACTACAATTCAAAATAGGTCTGATAATCCTTCTTTAGAAGATGTTCACAGTTTGTTATTAGCCTATGAAGCTCGATTGGAGAAACAAACATCTGTTGACCAACTT
AATCTTAATCTAGCTCAAGTGCATCTTAGCACCCTTAACAGTCATCATTCTCATCGCCGTTCTAATGCTCAAACACCTCCTTCTCTTAATTCCTTTCCCAAGCCAAATTT
TACTCTGCCCTCCCAATCCCCTGTTTTTCCCAGCATACTTGGCAAACCTTAG
Protein sequenceShow/hide protein sequence
MNADESSSSRIVSDQQSNPVITPNTPNTTPIITPIHNPPIVSTPVATPVARPPVQNRNQPNPTQPAFAPFNPNPYPQAQPYYPYAQQPYFPPQQLPQNQPFYPRPLFPAP
QNQPFPPNPNFFTLNPYPTLPQPLTVKLNDNNFLLWKNQLLNAVLANELYGFLDGSIAAPPQFLDQNQTQQNPDYLGWERYNRFIMCWIYSSLSEEKMGEIVSLKSAAEI
WSSLTCSYDSNTTARIMGLKTQLQKIKKDGFSVTQYLGKIKEIADKFVAIGEPISYRDHLAHILDGLGSEYNAFVTTIQNRSDNPSLEDVHSLLLAYEARLEKQTSVDQL
NLNLAQVHLSTLNSHHSHRRSNAQTPPSLNSFPKPNFTLPSQSPVFPSILGKP