CuGenDBv2

Lag0025091 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0025091
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrotran_gag_3 domain-containing protein
Genome location: chr10:8523411..8524451
RNA-Seq Expression: Lag0025091
Synteny: Lag0025091
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits | e value | % identity
KAA0049700.1 T4.5 [Cucumis melo var. makuwa] | 8.0e-61 | 46.97
Query:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN
        +GF+DG+ P PPR    SST   P                 Q N  +EDW+AKD ALMT+INATLSP ALAYVVG ++SK+V D L K YSS SR+NVVN
Subjt:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN

Query:  LKSDLQYLF----------------------------------IYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM
        LKSDLQ ++                                  IYALNGL  +YNTF+TS+RTR+QP TF ELHVLL++EESAL KQ++ DDS + PTV+
Subjt:  LKSDLQYLF----------------------------------IYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM

Query:  VANTQHGASRGLN-PNSFFRGRSQGRGRNQGRGR----------------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVASHNSA
        ++++Q   S      N+F RG   G G++ G GR                         CQIC R GH+ALD +NRMNYNFQGRHPP QLAA+VAS N+A
Subjt:  VANTQHGASRGLN-PNSFFRGRSQGRGRNQGRGR----------------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVASHNSA

Query:  QSNNNPSSSTWLTDSGCNAHITANLNNLSV
          +   SSS  LTDSGCN  IT+++N +S+
Subjt:  QSNNNPSSSTWLTDSGCNAHITANLNNLSV

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo] | 8.0e-61 | 46.97
Query:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN
        +GF+DG+ P PPR    SST   P                 Q N  +EDW+AKD ALMT+INATLSP ALAYVVG ++SK+V D L K YSS SR+NVVN
Subjt:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN

Query:  LKSDLQYLF----------------------------------IYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM
        LKSDLQ ++                                  IYALNGL  +YNTF+TS+RTR+QP TF ELHVLL++EESAL KQ++ DDS + PTV+
Subjt:  LKSDLQYLF----------------------------------IYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM

Query:  VANTQHGASRGLN-PNSFFRGRSQGRGRNQGRGR----------------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVASHNSA
        ++++Q   S      N+F RG   G G++ G GR                         CQIC R GH+ALD +NRMNYNFQGRHPP QLAA+VAS N+A
Subjt:  VANTQHGASRGLN-PNSFFRGRSQGRGRNQGRGR----------------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVASHNSA

Query:  QSNNNPSSSTWLTDSGCNAHITANLNNLSV
          +   SSS  LTDSGCN  IT+++N +S+
Subjt:  QSNNNPSSSTWLTDSGCNAHITANLNNLSV

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo] | 8.0e-61 | 46.97
Query:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN
        +GF+DG+ P PPR    SST   P                 Q N  +EDW+AKD ALMT+INATLSP ALAYVVG ++SK+V D L K YSS SR+NVVN
Subjt:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN

Query:  LKSDLQYLF----------------------------------IYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM
        LKSDLQ ++                                  IYALNGL  +YNTF+TS+RTR+QP TF ELHVLL++EESAL KQ++ DDS + PTV+
Subjt:  LKSDLQYLF----------------------------------IYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM

Query:  VANTQHGASRGLN-PNSFFRGRSQGRGRNQGRGR----------------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVASHNSA
        ++++Q   S      N+F RG   G G++ G GR                         CQIC R GH+ALD +NRMNYNFQGRHPP QLAA+VAS N+A
Subjt:  VANTQHGASRGLN-PNSFFRGRSQGRGRNQGRGR----------------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVASHNSA

Query:  QSNNNPSSSTWLTDSGCNAHITANLNNLSV
          +   SSS  LTDSGCN  IT+++N +S+
Subjt:  QSNNNPSSSTWLTDSGCNAHITANLNNLSV

XP_016900446.1 PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo] | 8.0e-61 | 46.97
Query:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN
        +GF+DG+ P PPR    SST   P                 Q N  +EDW+AKD ALMT+INATLSP ALAYVVG ++SK+V D L K YSS SR+NVVN
Subjt:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN

Query:  LKSDLQYLF----------------------------------IYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM
        LKSDLQ ++                                  IYALNGL  +YNTF+TS+RTR+QP TF ELHVLL++EESAL KQ++ DDS + PTV+
Subjt:  LKSDLQYLF----------------------------------IYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM

Query:  VANTQHGASRGLN-PNSFFRGRSQGRGRNQGRGR----------------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVASHNSA
        ++++Q   S      N+F RG   G G++ G GR                         CQIC R GH+ALD +NRMNYNFQGRHPP QLAA+VAS N+A
Subjt:  VANTQHGASRGLN-PNSFFRGRSQGRGRNQGRGR----------------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVASHNSA

Query:  QSNNNPSSSTWLTDSGCNAHITANLNNLSV
          +   SSS  LTDSGCN  IT+++N +S+
Subjt:  QSNNNPSSSTWLTDSGCNAHITANLNNLSV

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia] | 3.9e-63 | 47.9
Query:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN
        FGF+DGS+ AP + L  SS        ++S    T  + V  P+  FEDW+AKD ALMTLINATLS  ALAYVV   TSK+V + LEKHYSS SRTNVVN
Subjt:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN

Query:  LKSDLQ----------------------------------YLFIYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM
        LKSDLQ                                  YL IYALNGLS +YNT  TS+RTRAQ  +F ELHV +KSEESA+EKQ + +D  + P  +
Subjt:  LKSDLQ----------------------------------YLFIYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM

Query:  VANTQHGASR--GLNPNSFF---RGRSQGRGR--------NQGRGR-------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVA-S
         A++    +R    +PN      RG++ GRG+        NQGRGR             R  CQIC + GH+ALD YNRMN++FQGRHPP QLAA+VA  
Subjt:  VANTQHGASR--GLNPNSFF---RGRSQGRGR--------NQGRGR-------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVA-S

Query:  HNSAQSNNNPSSSTWLTDSGCNAHITANLNNLSV
        +NS  +  N S +TWL DS CN H+TA+L+NLS+
Subjt:  HNSAQSNNNPSSSTWLTDSGCNAHITANLNNLSV

TrEMBL top hits | e value | % identity
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X2 | 3.9e-61 | 46.97
Query:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN
        +GF+DG+ P PPR    SST   P                 Q N  +EDW+AKD ALMT+INATLSP ALAYVVG ++SK+V D L K YSS SR+NVVN
Subjt:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN

Query:  LKSDLQYLF----------------------------------IYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM
        LKSDLQ ++                                  IYALNGL  +YNTF+TS+RTR+QP TF ELHVLL++EESAL KQ++ DDS + PTV+
Subjt:  LKSDLQYLF----------------------------------IYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM

Query:  VANTQHGASRGLN-PNSFFRGRSQGRGRNQGRGR----------------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVASHNSA
        ++++Q   S      N+F RG   G G++ G GR                         CQIC R GH+ALD +NRMNYNFQGRHPP QLAA+VAS N+A
Subjt:  VANTQHGASRGLN-PNSFFRGRSQGRGRNQGRGR----------------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVASHNSA

Query:  QSNNNPSSSTWLTDSGCNAHITANLNNLSV
          +   SSS  LTDSGCN  IT+++N +S+
Subjt:  QSNNNPSSSTWLTDSGCNAHITANLNNLSV

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X3 | 3.9e-61 | 46.97
Query:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN
        +GF+DG+ P PPR    SST   P                 Q N  +EDW+AKD ALMT+INATLSP ALAYVVG ++SK+V D L K YSS SR+NVVN
Subjt:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN

Query:  LKSDLQYLF----------------------------------IYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM
        LKSDLQ ++                                  IYALNGL  +YNTF+TS+RTR+QP TF ELHVLL++EESAL KQ++ DDS + PTV+
Subjt:  LKSDLQYLF----------------------------------IYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM

Query:  VANTQHGASRGLN-PNSFFRGRSQGRGRNQGRGR----------------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVASHNSA
        ++++Q   S      N+F RG   G G++ G GR                         CQIC R GH+ALD +NRMNYNFQGRHPP QLAA+VAS N+A
Subjt:  VANTQHGASRGLN-PNSFFRGRSQGRGRNQGRGR----------------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVASHNSA

Query:  QSNNNPSSSTWLTDSGCNAHITANLNNLSV
          +   SSS  LTDSGCN  IT+++N +S+
Subjt:  QSNNNPSSSTWLTDSGCNAHITANLNNLSV

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X1 | 3.9e-61 | 46.97
Query:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN
        +GF+DG+ P PPR    SST   P                 Q N  +EDW+AKD ALMT+INATLSP ALAYVVG ++SK+V D L K YSS SR+NVVN
Subjt:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN

Query:  LKSDLQYLF----------------------------------IYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM
        LKSDLQ ++                                  IYALNGL  +YNTF+TS+RTR+QP TF ELHVLL++EESAL KQ++ DDS + PTV+
Subjt:  LKSDLQYLF----------------------------------IYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM

Query:  VANTQHGASRGLN-PNSFFRGRSQGRGRNQGRGR----------------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVASHNSA
        ++++Q   S      N+F RG   G G++ G GR                         CQIC R GH+ALD +NRMNYNFQGRHPP QLAA+VAS N+A
Subjt:  VANTQHGASRGLN-PNSFFRGRSQGRGRNQGRGR----------------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVASHNSA

Query:  QSNNNPSSSTWLTDSGCNAHITANLNNLSV
          +   SSS  LTDSGCN  IT+++N +S+
Subjt:  QSNNNPSSSTWLTDSGCNAHITANLNNLSV

A0A5D3CLI6 T4.5 | 3.9e-61 | 46.97
Query:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN
        +GF+DG+ P PPR    SST   P                 Q N  +EDW+AKD ALMT+INATLSP ALAYVVG ++SK+V D L K YSS SR+NVVN
Subjt:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN

Query:  LKSDLQYLF----------------------------------IYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM
        LKSDLQ ++                                  IYALNGL  +YNTF+TS+RTR+QP TF ELHVLL++EESAL KQ++ DDS + PTV+
Subjt:  LKSDLQYLF----------------------------------IYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM

Query:  VANTQHGASRGLN-PNSFFRGRSQGRGRNQGRGR----------------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVASHNSA
        ++++Q   S      N+F RG   G G++ G GR                         CQIC R GH+ALD +NRMNYNFQGRHPP QLAA+VAS N+A
Subjt:  VANTQHGASRGLN-PNSFFRGRSQGRGRNQGRGR----------------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVASHNSA

Query:  QSNNNPSSSTWLTDSGCNAHITANLNNLSV
          +   SSS  LTDSGCN  IT+++N +S+
Subjt:  QSNNNPSSSTWLTDSGCNAHITANLNNLSV

A0A6J1D9L6 uncharacterized protein LOC111018892 | 1.9e-63 | 47.9
Query:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN
        FGF+DGS+ AP + L  SS        ++S    T  + V  P+  FEDW+AKD ALMTLINATLS  ALAYVV   TSK+V + LEKHYSS SRTNVVN
Subjt:  FGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRTNVVN

Query:  LKSDLQ----------------------------------YLFIYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM
        LKSDLQ                                  YL IYALNGLS +YNT  TS+RTRAQ  +F ELHV +KSEESA+EKQ + +D  + P  +
Subjt:  LKSDLQ----------------------------------YLFIYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVM

Query:  VANTQHGASR--GLNPNSFF---RGRSQGRGR--------NQGRGR-------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVA-S
         A++    +R    +PN      RG++ GRG+        NQGRGR             R  CQIC + GH+ALD YNRMN++FQGRHPP QLAA+VA  
Subjt:  VANTQHGASR--GLNPNSFF---RGRSQGRGR--------NQGRGR-------------RVVCQICLRTGHSALDNYNRMNYNFQGRHPPAQLAALVA-S

Query:  HNSAQSNNNPSSSTWLTDSGCNAHITANLNNLSV
        +NS  +  N S +TWL DS CN H+TA+L+NLS+
Subjt:  HNSAQSNNNPSSSTWLTDSGCNAHITANLNNLSV

SwissProt top hits | e value | % identity
Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 2.8e-08 | 26.43
Query:  GPQTFGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRT
        G +  GF+DGS P PP  +    T   P V+ D T                  W  +D  + + I   +S +    V   +T+ ++ + L K Y++ S  
Subjt:  GPQTFGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRT

Query:  NVVNLK--------------SDLQYLFIYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVMV---ANT-----QHGA
        +V  L+               D        L  L  DY      +  +  PP+  E+H  L + ES L   N ++  P    V+     NT       G 
Subjt:  NVVNLK--------------SDLQYLFIYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVMV---ANT-----QHGA

Query:  SRGLNPN---------SFFRGRSQGRGRNQGRGRRVVCQICLRTGHSA-----LDNYNRMNYNFQGRHP--PAQLAALVASHNSAQSNNNPSSSTWLTDS
        +R  N N         S    RS  R      GR   CQIC   GHSA     L  +       Q   P  P Q  A +A ++   +NN      WL DS
Subjt:  SRGLNPN---------SFFRGRSQGRGRNQGRGRRVVCQICLRTGHSA-----LDNYNRMNYNFQGRHP--PAQLAALVASHNSAQSNNNPSSSTWLTDS

Query:  GCNAHITANLNNLS
        G   HIT++ NNLS
Subjt:  GCNAHITANLNNLS

Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGGAAATTCCAATTGACGTCAATTCTCAAGGCCCACAAACTTTTGGTTTTGTTGACGGATCCTTGCCTGCACCTCCTAGGGTTCTTCCTCAATCTTCAACGAAGGTTTC
GCCTGCTGTCTCTTCTGATTCGACTGCAGTCGCCACCGTTGTCATTGAAGTTTCTCAACCAAATCTGAAATTTGAAGATTGGCTCGCCAAAGATCATGCCCTGATGACCC
TTATTAATGCCACTCTCTCGCCGGCGGCTCTTGCCTATGTTGTTGGATGCTCCACTTCGAAAGAGGTATGTGATGCTCTTGAGAAGCATTACTCTTCTACCTCTAGAACT
AATGTGGTTAATCTGAAGTCTGATTTGCAATACCTGTTCATCTATGCCTTAAATGGATTGTCGTTTGATTACAATACCTTCAAGACATCTCTTCGCACTCGTGCCCAACC
ACCGACATTTGCTGAGCTACATGTTCTTCTCAAGTCCGAAGAATCTGCCCTGGAAAAACAGAATCGTTCCGATGATTCCCCATCCCTGCCCACTGTTATGGTAGCCAACA
CTCAGCATGGTGCCTCTCGAGGTCTGAACCCCAACTCTTTCTTTCGTGGGCGGTCTCAAGGTCGTGGAAGAAATCAGGGTCGTGGACGACGTGTTGTCTGTCAAATTTGC
CTTCGGACTGGCCATTCAGCTTTGGACAACTATAATAGGATGAACTACAACTTTCAGGGTCGTCATCCTCCGGCCCAATTAGCTGCTCTTGTGGCTTCTCACAACTCTGC
GCAATCCAACAACAATCCATCCTCTTCAACTTGGTTGACAGATTCGGGTTGTAACGCCCACATCACAGCTAACTTAAACAACCTCAGCGTCTGA
mRNA sequence
ATGGAAATTCCAATTGACGTCAATTCTCAAGGCCCACAAACTTTTGGTTTTGTTGACGGATCCTTGCCTGCACCTCCTAGGGTTCTTCCTCAATCTTCAACGAAGGTTTC
GCCTGCTGTCTCTTCTGATTCGACTGCAGTCGCCACCGTTGTCATTGAAGTTTCTCAACCAAATCTGAAATTTGAAGATTGGCTCGCCAAAGATCATGCCCTGATGACCC
TTATTAATGCCACTCTCTCGCCGGCGGCTCTTGCCTATGTTGTTGGATGCTCCACTTCGAAAGAGGTATGTGATGCTCTTGAGAAGCATTACTCTTCTACCTCTAGAACT
AATGTGGTTAATCTGAAGTCTGATTTGCAATACCTGTTCATCTATGCCTTAAATGGATTGTCGTTTGATTACAATACCTTCAAGACATCTCTTCGCACTCGTGCCCAACC
ACCGACATTTGCTGAGCTACATGTTCTTCTCAAGTCCGAAGAATCTGCCCTGGAAAAACAGAATCGTTCCGATGATTCCCCATCCCTGCCCACTGTTATGGTAGCCAACA
CTCAGCATGGTGCCTCTCGAGGTCTGAACCCCAACTCTTTCTTTCGTGGGCGGTCTCAAGGTCGTGGAAGAAATCAGGGTCGTGGACGACGTGTTGTCTGTCAAATTTGC
CTTCGGACTGGCCATTCAGCTTTGGACAACTATAATAGGATGAACTACAACTTTCAGGGTCGTCATCCTCCGGCCCAATTAGCTGCTCTTGTGGCTTCTCACAACTCTGC
GCAATCCAACAACAATCCATCCTCTTCAACTTGGTTGACAGATTCGGGTTGTAACGCCCACATCACAGCTAACTTAAACAACCTCAGCGTCTGA
Protein sequence
MEIPIDVNSQGPQTFGFVDGSLPAPPRVLPQSSTKVSPAVSSDSTAVATVVIEVSQPNLKFEDWLAKDHALMTLINATLSPAALAYVVGCSTSKEVCDALEKHYSSTSRT
NVVNLKSDLQYLFIYALNGLSFDYNTFKTSLRTRAQPPTFAELHVLLKSEESALEKQNRSDDSPSLPTVMVANTQHGASRGLNPNSFFRGRSQGRGRNQGRGRRVVCQIC
LRTGHSALDNYNRMNYNFQGRHPPAQLAALVASHNSAQSNNNPSSSTWLTDSGCNAHITANLNNLSV