; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0023129 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0023129
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr7:44631174..44635380
RNA-Seq ExpressionLag0023129
SyntenyLag0023129
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.0e-3432.49Show/hide
Query:  NSEISSRSQSSQAINPENKISTVKLSDEKFLLWKFQILTALEGHDLDQHISEDCEPPPEKVSEN------------------------------GAMEKM
        N+E S  S  +Q     NKIS VKL+D+ FLLWKFQILTALE +DL+  +  + EPP + +                                 G+M + 
Subjt:  NSEISSRSQSSQAINPENKISTVKLSDEKFLLWKFQILTALEGHDLDQHISEDCEPPPEKVSEN------------------------------GAMEKM

Query:  LMFKTIFC--------------------------------------------------------------------IFSRLGAEYETMVSVITAKTSTQT
        ++ + + C                                                                    I + LG++Y++M+SVI+A+T + +
Subjt:  LMFKTIFC--------------------------------------------------------------------IFSRLGAEYETMVSVITAKTSTQT

Query:  IHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQNTVQ--NSSQNVQQQNFGNNRGRGHSNFGQNRGGRTWNNRNRPQCQICNKFGHIAVKCYSR
        + +V++LLLT ES+ ESK  +I +  LPS N+  Q   +       ++QN    N   N+  G  N   NRG R   NRN+PQCQIC K G+ A +C+ R
Subjt:  IHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQNTVQ--NSSQNVQQQNFGNNRGRGHSNFGQNRGGRTWNNRNRPQCQICNKFGHIAVKCYSR

Query:  VQMPGAYATQFSPSGPVFPSCHNFGQKQFGGPFPQMQAMMAAPNYNQDCNQYPDSGAMNHLTNSLSNMSVSSEYPRNNQILIGNGAGLPISNLG
                T  S S    P+ HN          PQM AM+AA + N D N YPDSGA NHLT+SLSN+S+ SEY   NQI   NG+GLPI++ G
Subjt:  VQMPGAYATQFSPSGPVFPSCHNFGQKQFGGPFPQMQAMMAAPNYNQDCNQYPDSGAMNHLTNSLSNMSVSSEYPRNNQILIGNGAGLPISNLG

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.0e-3432.49Show/hide
Query:  NSEISSRSQSSQAINPENKISTVKLSDEKFLLWKFQILTALEGHDLDQHISEDCEPPPEKVSEN------------------------------GAMEKM
        N+E S  S  +Q     NKIS VKL+D+ FLLWKFQILTALE +DL+  +  + EPP + +                                 G+M + 
Subjt:  NSEISSRSQSSQAINPENKISTVKLSDEKFLLWKFQILTALEGHDLDQHISEDCEPPPEKVSEN------------------------------GAMEKM

Query:  LMFKTIFC--------------------------------------------------------------------IFSRLGAEYETMVSVITAKTSTQT
        ++ + + C                                                                    I + LG++Y++M+SVI+A+T + +
Subjt:  LMFKTIFC--------------------------------------------------------------------IFSRLGAEYETMVSVITAKTSTQT

Query:  IHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQNTVQ--NSSQNVQQQNFGNNRGRGHSNFGQNRGGRTWNNRNRPQCQICNKFGHIAVKCYSR
        + +V++LLLT ES+ ESK  +I +  LPS N+  Q   +       ++QN    N   N+  G  N   NRG R   NRN+PQCQIC K G+ A +C+ R
Subjt:  IHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQNTVQ--NSSQNVQQQNFGNNRGRGHSNFGQNRGGRTWNNRNRPQCQICNKFGHIAVKCYSR

Query:  VQMPGAYATQFSPSGPVFPSCHNFGQKQFGGPFPQMQAMMAAPNYNQDCNQYPDSGAMNHLTNSLSNMSVSSEYPRNNQILIGNGAGLPISNLG
                T  S S    P+ HN          PQM AM+AA + N D N YPDSGA NHLT+SLSN+S+ SEY   NQI   NG+GLPI++ G
Subjt:  VQMPGAYATQFSPSGPVFPSCHNFGQKQFGGPFPQMQAMMAAPNYNQDCNQYPDSGAMNHLTNSLSNMSVSSEYPRNNQILIGNGAGLPISNLG

XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia]4.4e-2639.5Show/hide
Query:  AMEKMLMFKTIFCIFSRLGAEYETMVSVITAKTSTQTIHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQNTVQNSSQNVQQQNFGNNRGRGHS
        A +K+ +   I  I + L +E+E+ VSVI+A+T TQT+ +V +LLL+HE R E + ++  D  LPS NL  Q    N+ Q  S + Q+    NNR +   
Subjt:  AMEKMLMFKTIFCIFSRLGAEYETMVSVITAKTSTQTIHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQNTVQNSSQNVQQQNFGNNRGRGHS

Query:  NFGQNRGGRTWNNRNRPQCQICNKFGHIAVKCYSRVQMP-----GAYATQFSPSGPVFPSCHN----FGQKQ--FGGPF-----PQMQAMMAAPNYNQDC
        N G     R WN+ NRPQCQI  KFGH A++CY R +       G    Q   SG    S ++    FG +Q  F   F       M A +A  ++N+D 
Subjt:  NFGQNRGGRTWNNRNRPQCQICNKFGHIAVKCYSRVQMP-----GAYATQFSPSGPVFPSCHN----FGQKQ--FGGPF-----PQMQAMMAAPNYNQDC

Query:  NQYPDSGAMNHLTNSLSNMSVSSEYPRNNQILIGNGAG
        N YPDSGA NH+T++ +N++ S+EY  +NQ+ IGNG G
Subjt:  NQYPDSGAMNHLTNSLSNMSVSSEYPRNNQILIGNGAG

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]1.5e-2929.34Show/hide
Query:  SENNNSEISSRSQSSQAINPENKISTVKLSDEKFLLWKFQILTALEGHDLDQHISEDCEPPPEKVSEN------------------------------GA
        S   NS+ +   Q+S+ INP +K+S V+L+D+  LLWKFQI TAL+G+ L+ +I  + + P + V                                 G+
Subjt:  SENNNSEISSRSQSSQAINPENKISTVKLSDEKFLLWKFQILTALEGHDLDQHISEDCEPPPEKVSEN------------------------------GA

Query:  MEKMLMFKTIFC--------------------------------------------------------------------IFSRLGAEYETMVSVITAKT
        M + ++ + + C                                                                    I + LG E++ ++SVITA+ 
Subjt:  MEKMLMFKTIFC--------------------------------------------------------------------IFSRLGAEYETMVSVITAKT

Query:  STQTIHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQ--NTVQNSSQNVQQQNFGNNRGRGHSNFGQNRGGRTWNNRNRPQCQICNKFGHIAVK
          QT+ +V +LLL  E R E +  +  D  LPS NL + +  +  N  Q+   N  Q N+ + RGRG +N   NR  R W   N+PQCQIC +FGH A++
Subjt:  STQTIHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQ--NTVQNSSQNVQQQNFGNNRGRGHSNFGQNRGGRTWNNRNRPQCQICNKFGHIAVK

Query:  CYSRVQM----PGAYATQFSPSG-----PVFPSCHNF--------GQKQFGGPFP-QMQAMMAAPNYNQDCNQYPDSGAMNHLTNSLSNMSVSSEYPRNN
        CY R +     P      FSP+G     P+    HN         G        P QMQA+M A ++N+D N Y DSG  NH+TN   N S+ SEY  + 
Subjt:  CYSRVQM----PGAYATQFSPSG-----PVFPSCHNF--------GQKQFGGPFP-QMQAMMAAPNYNQDCNQYPDSGAMNHLTNSLSNMSVSSEYPRNN

Query:  QILIGNGAG
        +I +GNG G
Subjt:  QILIGNGAG

XP_022158089.1 uncharacterized protein LOC111024658 [Momordica charantia]3.0e-2742.15Show/hide
Query:  IFCIFSRLGAEYETMVSVITAKTSTQTIHDVVALLLTHESRIESKTAVIPDNILPST--NLAVQNPIQNTVQNS------SQNVQQQNFGNNRGRGHSNF
        I  I S LG+EYE+ VSVIT K    TI DV ALLL+H+ RIE + +   D  LPS   NLA Q   QN   NS        +  QQ++ N+  RG   F
Subjt:  IFCIFSRLGAEYETMVSVITAKTSTQTIHDVVALLLTHESRIESKTAVIPDNILPST--NLAVQNPIQNTVQNS------SQNVQQQNFGNNRGRGHSNF

Query:  GQNRGGRTWNNRNRPQCQICNKFGHIAVKCY---SRVQMPGAYATQFSPSGPVFPSCHNFGQKQFGGPFPQMQAMMAAPNYNQDCNQYPDSGAMNHLTNS
         +N GGR WN+RN+ QCQIC++FGH A + Y   S VQ    Y+T++      F S   + Q+Q         AM+ + + N+D N YPDSGA NH+T+ 
Subjt:  GQNRGGRTWNNRNRPQCQICNKFGHIAVKCY---SRVQMPGAYATQFSPSGPVFPSCHNFGQKQFGGPFPQMQAMMAAPNYNQDCNQYPDSGAMNHLTNS

Query:  LSNMSVSSEYPRNNQILIGNGAG
        L N+S+ +E      I   NGAG
Subjt:  LSNMSVSSEYPRNNQILIGNGAG

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-949.5e-3532.49Show/hide
Query:  NSEISSRSQSSQAINPENKISTVKLSDEKFLLWKFQILTALEGHDLDQHISEDCEPPPEKVSEN------------------------------GAMEKM
        N+E S  S  +Q     NKIS VKL+D+ FLLWKFQILTALE +DL+  +  + EPP + +                                 G+M + 
Subjt:  NSEISSRSQSSQAINPENKISTVKLSDEKFLLWKFQILTALEGHDLDQHISEDCEPPPEKVSEN------------------------------GAMEKM

Query:  LMFKTIFC--------------------------------------------------------------------IFSRLGAEYETMVSVITAKTSTQT
        ++ + + C                                                                    I + LG++Y++M+SVI+A+T + +
Subjt:  LMFKTIFC--------------------------------------------------------------------IFSRLGAEYETMVSVITAKTSTQT

Query:  IHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQNTVQ--NSSQNVQQQNFGNNRGRGHSNFGQNRGGRTWNNRNRPQCQICNKFGHIAVKCYSR
        + +V++LLLT ES+ ESK  +I +  LPS N+  Q   +       ++QN    N   N+  G  N   NRG R   NRN+PQCQIC K G+ A +C+ R
Subjt:  IHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQNTVQ--NSSQNVQQQNFGNNRGRGHSNFGQNRGGRTWNNRNRPQCQICNKFGHIAVKCYSR

Query:  VQMPGAYATQFSPSGPVFPSCHNFGQKQFGGPFPQMQAMMAAPNYNQDCNQYPDSGAMNHLTNSLSNMSVSSEYPRNNQILIGNGAGLPISNLG
                T  S S    P+ HN          PQM AM+AA + N D N YPDSGA NHLT+SLSN+S+ SEY   NQI   NG+GLPI++ G
Subjt:  VQMPGAYATQFSPSGPVFPSCHNFGQKQFGGPFPQMQAMMAAPNYNQDCNQYPDSGAMNHLTNSLSNMSVSSEYPRNNQILIGNGAGLPISNLG

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-949.5e-3532.49Show/hide
Query:  NSEISSRSQSSQAINPENKISTVKLSDEKFLLWKFQILTALEGHDLDQHISEDCEPPPEKVSEN------------------------------GAMEKM
        N+E S  S  +Q     NKIS VKL+D+ FLLWKFQILTALE +DL+  +  + EPP + +                                 G+M + 
Subjt:  NSEISSRSQSSQAINPENKISTVKLSDEKFLLWKFQILTALEGHDLDQHISEDCEPPPEKVSEN------------------------------GAMEKM

Query:  LMFKTIFC--------------------------------------------------------------------IFSRLGAEYETMVSVITAKTSTQT
        ++ + + C                                                                    I + LG++Y++M+SVI+A+T + +
Subjt:  LMFKTIFC--------------------------------------------------------------------IFSRLGAEYETMVSVITAKTSTQT

Query:  IHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQNTVQ--NSSQNVQQQNFGNNRGRGHSNFGQNRGGRTWNNRNRPQCQICNKFGHIAVKCYSR
        + +V++LLLT ES+ ESK  +I +  LPS N+  Q   +       ++QN    N   N+  G  N   NRG R   NRN+PQCQIC K G+ A +C+ R
Subjt:  IHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQNTVQ--NSSQNVQQQNFGNNRGRGHSNFGQNRGGRTWNNRNRPQCQICNKFGHIAVKCYSR

Query:  VQMPGAYATQFSPSGPVFPSCHNFGQKQFGGPFPQMQAMMAAPNYNQDCNQYPDSGAMNHLTNSLSNMSVSSEYPRNNQILIGNGAGLPISNLG
                T  S S    P+ HN          PQM AM+AA + N D N YPDSGA NHLT+SLSN+S+ SEY   NQI   NG+GLPI++ G
Subjt:  VQMPGAYATQFSPSGPVFPSCHNFGQKQFGGPFPQMQAMMAAPNYNQDCNQYPDSGAMNHLTNSLSNMSVSSEYPRNNQILIGNGAGLPISNLG

A0A6J1C6N9 dr1-associated corepressor homolog isoform X12.1e-2639.5Show/hide
Query:  AMEKMLMFKTIFCIFSRLGAEYETMVSVITAKTSTQTIHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQNTVQNSSQNVQQQNFGNNRGRGHS
        A +K+ +   I  I + L +E+E+ VSVI+A+T TQT+ +V +LLL+HE R E + ++  D  LPS NL  Q    N+ Q  S + Q+    NNR +   
Subjt:  AMEKMLMFKTIFCIFSRLGAEYETMVSVITAKTSTQTIHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQNTVQNSSQNVQQQNFGNNRGRGHS

Query:  NFGQNRGGRTWNNRNRPQCQICNKFGHIAVKCYSRVQMP-----GAYATQFSPSGPVFPSCHN----FGQKQ--FGGPF-----PQMQAMMAAPNYNQDC
        N G     R WN+ NRPQCQI  KFGH A++CY R +       G    Q   SG    S ++    FG +Q  F   F       M A +A  ++N+D 
Subjt:  NFGQNRGGRTWNNRNRPQCQICNKFGHIAVKCYSRVQMP-----GAYATQFSPSGPVFPSCHN----FGQKQ--FGGPF-----PQMQAMMAAPNYNQDC

Query:  NQYPDSGAMNHLTNSLSNMSVSSEYPRNNQILIGNGAG
        N YPDSGA NH+T++ +N++ S+EY  +NQ+ IGNG G
Subjt:  NQYPDSGAMNHLTNSLSNMSVSSEYPRNNQILIGNGAG

A0A6J1DLT9 uncharacterized protein LOC1110217577.0e-3029.34Show/hide
Query:  SENNNSEISSRSQSSQAINPENKISTVKLSDEKFLLWKFQILTALEGHDLDQHISEDCEPPPEKVSEN------------------------------GA
        S   NS+ +   Q+S+ INP +K+S V+L+D+  LLWKFQI TAL+G+ L+ +I  + + P + V                                 G+
Subjt:  SENNNSEISSRSQSSQAINPENKISTVKLSDEKFLLWKFQILTALEGHDLDQHISEDCEPPPEKVSEN------------------------------GA

Query:  MEKMLMFKTIFC--------------------------------------------------------------------IFSRLGAEYETMVSVITAKT
        M + ++ + + C                                                                    I + LG E++ ++SVITA+ 
Subjt:  MEKMLMFKTIFC--------------------------------------------------------------------IFSRLGAEYETMVSVITAKT

Query:  STQTIHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQ--NTVQNSSQNVQQQNFGNNRGRGHSNFGQNRGGRTWNNRNRPQCQICNKFGHIAVK
          QT+ +V +LLL  E R E +  +  D  LPS NL + +  +  N  Q+   N  Q N+ + RGRG +N   NR  R W   N+PQCQIC +FGH A++
Subjt:  STQTIHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQ--NTVQNSSQNVQQQNFGNNRGRGHSNFGQNRGGRTWNNRNRPQCQICNKFGHIAVK

Query:  CYSRVQM----PGAYATQFSPSG-----PVFPSCHNF--------GQKQFGGPFP-QMQAMMAAPNYNQDCNQYPDSGAMNHLTNSLSNMSVSSEYPRNN
        CY R +     P      FSP+G     P+    HN         G        P QMQA+M A ++N+D N Y DSG  NH+TN   N S+ SEY  + 
Subjt:  CYSRVQM----PGAYATQFSPSG-----PVFPSCHNF--------GQKQFGGPFP-QMQAMMAAPNYNQDCNQYPDSGAMNHLTNSLSNMSVSSEYPRNN

Query:  QILIGNGAG
        +I +GNG G
Subjt:  QILIGNGAG

A0A6J1DYD5 uncharacterized protein LOC1110246581.5e-2742.15Show/hide
Query:  IFCIFSRLGAEYETMVSVITAKTSTQTIHDVVALLLTHESRIESKTAVIPDNILPST--NLAVQNPIQNTVQNS------SQNVQQQNFGNNRGRGHSNF
        I  I S LG+EYE+ VSVIT K    TI DV ALLL+H+ RIE + +   D  LPS   NLA Q   QN   NS        +  QQ++ N+  RG   F
Subjt:  IFCIFSRLGAEYETMVSVITAKTSTQTIHDVVALLLTHESRIESKTAVIPDNILPST--NLAVQNPIQNTVQNS------SQNVQQQNFGNNRGRGHSNF

Query:  GQNRGGRTWNNRNRPQCQICNKFGHIAVKCY---SRVQMPGAYATQFSPSGPVFPSCHNFGQKQFGGPFPQMQAMMAAPNYNQDCNQYPDSGAMNHLTNS
         +N GGR WN+RN+ QCQIC++FGH A + Y   S VQ    Y+T++      F S   + Q+Q         AM+ + + N+D N YPDSGA NH+T+ 
Subjt:  GQNRGGRTWNNRNRPQCQICNKFGHIAVKCY---SRVQMPGAYATQFSPSGPVFPSCHNFGQKQFGGPFPQMQAMMAAPNYNQDCNQYPDSGAMNHLTNS

Query:  LSNMSVSSEYPRNNQILIGNGAG
        L N+S+ +E      I   NGAG
Subjt:  LSNMSVSSEYPRNNQILIGNGAG

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.0e-0926.64Show/hide
Query:  IFSRLGAEYETMVSVITAKTSTQTIHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQNTVQNSSQNVQQQNFGNNRGRGHSNFGQNRGGRTW--
        +   L  EY+ ++  I AK +  T+ ++   LL HES+I    AV    ++P T  AV +    T  N        N   NR   + N   N   + W  
Subjt:  IFSRLGAEYETMVSVITAKTSTQTIHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQNTVQNSSQNVQQQNFGNNRGRGHSNFGQNRGGRTW--

Query:  --------NNRNRP---QCQICNKFGHIAVKCYSRVQMPGAYATQFSPSGPVFPSCHNFGQKQFGGPFPQMQAMMAAPNYNQDCNQYPDSGAMNHLTNSL
                NN+++P   +CQIC   GH A +C S++Q   +      P  P  P               Q +A +A  +     N   DSGA +H+T+  
Subjt:  --------NNRNRP---QCQICNKFGHIAVKCYSRVQMPGAYATQFSPSGPVFPSCHNFGQKQFGGPFPQMQAMMAAPNYNQDCNQYPDSGAMNHLTNSL

Query:  SNMSVSSEYPRNNQILIGNGAGLPISNLG
        +N+S+   Y   + +++ +G+ +PIS+ G
Subjt:  SNMSVSSEYPRNNQILIGNGAGLPISNLG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.3e-0825.33Show/hide
Query:  IFSRLGAEYETMVSVITAKTSTQTIHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQNTVQNSSQNVQQQNFGNNRGRGHSNFGQNRGGRTWNN
        +   L  +Y+ ++  I AK +  ++ ++   L+  ES++    A+    ++P T   V +   NT +N +     +N+ NN  R +S    + G R+ N 
Subjt:  IFSRLGAEYETMVSVITAKTSTQTIHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQNTVQNSSQNVQQQNFGNNRGRGHSNFGQNRGGRTWNN

Query:  RNRP---QCQICNKFGHIAVKCYSRVQMPGAYATQFSPSGPVFPSCHNF----GQKQFGGPFP--QMQAMMAAPNYNQDCNQYPDSGAMNHLTNSLSNMS
        + +P   +CQIC+  GH A +C                     P  H F     Q+Q   PF   Q +A +A  +     N   DSGA +H+T+  +N+S
Subjt:  RNRP---QCQICNKFGHIAVKCYSRVQMPGAYATQFSPSGPVFPSCHNF----GQKQFGGPFP--QMQAMMAAPNYNQDCNQYPDSGAMNHLTNSLSNMS

Query:  VSSEYPRNNQILIGNGAGLPISNLG
            Y   + ++I +G+ +PI++ G
Subjt:  VSSEYPRNNQILIGNGAGLPISNLG

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCCCACACATGCAATGTCGACCCCTTGACATAATTGAATACATGCGTAAGAAACATGGCATTGATATTAGTTATGGTACAGCTTGGAGAGCAAGGGAGATAGCATT
ACGTGATATTAGGGGCTCCTTAGAAGAATCTCATGCCCTTATCCCATCATTTGCAGCAAGACTAATTGAAAAAAGCTCAGGATGGGAACTCACAGAACTATCCCTTGGCT
TTTTTTGCATTGTTGACTCAGAGAATGACACTTCCTTTAAGATTGCGAAAAAACCACTCCCACCAAAGCGTCATTCTCTGAGTCAACATTACAAAAAGCCAAGGGATAGT
TTTGTGAGTTCCCATCCAACGTACAAGCCAGCCTTGTCTAACGACCTATTTTTTGTTCGCATGGAGACCTCTGAGAATAATAACTCAGAAATTTCGAGCCGTTCGCAAAG
TAGCCAAGCGATCAACCCCGAAAACAAGATCTCAACTGTTAAGTTGTCCGATGAGAAATTTCTTCTTTGGAAGTTTCAAATTCTTACTGCACTTGAGGGGCATGATCTCG
ATCAACATATCAGTGAAGATTGTGAACCACCGCCTGAGAAAGTAAGTGAAAATGGTGCAATGGAAAAGATGTTGATGTTCAAGACCATATTTTGTATTTTCTCTCGTCTA
GGGGCTGAGTATGAGACTATGGTGTCTGTTATCACTGCTAAAACTAGTACACAGACTATTCATGATGTTGTAGCTTTGTTATTAACTCATGAAAGTCGCATTGAGAGTAA
AACTGCTGTTATTCCTGATAATATTCTACCCTCGACTAATTTGGCCGTTCAAAATCCTATACAAAACACTGTGCAAAATTCTTCTCAGAATGTGCAACAGCAAAATTTTG
GTAATAATAGAGGTAGAGGTCATTCAAATTTTGGTCAAAATAGAGGTGGAAGAACCTGGAATAATCGAAATCGACCTCAATGTCAGATATGTAATAAATTTGGTCATATT
GCTGTTAAATGTTACTCTCGTGTCCAAATGCCTGGTGCTTATGCTACCCAGTTCAGTCCTTCTGGTCCTGTTTTTCCCTCTTGTCATAATTTTGGCCAGAAACAATTTGG
TGGTCCTTTCCCACAAATGCAGGCTATGATGGCTGCTCCTAATTACAACCAAGATTGTAACCAGTATCCTGACTCAGGAGCCATGAATCACTTGACGAATAGCCTGAGTA
ACATGTCTGTGAGTTCTGAATATCCTAGAAACAATCAGATTTTGATTGGCAATGGTGCAGGTTTGCCTATCTCTAATCTTGGCTATCAAACAAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGACCCCACACATGCAATGTCGACCCCTTGACATAATTGAATACATGCGTAAGAAACATGGCATTGATATTAGTTATGGTACAGCTTGGAGAGCAAGGGAGATAGCATT
ACGTGATATTAGGGGCTCCTTAGAAGAATCTCATGCCCTTATCCCATCATTTGCAGCAAGACTAATTGAAAAAAGCTCAGGATGGGAACTCACAGAACTATCCCTTGGCT
TTTTTTGCATTGTTGACTCAGAGAATGACACTTCCTTTAAGATTGCGAAAAAACCACTCCCACCAAAGCGTCATTCTCTGAGTCAACATTACAAAAAGCCAAGGGATAGT
TTTGTGAGTTCCCATCCAACGTACAAGCCAGCCTTGTCTAACGACCTATTTTTTGTTCGCATGGAGACCTCTGAGAATAATAACTCAGAAATTTCGAGCCGTTCGCAAAG
TAGCCAAGCGATCAACCCCGAAAACAAGATCTCAACTGTTAAGTTGTCCGATGAGAAATTTCTTCTTTGGAAGTTTCAAATTCTTACTGCACTTGAGGGGCATGATCTCG
ATCAACATATCAGTGAAGATTGTGAACCACCGCCTGAGAAAGTAAGTGAAAATGGTGCAATGGAAAAGATGTTGATGTTCAAGACCATATTTTGTATTTTCTCTCGTCTA
GGGGCTGAGTATGAGACTATGGTGTCTGTTATCACTGCTAAAACTAGTACACAGACTATTCATGATGTTGTAGCTTTGTTATTAACTCATGAAAGTCGCATTGAGAGTAA
AACTGCTGTTATTCCTGATAATATTCTACCCTCGACTAATTTGGCCGTTCAAAATCCTATACAAAACACTGTGCAAAATTCTTCTCAGAATGTGCAACAGCAAAATTTTG
GTAATAATAGAGGTAGAGGTCATTCAAATTTTGGTCAAAATAGAGGTGGAAGAACCTGGAATAATCGAAATCGACCTCAATGTCAGATATGTAATAAATTTGGTCATATT
GCTGTTAAATGTTACTCTCGTGTCCAAATGCCTGGTGCTTATGCTACCCAGTTCAGTCCTTCTGGTCCTGTTTTTCCCTCTTGTCATAATTTTGGCCAGAAACAATTTGG
TGGTCCTTTCCCACAAATGCAGGCTATGATGGCTGCTCCTAATTACAACCAAGATTGTAACCAGTATCCTGACTCAGGAGCCATGAATCACTTGACGAATAGCCTGAGTA
ACATGTCTGTGAGTTCTGAATATCCTAGAAACAATCAGATTTTGATTGGCAATGGTGCAGGTTTGCCTATCTCTAATCTTGGCTATCAAACAAATTAA
Protein sequenceShow/hide protein sequence
MTPHMQCRPLDIIEYMRKKHGIDISYGTAWRAREIALRDIRGSLEESHALIPSFAARLIEKSSGWELTELSLGFFCIVDSENDTSFKIAKKPLPPKRHSLSQHYKKPRDS
FVSSHPTYKPALSNDLFFVRMETSENNNSEISSRSQSSQAINPENKISTVKLSDEKFLLWKFQILTALEGHDLDQHISEDCEPPPEKVSENGAMEKMLMFKTIFCIFSRL
GAEYETMVSVITAKTSTQTIHDVVALLLTHESRIESKTAVIPDNILPSTNLAVQNPIQNTVQNSSQNVQQQNFGNNRGRGHSNFGQNRGGRTWNNRNRPQCQICNKFGHI
AVKCYSRVQMPGAYATQFSPSGPVFPSCHNFGQKQFGGPFPQMQAMMAAPNYNQDCNQYPDSGAMNHLTNSLSNMSVSSEYPRNNQILIGNGAGLPISNLGYQTN