; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000290 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000290
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr4:3049769..3051052
RNA-Seq ExpressionLag0000290
SyntenyLag0000290
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa]2.6e-5547.74Show/hide
Query:  PLYPPSTGFFQPYYPSSFPRPQYPPLPQAPQIPQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDY
        P  PP T    P   SS P P        PQI      PN  PS+ QPL+VKL D N+++WK QLLN V+ANGL  FLDGS   PP+FLD QQ Q NP++
Subjt:  PLYPPSTGFFQPYYPSSFPRPQYPPLPQAPQIPQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDY

Query:  LTWERYNRFIMCWMYSSLSEEKMGEIVNLESASEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILD
         +W+RYNR +M W+Y+S++E  +G+IV   SAS+IW +L+R Y + + A +  L+T LQ I+K+GL+   Y+ + + + +  ++IGEP++Y DHL + L 
Subjt:  LTWERYNRFIMCWMYSSLSEEKMGEIVNLESASEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILD

Query:  GLGSEYNAFVTSIQNRADNPALEDVRSLLLAYEARLEKQNSVD
        GLG +YN FVTSIQ++A  P++E+V SLLL+Y+ARLE+Q++ D
Subjt:  GLGSEYNAFVTSIQNRADNPALEDVRSLLLAYEARLEKQNSVD

PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale]6.6e-5944.78Show/hide
Query:  PSSFPRPQ------YPPLPQAPQIPQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNR
        P++ P  Q       PPLP  P +          PS+ QP ++KL   N+L+WKNQLLN ++ANGL  F+DGS P PP+F D  +   N +Y+ W+R+NR
Subjt:  PSSFPRPQ------YPPLPQAPQIPQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNR

Query:  FIMCWMYSSLSEEKMGEIVNLESASEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNA
         IM W+Y+SL++  MG+IV   SA EIW +L + Y S + A+I  L+ +LQ +RKDGL+  +Y+ + K+I +  +A+GEP+S +DHL ++  GL  EYNA
Subjt:  FIMCWMYSSLSEEKMGEIVNLESASEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNA

Query:  FVTSIQNRADNPALEDVRSLLLAYEARLEKQNSVDHLNLAQANLASLSLNNNSRRSTSRSPSVTFPRPPFN-PLPFTSSPVASTGFSPSILGKPQSQ
        FVTSI  R DN  LE++ SLLL+YE RLE QN+   L+  QANLA L++N    R    +P   F +   N    F S P  S  F PSILGKPQ +
Subjt:  FVTSIQNRADNPALEDVRSLLLAYEARLEKQNSVDHLNLAQANLASLSLNNNSRRSTSRSPSVTFPRPPFN-PLPFTSSPVASTGFSPSILGKPQSQ

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]9.2e-5348.71Show/hide
Query:  APQIPQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNRFIMCWMYSSLSEEKMGEIVN
        APQI Q      P PSL Q LS+KL +TN LL K+QLLN ++ANGL  F+D    +PPK+LDA   Q NP+++ W+R N+ +M W+YSSL+   +G+IV 
Subjt:  APQIPQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNRFIMCWMYSSLSEEKMGEIVN

Query:  LESASEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRADNPALEDVRSL
          +A +IW SL   Y+S + A +M L +QLQRI+K  + +S+YLS++K + D+F+ IGEP+SYRD L  IL+GL  EY+ FVTSI NR+D P+L++V SL
Subjt:  LESASEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRADNPALEDVRSL

Query:  LLAYEARLEKQNSVDHLNLAQANLASLSLNNN
        L  YE RL +++   +LN  QAN      NN+
Subjt:  LLAYEARLEKQNSVDHLNLAQANLASLSLNNN

RVX14312.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]3.1e-4840.71Show/hide
Query:  PQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNRFIMCWMYSSLSEEKMGEIVNLESA
        P   FA +  PSL Q  +V L  +N+LLW+ Q+LN ++ANGL   + G IP P +FL       NP+Y  W+R NR +MCW+YSSL+E  M +I+ L++A
Subjt:  PQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNRFIMCWMYSSLSEEKMGEIVNLESA

Query:  SEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRADNPALEDVRSLLLAY
        SEIW +L++ + + + ARIM L+ QLQ  +K GLS+ +YL +IK I D   AIGE I+ +D + ++L GLG+EYN+FV ++ +  +  +LE++ S+LL +
Subjt:  SEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRADNPALEDVRSLLLAY

Query:  EARLEKQNSVDHLNLAQANLASLSLNNNSRRSTSRSPSVTFPRPPFNPLPFTS
        E +LE+Q+  +  NL QAN+ ++++  +++++   S   T  R  FN   ++S
Subjt:  EARLEKQNSVDHLNLAQANLASLSLNNNSRRSTSRSPSVTFPRPPFNPLPFTS

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]1.1e-9868.21Show/hide
Query:  PPLPQAPQIPQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNRFIMCWMYSSLSEEKM
        PP P     P   F+ NP+P+LPQPL+VKL D NFLLWKNQLLNAV+ANGL G+LDG+I  PP+FLD  Q QPNP Y  WERYNR +MCW+YSSLSEEKM
Subjt:  PPLPQAPQIPQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNRFIMCWMYSSLSEEKM

Query:  GEIVNLESASEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRADNPALE
        GE+V+LE+  +IW+SL R YDSKTTARIMGLKT+LQ +RKDG SVSQYL++IK+I DKF+A+GEP+SYRDHLAH+LDGLGSEYNAFVTSI NRAD+P+LE
Subjt:  GEIVNLESASEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRADNPALE

Query:  DVRSLLLAYEARLEKQNSVDHLNLAQANLASLSLNNNSRRSTSRSPSVTFPRPPFNPLPFTSSPVASTGFSPSILGKPQS
        DVRSLLLAYEARL+KQN+VD LN+AQANL +LSL +NS+R     P   F  P      F +SP+ S   S SILGKPQS
Subjt:  DVRSLLLAYEARLEKQNSVDHLNLAQANLASLSLNNNSRRSTSRSPSVTFPRPPFNPLPFTSSPVASTGFSPSILGKPQS

TrEMBL top hitse value%identityAlignment
A0A2P5BGF8 Uncharacterized protein3.2e-5944.78Show/hide
Query:  PSSFPRPQ------YPPLPQAPQIPQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNR
        P++ P  Q       PPLP  P +          PS+ QP ++KL   N+L+WKNQLLN ++ANGL  F+DGS P PP+F D  +   N +Y+ W+R+NR
Subjt:  PSSFPRPQ------YPPLPQAPQIPQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNR

Query:  FIMCWMYSSLSEEKMGEIVNLESASEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNA
         IM W+Y+SL++  MG+IV   SA EIW +L + Y S + A+I  L+ +LQ +RKDGL+  +Y+ + K+I +  +A+GEP+S +DHL ++  GL  EYNA
Subjt:  FIMCWMYSSLSEEKMGEIVNLESASEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNA

Query:  FVTSIQNRADNPALEDVRSLLLAYEARLEKQNSVDHLNLAQANLASLSLNNNSRRSTSRSPSVTFPRPPFN-PLPFTSSPVASTGFSPSILGKPQSQ
        FVTSI  R DN  LE++ SLLL+YE RLE QN+   L+  QANLA L++N    R    +P   F +   N    F S P  S  F PSILGKPQ +
Subjt:  FVTSIQNRADNPALEDVRSLLLAYEARLEKQNSVDHLNLAQANLASLSLNNNSRRSTSRSPSVTFPRPPFN-PLPFTSSPVASTGFSPSILGKPQSQ

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE14.4e-5348.71Show/hide
Query:  APQIPQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNRFIMCWMYSSLSEEKMGEIVN
        APQI Q      P PSL Q LS+KL +TN LL K+QLLN ++ANGL  F+D    +PPK+LDA   Q NP+++ W+R N+ +M W+YSSL+   +G+IV 
Subjt:  APQIPQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNRFIMCWMYSSLSEEKMGEIVN

Query:  LESASEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRADNPALEDVRSL
          +A +IW SL   Y+S + A +M L +QLQRI+K  + +S+YLS++K + D+F+ IGEP+SYRD L  IL+GL  EY+ FVTSI NR+D P+L++V SL
Subjt:  LESASEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRADNPALEDVRSL

Query:  LLAYEARLEKQNSVDHLNLAQANLASLSLNNN
        L  YE RL +++   +LN  QAN      NN+
Subjt:  LLAYEARLEKQNSVDHLNLAQANLASLSLNNN

A0A438JZB9 Retrovirus-related Pol polyprotein from transposon RE11.5e-4840.71Show/hide
Query:  PQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNRFIMCWMYSSLSEEKMGEIVNLESA
        P   FA +  PSL Q  +V L  +N+LLW+ Q+LN ++ANGL   + G IP P +FL       NP+Y  W+R NR +MCW+YSSL+E  M +I+ L++A
Subjt:  PQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNRFIMCWMYSSLSEEKMGEIVNLESA

Query:  SEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRADNPALEDVRSLLLAY
        SEIW +L++ + + + ARIM L+ QLQ  +K GLS+ +YL +IK I D   AIGE I+ +D + ++L GLG+EYN+FV ++ +  +  +LE++ S+LL +
Subjt:  SEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRADNPALEDVRSLLLAY

Query:  EARLEKQNSVDHLNLAQANLASLSLNNNSRRSTSRSPSVTFPRPPFNPLPFTS
        E +LE+Q+  +  NL QAN+ ++++  +++++   S   T  R  FN   ++S
Subjt:  EARLEKQNSVDHLNLAQANLASLSLNNNSRRSTSRSPSVTFPRPPFNPLPFTS

A0A6J1DQX7 uncharacterized protein LOC1110223155.4e-9968.21Show/hide
Query:  PPLPQAPQIPQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNRFIMCWMYSSLSEEKM
        PP P     P   F+ NP+P+LPQPL+VKL D NFLLWKNQLLNAV+ANGL G+LDG+I  PP+FLD  Q QPNP Y  WERYNR +MCW+YSSLSEEKM
Subjt:  PPLPQAPQIPQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNRFIMCWMYSSLSEEKM

Query:  GEIVNLESASEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRADNPALE
        GE+V+LE+  +IW+SL R YDSKTTARIMGLKT+LQ +RKDG SVSQYL++IK+I DKF+A+GEP+SYRDHLAH+LDGLGSEYNAFVTSI NRAD+P+LE
Subjt:  GEIVNLESASEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRADNPALE

Query:  DVRSLLLAYEARLEKQNSVDHLNLAQANLASLSLNNNSRRSTSRSPSVTFPRPPFNPLPFTSSPVASTGFSPSILGKPQS
        DVRSLLLAYEARL+KQN+VD LN+AQANL +LSL +NS+R     P   F  P      F +SP+ S   S SILGKPQS
Subjt:  DVRSLLLAYEARLEKQNSVDHLNLAQANLASLSLNNNSRRSTSRSPSVTFPRPPFNPLPFTSSPVASTGFSPSILGKPQS

A0A7J0EGI5 Uncharacterized protein1.3e-5547.74Show/hide
Query:  PLYPPSTGFFQPYYPSSFPRPQYPPLPQAPQIPQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDY
        P  PP T    P   SS P P        PQI      PN  PS+ QPL+VKL D N+++WK QLLN V+ANGL  FLDGS   PP+FLD QQ Q NP++
Subjt:  PLYPPSTGFFQPYYPSSFPRPQYPPLPQAPQIPQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDY

Query:  LTWERYNRFIMCWMYSSLSEEKMGEIVNLESASEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILD
         +W+RYNR +M W+Y+S++E  +G+IV   SAS+IW +L+R Y + + A +  L+T LQ I+K+GL+   Y+ + + + +  ++IGEP++Y DHL + L 
Subjt:  LTWERYNRFIMCWMYSSLSEEKMGEIVNLESASEIWNSLKRSYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILD

Query:  GLGSEYNAFVTSIQNRADNPALEDVRSLLLAYEARLEKQNSVD
        GLG +YN FVTSIQ++A  P++E+V SLLL+Y+ARLE+Q++ D
Subjt:  GLGSEYNAFVTSIQNRADNPALEDVRSLLLAYEARLEKQNSVD

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.6e-0723.85Show/hide
Query:  DTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNRFIMCWMYSSLSEEKMGEIVNLESASEIWNSLKRSYDSKTTARIMGL
        D  F  W+ ++ + ++  GLH  LD     P    D  +++       W   +      +   LS++ +  I++ ++A  IW  L+  Y SKT    + L
Subjt:  DTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNRFIMCWMYSSLSEEKMGEIVNLESASEIWNSLKRSYDSKTTARIMGL

Query:  KTQLQRIR-KDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRADNPALEDVRSLLLAYEARLEKQNSVDHLNLAQANLA
        K QL  +   +G +   +L+    +  + + +G  I   D    +L+ L S Y+   T+I +      L+DV S LL  E   +K  +     + +    
Subjt:  KTQLQRIR-KDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRADNPALEDVRSLLLAYEARLEKQNSVDHLNLAQANLA

Query:  SLS-LNNNSRRSTSRSPS
        S    +NN  RS +R  S
Subjt:  SLS-LNNNSRRSTSRSPS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.2e-2025.11Show/hide
Query:  KLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQS-QPNPDYLTWERYNRFIMCWMYSSLSEEKMGEIVNLESASEIWNSLKRSYDSKTTAR
        KLT TN+L+W  Q+        L GFLDGS   PP  +    + + NPDY  W+R ++ I   +  ++S      +    +A++IW +L++ Y + +   
Subjt:  KLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQS-QPNPDYLTWERYNRFIMCWMYSSLSEEKMGEIVNLESASEIWNSLKRSYDSKTTAR

Query:  IMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRADNPALEDVRSLLLAYEARLEKQNSVDHLNL---
        +  L+TQL++  K   ++  Y+  +    D+ + +G+P+ + + +  +L+ L  EY   +  I  +   P L ++   LL +E+++   +S   + +   
Subjt:  IMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRADNPALEDVRSLLLAYEARLEKQNSVDHLNL---

Query:  --AQANLASLSLNNNSRRS
          +  N  + + NNN  R+
Subjt:  --AQANLASLSLNNNSRRS

Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)3.0e-1724.88Show/hide
Query:  LSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNRFIMCWMYSSLSEEKMGEIVNLE-SASEIWNSLKRSYDSKT
        +++ L   N+ +W+       L+ G+ G +DGS  TP    + +          W+  +  +  W+Y ++++  +  I+ +  +A ++W SL+  +    
Subjt:  LSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNRFIMCWMYSSLSEEKMGEIVNLE-SASEIWNSLKRSYDSKT

Query:  TARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRADNPALEDVRSLLLAYEARL--EKQNSVDHL
         AR +  + +L+    D LSV +Y  ++K ++D  + +  PIS R  + H+L+GL  +Y+  +  I++++  P+  + RS+LL  E+RL  + ++S+ H 
Subjt:  TARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRADNPALEDVRSLLLAYEARL--EKQNSVDHL

Query:  N
        N
Subjt:  N


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAACAGAAGCCTCCTCCTCGTCTTCTTCTTCTGCACCGCTTACCTCACCAGTCCTTTCTCCATCAACTCCCGTTACTACACCGATTTCTACACCAATAGACCAGAC
AACTCAGCTACGCCCTTCTTTTCCCCAAATCCGTCCGAACTCCACCCCTGTATTTCAACAAACTCGACCCACAATCTCTACTCAGACTTCACCTCAACCATACCCACAAA
ATCCATACCCATCTCAGCCCCTTTATCCACCTTCTACTGGATTTTTCCAACCTTATTACCCATCATCCTTTCCACGCCCACAGTATCCTCCGTTGCCGCAAGCACCTCAG
ATTCCTCAACCTCATTTTGCCCCAAATCCCTACCCGTCTTTACCTCAACCTCTATCAGTTAAGCTCACAGATACAAATTTCTTACTCTGGAAGAATCAACTGCTGAATGC
GGTGCTGGCCAATGGACTTCATGGTTTTCTCGATGGCTCCATACCGACTCCTCCAAAGTTCCTGGATGCTCAACAATCTCAACCAAATCCGGATTATCTCACTTGGGAAA
GGTACAATCGATTCATTATGTGTTGGATGTATTCCTCACTCTCTGAAGAAAAAATGGGTGAAATAGTGAATCTAGAATCTGCCTCTGAAATATGGAACTCCTTGAAACGC
TCTTACGATTCTAAGACTACTGCTAGGATAATGGGCCTTAAAACACAACTTCAAAGAATTAGGAAGGATGGTCTCTCTGTTAGCCAGTACTTGTCTCAAATTAAGGATAT
CACTGATAAATTTTCAGCTATTGGAGAGCCCATTTCTTATCGAGATCATTTGGCTCATATCTTAGATGGTCTTGGGAGTGAATACAATGCGTTTGTTACTTCTATTCAGA
ACCGTGCTGATAATCCTGCCCTAGAGGATGTCCGTAGTCTCCTTCTTGCTTATGAAGCCCGGTTAGAAAAACAGAATAGTGTTGATCACCTTAACTTAGCTCAGGCAAAT
CTTGCTAGTCTTAGTCTCAACAATAACAGCCGCCGGTCTACTTCTCGTTCCCCTTCAGTGACTTTCCCTAGACCTCCCTTCAATCCATTGCCTTTTACCTCCTCTCCCGT
AGCCTCAACTGGCTTTTCCCCTAGTATTCTTGGTAAGCCTCAATCCCAACCACTACAAAATGGCCTTCCCGATCAAATCCTAGTCGCCCTCAGTGCCAAATATGTGGTAA
ATTTGGGCATACCGCCCTTATTTGTCATCACCGAACCAATCTTGCTTATCAAACTCCTCCTCCACAAGCTATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCAACAGAAGCCTCCTCCTCGTCTTCTTCTTCTGCACCGCTTACCTCACCAGTCCTTTCTCCATCAACTCCCGTTACTACACCGATTTCTACACCAATAGACCAGAC
AACTCAGCTACGCCCTTCTTTTCCCCAAATCCGTCCGAACTCCACCCCTGTATTTCAACAAACTCGACCCACAATCTCTACTCAGACTTCACCTCAACCATACCCACAAA
ATCCATACCCATCTCAGCCCCTTTATCCACCTTCTACTGGATTTTTCCAACCTTATTACCCATCATCCTTTCCACGCCCACAGTATCCTCCGTTGCCGCAAGCACCTCAG
ATTCCTCAACCTCATTTTGCCCCAAATCCCTACCCGTCTTTACCTCAACCTCTATCAGTTAAGCTCACAGATACAAATTTCTTACTCTGGAAGAATCAACTGCTGAATGC
GGTGCTGGCCAATGGACTTCATGGTTTTCTCGATGGCTCCATACCGACTCCTCCAAAGTTCCTGGATGCTCAACAATCTCAACCAAATCCGGATTATCTCACTTGGGAAA
GGTACAATCGATTCATTATGTGTTGGATGTATTCCTCACTCTCTGAAGAAAAAATGGGTGAAATAGTGAATCTAGAATCTGCCTCTGAAATATGGAACTCCTTGAAACGC
TCTTACGATTCTAAGACTACTGCTAGGATAATGGGCCTTAAAACACAACTTCAAAGAATTAGGAAGGATGGTCTCTCTGTTAGCCAGTACTTGTCTCAAATTAAGGATAT
CACTGATAAATTTTCAGCTATTGGAGAGCCCATTTCTTATCGAGATCATTTGGCTCATATCTTAGATGGTCTTGGGAGTGAATACAATGCGTTTGTTACTTCTATTCAGA
ACCGTGCTGATAATCCTGCCCTAGAGGATGTCCGTAGTCTCCTTCTTGCTTATGAAGCCCGGTTAGAAAAACAGAATAGTGTTGATCACCTTAACTTAGCTCAGGCAAAT
CTTGCTAGTCTTAGTCTCAACAATAACAGCCGCCGGTCTACTTCTCGTTCCCCTTCAGTGACTTTCCCTAGACCTCCCTTCAATCCATTGCCTTTTACCTCCTCTCCCGT
AGCCTCAACTGGCTTTTCCCCTAGTATTCTTGGTAAGCCTCAATCCCAACCACTACAAAATGGCCTTCCCGATCAAATCCTAGTCGCCCTCAGTGCCAAATATGTGGTAA
ATTTGGGCATACCGCCCTTATTTGTCATCACCGAACCAATCTTGCTTATCAAACTCCTCCTCCACAAGCTATGA
Protein sequenceShow/hide protein sequence
MSTEASSSSSSSAPLTSPVLSPSTPVTTPISTPIDQTTQLRPSFPQIRPNSTPVFQQTRPTISTQTSPQPYPQNPYPSQPLYPPSTGFFQPYYPSSFPRPQYPPLPQAPQ
IPQPHFAPNPYPSLPQPLSVKLTDTNFLLWKNQLLNAVLANGLHGFLDGSIPTPPKFLDAQQSQPNPDYLTWERYNRFIMCWMYSSLSEEKMGEIVNLESASEIWNSLKR
SYDSKTTARIMGLKTQLQRIRKDGLSVSQYLSQIKDITDKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNRADNPALEDVRSLLLAYEARLEKQNSVDHLNLAQAN
LASLSLNNNSRRSTSRSPSVTFPRPPFNPLPFTSSPVASTGFSPSILGKPQSQPLQNGLPDQILVALSAKYVVNLGIPPLFVITEPILLIKLLLHKL