; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035609 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035609
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr3:25263770..25275999
RNA-Seq ExpressionLag0035609
SyntenyLag0035609
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa]5.0e-4239.92Show/hide
Query:  SLNQNPLPQPNYPYPYPQFTHPQPNYFAQPYYPRPPLFPANQAYPQVPATFPPPNSYPSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAP
        S +  P P P    P P  + P PN                   PQ+  T P PN  PS+ +PLAVKL D+N+++WK  LLN+VIANGL  +LDG+   P
Subjt:  SLNQNPLPQPNYPYPYPQFTHPQPNYFAQPYYPRPPLFPANQAYPQVPATFPPPNSYPSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAP

Query:  QKFLDQNQTQPNPEYAGWERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIWTSL--------------------------------TPQIKDITDKFSAI
         +FLD  Q Q NPE+  W+RYNR +M WIY+S++E  +G+IV  T+A  IW +L                                  + + + +  ++I
Subjt:  QKFLDQNQTQPNPEYAGWERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIWTSL--------------------------------TPQIKDITDKFSAI

Query:  GEPISYRDHLAHILDGLGVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARLEKQTAVD
        GEP++Y DHL + L GLG +YN FVTSIQ+ +  PS+E+V SLLL+Y+ARLE+Q+A D
Subjt:  GEPISYRDHLAHILDGLGVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARLEKQTAVD

PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale]3.0e-3938.35Show/hide
Query:  PPPNSYPSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAPQKFLDQNQTQPNPEYAGWERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIW
        PP    PS+ +P  +KL  +N+L+WKN LLNV+IANGL  ++DG+ P P +F D  +   N EY  W+R+NR IM WIY+SL++  MG+IV   +AF+IW
Subjt:  PPPNSYPSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAPQKFLDQNQTQPNPEYAGWERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIW

Query:  TSLT--------------------------------PQIKDITDKFSAIGEPISYRDHLAHILDGLGVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARL
         +L                                  + K+I +  +A+GEP+S +DHL ++  GL  EYN FVTSI    ++  LE++ SLLL+YE RL
Subjt:  TSLT--------------------------------PQIKDITDKFSAIGEPISYRDHLAHILDGLGVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARL

Query:  EKQTAVDQLNIAQVNLSSLNLRHNNNRQSFNK--SQFNQFNPFSKPPFTSSNQQPSFSQTSVLGKP
        E Q A  QL+  Q NL+ LN+     R +F+     F Q        F S     +  Q S+LGKP
Subjt:  EKQTAVDQLNIAQVNLSSLNLRHNNNRQSFNK--SQFNQFNPFSKPPFTSSNQQPSFSQTSVLGKP

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]2.0e-3542.11Show/hide
Query:  PSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAPQKFLDQNQTQPNPEYAGWERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIWTSLTPQ
        PSL + L++KL + N LL K+ LLNV+IANGL  ++D    +P K+LD    Q NPE+  W+R N+ +M WIYSSL+   +G+IV  +TA DIW SL  +
Subjt:  PSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAPQKFLDQNQTQPNPEYAGWERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIWTSLTPQ

Query:  --------------------------------IKDITDKFSAIGEPISYRDHLAHILDGLGVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARLEKQTAV
                                        +K + D+F+ IGEP+SYRD L  IL+GL  EY+ FVTSI N S+ PSL++V SLL  YE RL +++  
Subjt:  --------------------------------IKDITDKFSAIGEPISYRDHLAHILDGLGVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARLEKQTAV

Query:  DQLNIAQVN
          LN  Q N
Subjt:  DQLNIAQVN

RVX14312.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]2.0e-3535.48Show/hide
Query:  NSYPSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAPQKFLDQNQTQPNPEYAGWERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIWTSL
        ++ PSL +   V L  +N+LLW+  +LN++IANGL   + G IPAP +FL  ++   NPEY+ W+R NR +MCWIYSSL+E  M +I+ L TA +IWT+L
Subjt:  NSYPSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAPQKFLDQNQTQPNPEYAGWERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIWTSL

Query:  TP--------------------------------QIKDITDKFSAIGEPISYRDHLAHILDGLGVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARLEKQ
                                          +IK I D   AIGE I+ +D + ++L GLG EYN FV ++ +     SLE++ S+LL +E +LE+Q
Subjt:  TP--------------------------------QIKDITDKFSAIGEPISYRDHLAHILDGLGVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARLEKQ

Query:  TAVDQLNIAQVNLSSLNLR-HNNNRQSFNKSQFNQFNPFSKPPFTSSN
           ++ N+ Q N++++N++ HN   Q  ++ +      F+   ++S N
Subjt:  TAVDQLNIAQVNLSSLNLR-HNNNRQSFNKSQFNQFNPFSKPPFTSSN

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]7.2e-7354.52Show/hide
Query:  QFTHPQPNYFAQPYYPRPPLFPANQAYPQVPATFPPPNSYPSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAPQKFLDQNQTQPNPEYAG
        QF  P PN+ AQP  P P  F A              N +P+LP+PL VKL DNNFLLWKN LLN VIANGL GYLDGTI  P +FLD +Q QPNP Y  
Subjt:  QFTHPQPNYFAQPYYPRPPLFPANQAYPQVPATFPPPNSYPSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAPQKFLDQNQTQPNPEYAG

Query:  WERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIWTSLT--------------------------------PQIKDITDKFSAIGEPISYRDHLAHILDGL
        WERYNR +MCWIYSSLSEEKMGE+V+L T  DIW+SLT                                 +IK+I DKF+A+GEP+SYRDHLAH+LDGL
Subjt:  WERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIWTSLT--------------------------------PQIKDITDKFSAIGEPISYRDHLAHILDGL

Query:  GVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARLEKQTAVDQLNIAQVNLSSLNLRHNNNRQSFNKSQFNQF-NPFSKPPFTSSNQQPSFSQTSVLGKP
        G EYN FVTSI N ++SPSLEDVRSLLLAYEARL+KQ  VDQLNIAQ NL +L+L+HN+ R     S  N + + F   P +++  Q      S+LGKP
Subjt:  GVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARLEKQTAVDQLNIAQVNLSSLNLRHNNNRQSFNKSQFNQF-NPFSKPPFTSSNQQPSFSQTSVLGKP

TrEMBL top hitse value%identityAlignment
A0A2P5BGF8 Uncharacterized protein1.5e-3938.35Show/hide
Query:  PPPNSYPSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAPQKFLDQNQTQPNPEYAGWERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIW
        PP    PS+ +P  +KL  +N+L+WKN LLNV+IANGL  ++DG+ P P +F D  +   N EY  W+R+NR IM WIY+SL++  MG+IV   +AF+IW
Subjt:  PPPNSYPSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAPQKFLDQNQTQPNPEYAGWERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIW

Query:  TSLT--------------------------------PQIKDITDKFSAIGEPISYRDHLAHILDGLGVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARL
         +L                                  + K+I +  +A+GEP+S +DHL ++  GL  EYN FVTSI    ++  LE++ SLLL+YE RL
Subjt:  TSLT--------------------------------PQIKDITDKFSAIGEPISYRDHLAHILDGLGVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARL

Query:  EKQTAVDQLNIAQVNLSSLNLRHNNNRQSFNK--SQFNQFNPFSKPPFTSSNQQPSFSQTSVLGKP
        E Q A  QL+  Q NL+ LN+     R +F+     F Q        F S     +  Q S+LGKP
Subjt:  EKQTAVDQLNIAQVNLSSLNLRHNNNRQSFNK--SQFNQFNPFSKPPFTSSNQQPSFSQTSVLGKP

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE19.9e-3642.11Show/hide
Query:  PSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAPQKFLDQNQTQPNPEYAGWERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIWTSLTPQ
        PSL + L++KL + N LL K+ LLNV+IANGL  ++D    +P K+LD    Q NPE+  W+R N+ +M WIYSSL+   +G+IV  +TA DIW SL  +
Subjt:  PSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAPQKFLDQNQTQPNPEYAGWERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIWTSLTPQ

Query:  --------------------------------IKDITDKFSAIGEPISYRDHLAHILDGLGVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARLEKQTAV
                                        +K + D+F+ IGEP+SYRD L  IL+GL  EY+ FVTSI N S+ PSL++V SLL  YE RL +++  
Subjt:  --------------------------------IKDITDKFSAIGEPISYRDHLAHILDGLGVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARLEKQTAV

Query:  DQLNIAQVN
          LN  Q N
Subjt:  DQLNIAQVN

A0A438JZB9 Retrovirus-related Pol polyprotein from transposon RE19.9e-3635.48Show/hide
Query:  NSYPSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAPQKFLDQNQTQPNPEYAGWERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIWTSL
        ++ PSL +   V L  +N+LLW+  +LN++IANGL   + G IPAP +FL  ++   NPEY+ W+R NR +MCWIYSSL+E  M +I+ L TA +IWT+L
Subjt:  NSYPSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAPQKFLDQNQTQPNPEYAGWERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIWTSL

Query:  TP--------------------------------QIKDITDKFSAIGEPISYRDHLAHILDGLGVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARLEKQ
                                          +IK I D   AIGE I+ +D + ++L GLG EYN FV ++ +     SLE++ S+LL +E +LE+Q
Subjt:  TP--------------------------------QIKDITDKFSAIGEPISYRDHLAHILDGLGVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARLEKQ

Query:  TAVDQLNIAQVNLSSLNLR-HNNNRQSFNKSQFNQFNPFSKPPFTSSN
           ++ N+ Q N++++N++ HN   Q  ++ +      F+   ++S N
Subjt:  TAVDQLNIAQVNLSSLNLR-HNNNRQSFNKSQFNQFNPFSKPPFTSSN

A0A6J1DQX7 uncharacterized protein LOC1110223153.5e-7354.52Show/hide
Query:  QFTHPQPNYFAQPYYPRPPLFPANQAYPQVPATFPPPNSYPSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAPQKFLDQNQTQPNPEYAG
        QF  P PN+ AQP  P P  F A              N +P+LP+PL VKL DNNFLLWKN LLN VIANGL GYLDGTI  P +FLD +Q QPNP Y  
Subjt:  QFTHPQPNYFAQPYYPRPPLFPANQAYPQVPATFPPPNSYPSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAPQKFLDQNQTQPNPEYAG

Query:  WERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIWTSLT--------------------------------PQIKDITDKFSAIGEPISYRDHLAHILDGL
        WERYNR +MCWIYSSLSEEKMGE+V+L T  DIW+SLT                                 +IK+I DKF+A+GEP+SYRDHLAH+LDGL
Subjt:  WERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIWTSLT--------------------------------PQIKDITDKFSAIGEPISYRDHLAHILDGL

Query:  GVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARLEKQTAVDQLNIAQVNLSSLNLRHNNNRQSFNKSQFNQF-NPFSKPPFTSSNQQPSFSQTSVLGKP
        G EYN FVTSI N ++SPSLEDVRSLLLAYEARL+KQ  VDQLNIAQ NL +L+L+HN+ R     S  N + + F   P +++  Q      S+LGKP
Subjt:  GVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARLEKQTAVDQLNIAQVNLSSLNLRHNNNRQSFNKSQFNQF-NPFSKPPFTSSNQQPSFSQTSVLGKP

A0A7J0EGI5 Uncharacterized protein2.4e-4239.92Show/hide
Query:  SLNQNPLPQPNYPYPYPQFTHPQPNYFAQPYYPRPPLFPANQAYPQVPATFPPPNSYPSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAP
        S +  P P P    P P  + P PN                   PQ+  T P PN  PS+ +PLAVKL D+N+++WK  LLN+VIANGL  +LDG+   P
Subjt:  SLNQNPLPQPNYPYPYPQFTHPQPNYFAQPYYPRPPLFPANQAYPQVPATFPPPNSYPSLPKPLAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAP

Query:  QKFLDQNQTQPNPEYAGWERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIWTSL--------------------------------TPQIKDITDKFSAI
         +FLD  Q Q NPE+  W+RYNR +M WIY+S++E  +G+IV  T+A  IW +L                                  + + + +  ++I
Subjt:  QKFLDQNQTQPNPEYAGWERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIWTSL--------------------------------TPQIKDITDKFSAI

Query:  GEPISYRDHLAHILDGLGVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARLEKQTAVD
        GEP++Y DHL + L GLG +YN FVTSIQ+ +  PS+E+V SLLL+Y+ARLE+Q+A D
Subjt:  GEPISYRDHLAHILDGLGVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARLEKQTAVD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)7.8e-0921.65Show/hide
Query:  LAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAPQKFLDQNQTQPNP-EYAGWERYNRFIMCWIYSSLSEEKMGEIVNL-TTAFDIWTSL-------
        + + L   N+ +W+     + ++ G+ G++DG+            + P P     W+  +  +  WIY ++++  +  I+ +  TA D+W SL       
Subjt:  LAVKLTDNNFLLWKNHLLNVVIANGLSGYLDGTIPAPQKFLDQNQTQPNP-EYAGWERYNRFIMCWIYSSLSEEKMGEIVNL-TTAFDIWTSL-------

Query:  -------------------------TPQIKDITDKFSAIGEPISYRDHLAHILDGLGVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARLEKQT
                                   ++K ++D  + +  PIS R  + H+L+GL  +Y+  +  I++ S  PS  + RS+LL  E+RL  ++
Subjt:  -------------------------TPQIKDITDKFSAIGEPISYRDHLAHILDGLGVEYNVFVTSIQNWSNSPSLEDVRSLLLAYEARLEKQT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGAAGAAAGATGTAGAATCTGATGGTGAATTGCTGCCAAAATACTCACACACCAAACCCACCTCCATAAGCTTGTTCTTCAAGTTTTTCTCTTCAAATTCCATCAC
TAGAGGATCCCACAATTCGTTCCAAAGGCCTGAGAATAGCAAGGAAGATCCAGTGGTGGATCCCTCTCATTCGGACGTGATCTCGGTTCATTCATGTACCCTTCCTAACT
TGGGTGTTACAGTTATTTACCGTATATCAGGGAAGCCGCCATTTGAGTTGTTGAGAGTTGAGCTGAAGGTGTTCGTTCTGAGGGGTTGCTGTGTTCTATATTCTTCTTGT
GCCTCTGCCAATGATTCTACCAGGGTTTTGTTTGGAAGGAATTACAGATTTGCAGTGCATTTGAGAAATCAGTTGATGCTGTGGGAAGGTGATGAGGATGTTCAACAAAA
AGTAGGTGATGTTGCTATTACTAGAAGTGAGTTGGAGAATGCAGAAAAAATGTATTTAATCTCAAGATGGAGATGCATGGAGTCTAAATCATCAACTGATGTGGATCATA
AGTCTTCAATATGCCCAAATGATTTATTACAAATGTTATTGAGTCAGAAAAGAGAGTTGATGAAATTGGAAGGTGAATATGTTGGGTCAATGCGAGTTGGGGGGAAATTC
TTCTTCGAAGAAAGGCATGAATACTCTGAAGAAGAATTGCGAAGGACAGAGAGATCGAAAGGAGAACTTTTGGTGCTGGAAGAATACATTACGCTTGTTGAGAAGCCAAG
CTGCCTACAAAAAGTTGTAAACGTCATGTTACGTTGGAGAGATGATTTGAGATTGCCAAAAGATACAATTGTTCGTGAAGGACAAAGACGCATAGAGAAATTCGGGAATT
CTTCTTCGAGCCTAGGCTCTATATTTTTCTGGAAAATGACAACAGAAGCAGGGTCTTCATCATCGTCTTCTTCAGCAGCTCCAGTTACACCAGTGGTTTCTCCTAGCACT
CGAATAACCACACCTATTGTTACTCCAATCCAAAACCCCAGACATCCAGCTCAACCCCATTTTTCTACTCAAAATCCTCCCCCTCAACAAAATCAATTTTCGTTGAATCA
AAATCCTCTTCCTCAACCAAATTATCCCTATCCCTATCCCCAGTTCACTCATCCCCAACCAAATTATTTTGCCCAACCCTATTACCCTCGGCCTCCTCTGTTTCCAGCCA
ACCAAGCTTATCCTCAAGTTCCAGCTACTTTCCCCCCTCCGAATTCCTATCCCTCATTACCAAAACCCCTTGCTGTTAAACTTACCGACAACAATTTTCTATTATGGAAA
AACCACCTGCTCAATGTTGTCATTGCCAATGGTCTCTCTGGCTACTTAGACGGAACGATACCTGCTCCTCAGAAATTTCTCGATCAGAATCAAACACAGCCGAATCCGGA
GTATGCTGGCTGGGAACGATACAACAGATTCATCATGTGCTGGATATATTCCTCTCTCTCGGAAGAAAAAATGGGTGAGATCGTTAACTTAACCACTGCTTTTGATATAT
GGACTTCTCTTACTCCTCAGATTAAGGACATTACTGATAAATTCTCTGCTATTGGTGAACCCATATCATATCGTGATCATTTAGCGCATATTCTTGATGGTCTTGGCGTT
GAATATAATGTTTTTGTTACTAGTATTCAGAATTGGTCGAATAGTCCCTCTTTGGAAGATGTTAGAAGCTTGTTGTTAGCCTATGAAGCTAGATTAGAGAAGCAAACAGC
TGTTGATCAACTCAATATTGCTCAAGTTAATCTCAGTAGCCTTAATCTTCGGCATAATAATAATCGTCAATCTTTTAACAAGTCTCAGTTCAACCAGTTTAATCCCTTTT
CCAAGCCTCCTTTTACTTCCTCGAACCAGCAACCTTCCTTTTCCCAAACCAGTGTGTTAGGCAAGCCTAATTTCCCTACATATCAGCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGGAAGAAAGATGTAGAATCTGATGGTGAATTGCTGCCAAAATACTCACACACCAAACCCACCTCCATAAGCTTGTTCTTCAAGTTTTTCTCTTCAAATTCCATCAC
TAGAGGATCCCACAATTCGTTCCAAAGGCCTGAGAATAGCAAGGAAGATCCAGTGGTGGATCCCTCTCATTCGGACGTGATCTCGGTTCATTCATGTACCCTTCCTAACT
TGGGTGTTACAGTTATTTACCGTATATCAGGGAAGCCGCCATTTGAGTTGTTGAGAGTTGAGCTGAAGGTGTTCGTTCTGAGGGGTTGCTGTGTTCTATATTCTTCTTGT
GCCTCTGCCAATGATTCTACCAGGGTTTTGTTTGGAAGGAATTACAGATTTGCAGTGCATTTGAGAAATCAGTTGATGCTGTGGGAAGGTGATGAGGATGTTCAACAAAA
AGTAGGTGATGTTGCTATTACTAGAAGTGAGTTGGAGAATGCAGAAAAAATGTATTTAATCTCAAGATGGAGATGCATGGAGTCTAAATCATCAACTGATGTGGATCATA
AGTCTTCAATATGCCCAAATGATTTATTACAAATGTTATTGAGTCAGAAAAGAGAGTTGATGAAATTGGAAGGTGAATATGTTGGGTCAATGCGAGTTGGGGGGAAATTC
TTCTTCGAAGAAAGGCATGAATACTCTGAAGAAGAATTGCGAAGGACAGAGAGATCGAAAGGAGAACTTTTGGTGCTGGAAGAATACATTACGCTTGTTGAGAAGCCAAG
CTGCCTACAAAAAGTTGTAAACGTCATGTTACGTTGGAGAGATGATTTGAGATTGCCAAAAGATACAATTGTTCGTGAAGGACAAAGACGCATAGAGAAATTCGGGAATT
CTTCTTCGAGCCTAGGCTCTATATTTTTCTGGAAAATGACAACAGAAGCAGGGTCTTCATCATCGTCTTCTTCAGCAGCTCCAGTTACACCAGTGGTTTCTCCTAGCACT
CGAATAACCACACCTATTGTTACTCCAATCCAAAACCCCAGACATCCAGCTCAACCCCATTTTTCTACTCAAAATCCTCCCCCTCAACAAAATCAATTTTCGTTGAATCA
AAATCCTCTTCCTCAACCAAATTATCCCTATCCCTATCCCCAGTTCACTCATCCCCAACCAAATTATTTTGCCCAACCCTATTACCCTCGGCCTCCTCTGTTTCCAGCCA
ACCAAGCTTATCCTCAAGTTCCAGCTACTTTCCCCCCTCCGAATTCCTATCCCTCATTACCAAAACCCCTTGCTGTTAAACTTACCGACAACAATTTTCTATTATGGAAA
AACCACCTGCTCAATGTTGTCATTGCCAATGGTCTCTCTGGCTACTTAGACGGAACGATACCTGCTCCTCAGAAATTTCTCGATCAGAATCAAACACAGCCGAATCCGGA
GTATGCTGGCTGGGAACGATACAACAGATTCATCATGTGCTGGATATATTCCTCTCTCTCGGAAGAAAAAATGGGTGAGATCGTTAACTTAACCACTGCTTTTGATATAT
GGACTTCTCTTACTCCTCAGATTAAGGACATTACTGATAAATTCTCTGCTATTGGTGAACCCATATCATATCGTGATCATTTAGCGCATATTCTTGATGGTCTTGGCGTT
GAATATAATGTTTTTGTTACTAGTATTCAGAATTGGTCGAATAGTCCCTCTTTGGAAGATGTTAGAAGCTTGTTGTTAGCCTATGAAGCTAGATTAGAGAAGCAAACAGC
TGTTGATCAACTCAATATTGCTCAAGTTAATCTCAGTAGCCTTAATCTTCGGCATAATAATAATCGTCAATCTTTTAACAAGTCTCAGTTCAACCAGTTTAATCCCTTTT
CCAAGCCTCCTTTTACTTCCTCGAACCAGCAACCTTCCTTTTCCCAAACCAGTGTGTTAGGCAAGCCTAATTTCCCTACATATCAGCCTTGA
Protein sequenceShow/hide protein sequence
MRKKDVESDGELLPKYSHTKPTSISLFFKFFSSNSITRGSHNSFQRPENSKEDPVVDPSHSDVISVHSCTLPNLGVTVIYRISGKPPFELLRVELKVFVLRGCCVLYSSC
ASANDSTRVLFGRNYRFAVHLRNQLMLWEGDEDVQQKVGDVAITRSELENAEKMYLISRWRCMESKSSTDVDHKSSICPNDLLQMLLSQKRELMKLEGEYVGSMRVGGKF
FFEERHEYSEEELRRTERSKGELLVLEEYITLVEKPSCLQKVVNVMLRWRDDLRLPKDTIVREGQRRIEKFGNSSSSLGSIFFWKMTTEAGSSSSSSSAAPVTPVVSPST
RITTPIVTPIQNPRHPAQPHFSTQNPPPQQNQFSLNQNPLPQPNYPYPYPQFTHPQPNYFAQPYYPRPPLFPANQAYPQVPATFPPPNSYPSLPKPLAVKLTDNNFLLWK
NHLLNVVIANGLSGYLDGTIPAPQKFLDQNQTQPNPEYAGWERYNRFIMCWIYSSLSEEKMGEIVNLTTAFDIWTSLTPQIKDITDKFSAIGEPISYRDHLAHILDGLGV
EYNVFVTSIQNWSNSPSLEDVRSLLLAYEARLEKQTAVDQLNIAQVNLSSLNLRHNNNRQSFNKSQFNQFNPFSKPPFTSSNQQPSFSQTSVLGKPNFPTYQP