; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041315 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041315
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr13:15461354..15469259
RNA-Seq ExpressionLag0041315
SyntenyLag0041315
Gene Ontology termsNA
InterPro domainsIPR029026 - tRNA (guanine-N1-)-methyltransferase, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600811.1 hypothetical protein SDJN03_06044, partial [Cucurbita argyrosperma subsp. sororia]1.1e-8181.36Show/hide
Query:  IPPMNENPIEHLDSTYRHVVDAIFTSLLNFLGRIGGLYINLRNQGYLQIHSPRAFWDADSFREMLREVLS-EHRIPQENDAIVWHRTNLQSLFNRFPQDR
        IPPMN+NP+EHLDSTYRHVVD IFTSLLNFLGR+GGLYINLR QGYLQI+SPRAFWD DSFREMLREVLS +HR+PQENDAI+WH TNLQSLF+  PQ+R
Subjt:  IPPMNENPIEHLDSTYRHVVDAIFTSLLNFLGRIGGLYINLRNQGYLQIHSPRAFWDADSFREMLREVLS-EHRIPQENDAIVWHRTNLQSLFNRFPQDR

Query:  RIIGCLPDDQDGNRQEVNIHDFLRFLVHPSENLIFVVGDFHNYRELESKIDQHVKVSDPGTPTNLFVQIICQECQRR
        R+IGCL +DQ GN+QE+ IHDFLRFLVHP+ENL+FVVGDF NYR LE +IDQ+VKVSDPGTPTNL V+ ICQECQRR
Subjt:  RIIGCLPDDQDGNRQEVNIHDFLRFLVHPSENLIFVVGDFHNYRELESKIDQHVKVSDPGTPTNLFVQIICQECQRR

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.6e-3539.31Show/hide
Query:  TQPPNQPLSPNITPRPPLQPQPQQPQHFAHLNNRNLPFFQATSIPCLPPATSIKLNDTNFLLWKNQLLNVVLANGLNDFLDGSIPAPAKFLDEQQLQLNP
        T PP    + N T      P PQ       +    LP       P L  + SIKL++TN LL K+QLLNV++ANGL DF+D    +P K+LD    Q+NP
Subjt:  TQPPNQPLSPNITPRPPLQPQPQQPQHFAHLNNRNLPFFQATSIPCLPPATSIKLNDTNFLLWKNQLLNVVLANGLNDFLDGSIPAPAKFLDEQQLQLNP

Query:  EFLLWER------------------------------------SYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHI
        EF+ W+R                                     Y+S + A +M L +QLQ+IKK  + +S+YL+++K V D+F+ IGEP+SYRD L  I
Subjt:  EFLLWER------------------------------------SYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHI

Query:  LDGLGSEYNPFVITIQNRSDNPSLEDVRSLLLAYEARLEKQTVVEQLNMAQANLSALQLNNS
        L+GL  EY+ FV +I NRSD PSL++V SLL  YE RL ++++ + LN  QAN      NNS
Subjt:  LDGLGSEYNPFVITIQNRSDNPSLEDVRSLLLAYEARLEKQTVVEQLNMAQANLSALQLNNS

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]7.6e-6750Show/hide
Query:  PRPPLQPQPQQPQHFAHLNNRNLPFFQATSIPCLPPATSIKLNDTNFLLWKNQLLNVVLANGLNDFLDGSIPAPAKFLDEQQLQLNPEFLLWE-------
        P P    QP  P             F A   P LP   ++KLND NFLLWKNQLLN V+ANGL  +LDG+I  P +FLD  QLQ NP +  WE       
Subjt:  PRPPLQPQPQQPQHFAHLNNRNLPFFQATSIPCLPPATSIKLNDTNFLLWKNQLLNVVLANGLNDFLDGSIPAPAKFLDEQQLQLNPEFLLWE-------

Query:  -----------------------------RSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGSEYNPFVI
                                     R YDSKTTARIMGLKT+LQ ++KDG SVSQYLA+IK++ADKF+A+GEP+SYRDHLAH+LDGLGSEYN FV 
Subjt:  -----------------------------RSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGSEYNPFVI

Query:  TIQNRSDNPSLEDVRSLLLAYEARLEKQTVVEQLNMAQANLSALQL-NNSRRPASRNPTPATRTPFNPSTFPSFSSSTAAPFSNSLLGKPQSQPIQKWPQ
        +I NR+D+PSLEDVRSLLLAYEARL+KQ  V+QLN+AQANL  L L +NS+RP  +   P         +FP  +S  +A  S S+LGKPQS  + KWP 
Subjt:  TIQNRSDNPSLEDVRSLLLAYEARLEKQTVVEQLNMAQANLSALQL-NNSRRPASRNPTPATRTPFNPSTFPSFSSSTAAPFSNSLLGKPQSQPIQKWPQ

Query:  DPPPISLNAKYVANLG
         P    +  +    LG
Subjt:  DPPPISLNAKYVANLG

XP_022942054.1 uncharacterized protein LOC111447242 [Cucurbita moschata]5.4e-8180.79Show/hide
Query:  IPPMNENPIEHLDSTYRHVVDAIFTSLLNFLGRIGGLYINLRNQGYLQIHSPRAFWDADSFREMLREVLS-EHRIPQENDAIVWHRTNLQSLFNRFPQDR
        IPPMN+NP+EHLDSTYRHVVD IFTSLLNFLGR+GGLYINLR QGYLQI+SPRAFWD DSFREMLREVLS +HR+PQENDAI+WH TNLQSLF+  PQ+R
Subjt:  IPPMNENPIEHLDSTYRHVVDAIFTSLLNFLGRIGGLYINLRNQGYLQIHSPRAFWDADSFREMLREVLS-EHRIPQENDAIVWHRTNLQSLFNRFPQDR

Query:  RIIGCLPDDQDGNRQEVNIHDFLRFLVHPSENLIFVVGDFHNYRELESKIDQHVKVSDPGTPTNLFVQIICQECQRR
        R+IGCL +DQ GN+QE+ IHDFLRFLVHP+ENL+FVVGDF NYR LE +IDQ+VKVSDPGTP NL V+ ICQECQRR
Subjt:  RIIGCLPDDQDGNRQEVNIHDFLRFLVHPSENLIFVVGDFHNYRELESKIDQHVKVSDPGTPTNLFVQIICQECQRR

XP_022989850.1 uncharacterized protein LOC111486913 [Cucurbita maxima]1.0e-7980.23Show/hide
Query:  IPPMNENPIEHLDSTYRHVVDAIFTSLLNFLGRIGGLYINLRNQGYLQIHSPRAFWDADSFREMLREVLS-EHRIPQENDAIVWHRTNLQSLFNRFPQDR
        IPPMN+NP+EHLDSTYRHVVD IFTSLLNFLGR+GGLYINL  QGYLQI+SPRAFWD DSFREMLREVLS +HR+ QENDAI+WH TNLQSLF+  PQ+R
Subjt:  IPPMNENPIEHLDSTYRHVVDAIFTSLLNFLGRIGGLYINLRNQGYLQIHSPRAFWDADSFREMLREVLS-EHRIPQENDAIVWHRTNLQSLFNRFPQDR

Query:  RIIGCLPDDQDGNRQEVNIHDFLRFLVHPSENLIFVVGDFHNYRELESKIDQHVKVSDPGTPTNLFVQIICQECQRR
        R+IGCL +DQ GN+QE+ IHDFLRFLVHP+ENL+FVVGDF NYR LE +IDQ+VKVSDPGTPTNL V+ ICQECQRR
Subjt:  RIIGCLPDDQDGNRQEVNIHDFLRFLVHPSENLIFVVGDFHNYRELESKIDQHVKVSDPGTPTNLFVQIICQECQRR

TrEMBL top hitse value%identityAlignment
A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE17.5e-3639.31Show/hide
Query:  TQPPNQPLSPNITPRPPLQPQPQQPQHFAHLNNRNLPFFQATSIPCLPPATSIKLNDTNFLLWKNQLLNVVLANGLNDFLDGSIPAPAKFLDEQQLQLNP
        T PP    + N T      P PQ       +    LP       P L  + SIKL++TN LL K+QLLNV++ANGL DF+D    +P K+LD    Q+NP
Subjt:  TQPPNQPLSPNITPRPPLQPQPQQPQHFAHLNNRNLPFFQATSIPCLPPATSIKLNDTNFLLWKNQLLNVVLANGLNDFLDGSIPAPAKFLDEQQLQLNP

Query:  EFLLWER------------------------------------SYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHI
        EF+ W+R                                     Y+S + A +M L +QLQ+IKK  + +S+YL+++K V D+F+ IGEP+SYRD L  I
Subjt:  EFLLWER------------------------------------SYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHI

Query:  LDGLGSEYNPFVITIQNRSDNPSLEDVRSLLLAYEARLEKQTVVEQLNMAQANLSALQLNNS
        L+GL  EY+ FV +I NRSD PSL++V SLL  YE RL ++++ + LN  QAN      NNS
Subjt:  LDGLGSEYNPFVITIQNRSDNPSLEDVRSLLLAYEARLEKQTVVEQLNMAQANLSALQLNNS

A0A6J1DQX7 uncharacterized protein LOC1110223153.7e-6750Show/hide
Query:  PRPPLQPQPQQPQHFAHLNNRNLPFFQATSIPCLPPATSIKLNDTNFLLWKNQLLNVVLANGLNDFLDGSIPAPAKFLDEQQLQLNPEFLLWE-------
        P P    QP  P             F A   P LP   ++KLND NFLLWKNQLLN V+ANGL  +LDG+I  P +FLD  QLQ NP +  WE       
Subjt:  PRPPLQPQPQQPQHFAHLNNRNLPFFQATSIPCLPPATSIKLNDTNFLLWKNQLLNVVLANGLNDFLDGSIPAPAKFLDEQQLQLNPEFLLWE-------

Query:  -----------------------------RSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGSEYNPFVI
                                     R YDSKTTARIMGLKT+LQ ++KDG SVSQYLA+IK++ADKF+A+GEP+SYRDHLAH+LDGLGSEYN FV 
Subjt:  -----------------------------RSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGSEYNPFVI

Query:  TIQNRSDNPSLEDVRSLLLAYEARLEKQTVVEQLNMAQANLSALQL-NNSRRPASRNPTPATRTPFNPSTFPSFSSSTAAPFSNSLLGKPQSQPIQKWPQ
        +I NR+D+PSLEDVRSLLLAYEARL+KQ  V+QLN+AQANL  L L +NS+RP  +   P         +FP  +S  +A  S S+LGKPQS  + KWP 
Subjt:  TIQNRSDNPSLEDVRSLLLAYEARLEKQTVVEQLNMAQANLSALQL-NNSRRPASRNPTPATRTPFNPSTFPSFSSSTAAPFSNSLLGKPQSQPIQKWPQ

Query:  DPPPISLNAKYVANLG
         P    +  +    LG
Subjt:  DPPPISLNAKYVANLG

A0A6J1FQ78 uncharacterized protein LOC1114472422.6e-8180.79Show/hide
Query:  IPPMNENPIEHLDSTYRHVVDAIFTSLLNFLGRIGGLYINLRNQGYLQIHSPRAFWDADSFREMLREVLS-EHRIPQENDAIVWHRTNLQSLFNRFPQDR
        IPPMN+NP+EHLDSTYRHVVD IFTSLLNFLGR+GGLYINLR QGYLQI+SPRAFWD DSFREMLREVLS +HR+PQENDAI+WH TNLQSLF+  PQ+R
Subjt:  IPPMNENPIEHLDSTYRHVVDAIFTSLLNFLGRIGGLYINLRNQGYLQIHSPRAFWDADSFREMLREVLS-EHRIPQENDAIVWHRTNLQSLFNRFPQDR

Query:  RIIGCLPDDQDGNRQEVNIHDFLRFLVHPSENLIFVVGDFHNYRELESKIDQHVKVSDPGTPTNLFVQIICQECQRR
        R+IGCL +DQ GN+QE+ IHDFLRFLVHP+ENL+FVVGDF NYR LE +IDQ+VKVSDPGTP NL V+ ICQECQRR
Subjt:  RIIGCLPDDQDGNRQEVNIHDFLRFLVHPSENLIFVVGDFHNYRELESKIDQHVKVSDPGTPTNLFVQIICQECQRR

A0A6J1JNI2 uncharacterized protein LOC1114869135.0e-8080.23Show/hide
Query:  IPPMNENPIEHLDSTYRHVVDAIFTSLLNFLGRIGGLYINLRNQGYLQIHSPRAFWDADSFREMLREVLS-EHRIPQENDAIVWHRTNLQSLFNRFPQDR
        IPPMN+NP+EHLDSTYRHVVD IFTSLLNFLGR+GGLYINL  QGYLQI+SPRAFWD DSFREMLREVLS +HR+ QENDAI+WH TNLQSLF+  PQ+R
Subjt:  IPPMNENPIEHLDSTYRHVVDAIFTSLLNFLGRIGGLYINLRNQGYLQIHSPRAFWDADSFREMLREVLS-EHRIPQENDAIVWHRTNLQSLFNRFPQDR

Query:  RIIGCLPDDQDGNRQEVNIHDFLRFLVHPSENLIFVVGDFHNYRELESKIDQHVKVSDPGTPTNLFVQIICQECQRR
        R+IGCL +DQ GN+QE+ IHDFLRFLVHP+ENL+FVVGDF NYR LE +IDQ+VKVSDPGTPTNL V+ ICQECQRR
Subjt:  RIIGCLPDDQDGNRQEVNIHDFLRFLVHPSENLIFVVGDFHNYRELESKIDQHVKVSDPGTPTNLFVQIICQECQRR

A0A7J0EGI5 Uncharacterized protein2.2e-3537.08Show/hide
Query:  SPNITPRPPLQPQPQQPQHFAHLNNRNLPFFQATSIPCLPPAT---SIKLNDTNFLLWKNQLLNVVLANGLNDFLDGSIPAPAKFLDEQQLQLNPEFLLW
        S +  P PP  P    P   + + N N      + +P +P      ++KL+D N+++WK QLLN+V+ANGL +FLDGS   P +FLD QQ Q NPEF  W
Subjt:  SPNITPRPPLQPQPQQPQHFAHLNNRNLPFFQATSIPCLPPAT---SIKLNDTNFLLWKNQLLNVVLANGLNDFLDGSIPAPAKFLDEQQLQLNPEFLLW

Query:  ------------------------------------ERSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLG
                                            ER Y + + A +  L+T LQ IKK+GL+   Y+ + + + +  ++IGEP++Y DHL + L GLG
Subjt:  ------------------------------------ERSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLG

Query:  SEYNPFVITIQNRSDNPSLEDVRSLLLAYEARLEKQTVVE
         +YNPFV +IQ+++  PS+E+V SLLL+Y+ARLE+Q+  +
Subjt:  SEYNPFVITIQNRSDNPSLEDVRSLLLAYEARLEKQTVVE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)6.8e-0523.23Show/hide
Query:  ERSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGSEYNPFVITIQNRSDNPSLEDVRSLLLAYEARLEK
        +  + +   AR + L ++L+      + V+ Y  ++K +AD    +  P++ R+ + ++L+GL  +++  +  I++R   PS +D  ++L   E RL++
Subjt:  ERSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGSEYNPFVITIQNRSDNPSLEDVRSLLLAYEARLEK

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.7e-0830.48Show/hide
Query:  FLLWERSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGSEYNPFVITIQNRSDNPSLEDVRSLLLAYEAR
        +L  E  +     AR +  + +L+    D LSV +Y  ++K ++D  + +  PIS R  + H+L+GL  +Y+  +  I+++S  PS  + RS+LL  E+R
Subjt:  FLLWERSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGSEYNPFVITIQNRSDNPSLEDVRSLLLAYEAR

Query:  LEKQT
        L  ++
Subjt:  LEKQT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAAATGAAGTCTGAAAGTCTCAATGTGCAGATAGTTGATGTAATTTTAACGAGCCTTCTGAATTTGTCTGGGAGATTAAGTGACCTCTATATTAACTTGAGGAA
TGAAGGTTATCTTCAAATTCAATATCCTCTAGGTGCTTGGGATGCTGATTCATTCGCACAACTATTGCGTGATGCACTTAATGAATATAGGGCTCCACAAGACAATGATG
CAATTGTTTGGCATACAACAAATTTGCAATCTTTGTTTGATCGATTTCCTCCGAATCATCAAGTACTAGGTTTTTTGCCCGGTCAACATGAGGTAGACAGACATGAAATT
GACATCCATGACCTTTTGGTTGATCCGAGGGAAAATTTAGTATTTGTGGTCATCCCTCCCATGAATGAGAATCCAATAGAGCATTTGGATTCTACTTATCGTCATGTAGT
AGATGCAATTTTTACGAGTCTTTTGAATTTTCTAGGGAGAATAGGCGGCCTCTATATTAACTTGAGGAATCAAGGTTATCTTCAAATTCATTCTCCTCGAGCTTTTTGGG
ATGCTGATTCATTTCGAGAGATGTTGCGTGAGGTACTTAGTGAGCATAGGATTCCACAAGAGAATGATGCAATTGTTTGGCATAGAACAAATTTGCAATCTTTGTTTAAT
CGTTTTCCTCAAGATCGTCGAATAATAGGTTGTTTGCCTGATGACCAGGATGGAAATAGACAAGAAGTTAACATCCATGACTTTTTGCGATTTTTGGTTCATCCAAGTGA
AAATTTAATATTTGTGGTTGGTGACTTTCACAACTATAGAGAGCTTGAGAGTAAGATTGATCAACACGTCAAAGTTTCAGATCCTGGAACACCTACAAATTTGTTTGTCC
AGATCATTTGTCAAGAATGTCAACGAAGAGCTTCTCGCTCCTTTCCGATGACGACAGGTGAATCTTCTTCCTCATCTTCTTCAACCTCTTCAACCACGCCGGTAATCTCT
CCTAACGTTACCCCTGTCATTACGCCATTAAACCAGTCGATTAATCCTCTCACTCAACCTCCAAATCAACCTCTCAGTCCTAATATCACTCCTCGCCCACCGCTTCAACC
CCAACCGCAGCAACCTCAACATTTCGCCCACCTCAACAATCGCAACCTTCCATTTTTCCAAGCAACCTCAATCCCCTGTCTTCCCCCAGCAACCTCAATCAAGCTCAACG
ATACCAACTTCCTTCTCTGGAAGAATCAACTATTGAACGTCGTATTAGCAAATGGGTTGAACGACTTTCTGGATGGATCCATTCCAGCACCTGCAAAATTTCTTGATGAA
CAACAACTACAACTGAACCCGGAGTTTCTTCTATGGGAAAGATCTTATGACTCCAAAACTACTGCTAGAATTATGGGTTTGAAAACCCAACTCCAAAAGATTAAGAAAGA
TGGCCTCTCTGTTAGTCAATACCTAGCTCAAATCAAAGATGTCGCTGATAAGTTTTCTGCAATAGGTGAACCCATATCCTATCGAGATCACCTAGCCCATATTTTGGATG
GTCTTGGTAGTGAATACAATCCGTTTGTGATAACCATTCAAAATAGATCCGATAATCCCTCTCTTGAAGATGTTAGGAGTCTCTTGTTGGCTTATGAGGCTAGATTAGAG
AAGCAAACTGTTGTGGAACAATTAAATATGGCCCAAGCGAATCTGAGTGCTCTCCAACTGAATAACAGCCGCCGACCAGCCTCTAGAAATCCAACTCCAGCCACTAGAAC
TCCCTTCAATCCATCTACTTTTCCATCTTTTTCTAGTTCCACCGCTGCTCCTTTTTCTAATAGTCTTTTAGGTAAGCCTCAATCTCAACCTATCCAAAAATGGCCCCAAG
ATCCTCCTCCAATAAGCCTCAATGCCAAATATGTGGCAAATTTGGGCATACCGCTCTCATTTGCCATCATCGGACCAATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAAATGAAGTCTGAAAGTCTCAATGTGCAGATAGTTGATGTAATTTTAACGAGCCTTCTGAATTTGTCTGGGAGATTAAGTGACCTCTATATTAACTTGAGGAA
TGAAGGTTATCTTCAAATTCAATATCCTCTAGGTGCTTGGGATGCTGATTCATTCGCACAACTATTGCGTGATGCACTTAATGAATATAGGGCTCCACAAGACAATGATG
CAATTGTTTGGCATACAACAAATTTGCAATCTTTGTTTGATCGATTTCCTCCGAATCATCAAGTACTAGGTTTTTTGCCCGGTCAACATGAGGTAGACAGACATGAAATT
GACATCCATGACCTTTTGGTTGATCCGAGGGAAAATTTAGTATTTGTGGTCATCCCTCCCATGAATGAGAATCCAATAGAGCATTTGGATTCTACTTATCGTCATGTAGT
AGATGCAATTTTTACGAGTCTTTTGAATTTTCTAGGGAGAATAGGCGGCCTCTATATTAACTTGAGGAATCAAGGTTATCTTCAAATTCATTCTCCTCGAGCTTTTTGGG
ATGCTGATTCATTTCGAGAGATGTTGCGTGAGGTACTTAGTGAGCATAGGATTCCACAAGAGAATGATGCAATTGTTTGGCATAGAACAAATTTGCAATCTTTGTTTAAT
CGTTTTCCTCAAGATCGTCGAATAATAGGTTGTTTGCCTGATGACCAGGATGGAAATAGACAAGAAGTTAACATCCATGACTTTTTGCGATTTTTGGTTCATCCAAGTGA
AAATTTAATATTTGTGGTTGGTGACTTTCACAACTATAGAGAGCTTGAGAGTAAGATTGATCAACACGTCAAAGTTTCAGATCCTGGAACACCTACAAATTTGTTTGTCC
AGATCATTTGTCAAGAATGTCAACGAAGAGCTTCTCGCTCCTTTCCGATGACGACAGGTGAATCTTCTTCCTCATCTTCTTCAACCTCTTCAACCACGCCGGTAATCTCT
CCTAACGTTACCCCTGTCATTACGCCATTAAACCAGTCGATTAATCCTCTCACTCAACCTCCAAATCAACCTCTCAGTCCTAATATCACTCCTCGCCCACCGCTTCAACC
CCAACCGCAGCAACCTCAACATTTCGCCCACCTCAACAATCGCAACCTTCCATTTTTCCAAGCAACCTCAATCCCCTGTCTTCCCCCAGCAACCTCAATCAAGCTCAACG
ATACCAACTTCCTTCTCTGGAAGAATCAACTATTGAACGTCGTATTAGCAAATGGGTTGAACGACTTTCTGGATGGATCCATTCCAGCACCTGCAAAATTTCTTGATGAA
CAACAACTACAACTGAACCCGGAGTTTCTTCTATGGGAAAGATCTTATGACTCCAAAACTACTGCTAGAATTATGGGTTTGAAAACCCAACTCCAAAAGATTAAGAAAGA
TGGCCTCTCTGTTAGTCAATACCTAGCTCAAATCAAAGATGTCGCTGATAAGTTTTCTGCAATAGGTGAACCCATATCCTATCGAGATCACCTAGCCCATATTTTGGATG
GTCTTGGTAGTGAATACAATCCGTTTGTGATAACCATTCAAAATAGATCCGATAATCCCTCTCTTGAAGATGTTAGGAGTCTCTTGTTGGCTTATGAGGCTAGATTAGAG
AAGCAAACTGTTGTGGAACAATTAAATATGGCCCAAGCGAATCTGAGTGCTCTCCAACTGAATAACAGCCGCCGACCAGCCTCTAGAAATCCAACTCCAGCCACTAGAAC
TCCCTTCAATCCATCTACTTTTCCATCTTTTTCTAGTTCCACCGCTGCTCCTTTTTCTAATAGTCTTTTAGGTAAGCCTCAATCTCAACCTATCCAAAAATGGCCCCAAG
ATCCTCCTCCAATAAGCCTCAATGCCAAATATGTGGCAAATTTGGGCATACCGCTCTCATTTGCCATCATCGGACCAATTTAG
Protein sequenceShow/hide protein sequence
MEEMKSESLNVQIVDVILTSLLNLSGRLSDLYINLRNEGYLQIQYPLGAWDADSFAQLLRDALNEYRAPQDNDAIVWHTTNLQSLFDRFPPNHQVLGFLPGQHEVDRHEI
DIHDLLVDPRENLVFVVIPPMNENPIEHLDSTYRHVVDAIFTSLLNFLGRIGGLYINLRNQGYLQIHSPRAFWDADSFREMLREVLSEHRIPQENDAIVWHRTNLQSLFN
RFPQDRRIIGCLPDDQDGNRQEVNIHDFLRFLVHPSENLIFVVGDFHNYRELESKIDQHVKVSDPGTPTNLFVQIICQECQRRASRSFPMTTGESSSSSSSTSSTTPVIS
PNVTPVITPLNQSINPLTQPPNQPLSPNITPRPPLQPQPQQPQHFAHLNNRNLPFFQATSIPCLPPATSIKLNDTNFLLWKNQLLNVVLANGLNDFLDGSIPAPAKFLDE
QQLQLNPEFLLWERSYDSKTTARIMGLKTQLQKIKKDGLSVSQYLAQIKDVADKFSAIGEPISYRDHLAHILDGLGSEYNPFVITIQNRSDNPSLEDVRSLLLAYEARLE
KQTVVEQLNMAQANLSALQLNNSRRPASRNPTPATRTPFNPSTFPSFSSSTAAPFSNSLLGKPQSQPIQKWPQDPPPISLNAKYVANLGIPLSFAIIGPI