CuGenDBv2

Lag0014531 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0014531
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr12:1816174..1817299
RNA-Seq Expression: Lag0014531
Synteny: Lag0014531
Gene Ontology terms: GO:0034641 - cellular nitrogen compound metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 2.0e-46; % identity: 46.1)
Query:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT
        MSE IL Q++HCKSA EIW  L  IF+SR+LA  M+ K+KL NI+KG   L EYF KI + +DALA+I K V  +DHILYIL+GLG++Y +M+SVI+A+T
Subjt:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT

Query:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAP------KENDNQRGLNQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCN
         + +VQEV++LLL  ES+ ESK  +  +  LP+ N+  Q          + N N    N   N+S N   GRG GR     NRG R   NRN+PQCQ+C 
Subjt:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAP------KENDNQRGLNQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCN

Query:  KFGHTAIKCYSRVQMPGVFSTQCVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS
        K G++A +C+ R            P + S   S N  N+S       PQM+AM+AA + N D NWYPDS
Subjt:  KFGHTAIKCYSRVQMPGVFSTQCVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS

XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia] (e value: 1.5e-49; % identity: 42.6)
Query:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT
        M E IL ++IHC +A E+W  L  ++ SR+LA +M++KSKL+NI+KG   L +YF K+K  +D+LAA GK+V VEDHI++IL+GL +E+++ VSVI+A+T
Subjt:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT

Query:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAPKENDNQRGLNQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCNKFGHTA
         TQT+QEV +LLL+HE R E + ++N DGTLP+ NL+ Q    K +++ + ++ Q+ Y QN    +  G  NF +N     WN+ NRPQCQ+  KFGHTA
Subjt:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAPKENDNQRGLNQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCNKFGHTA

Query:  IKCYSRVQ-----------MPGVF---STQCVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS
        ++CY R +           M   F   S   +  N +   +Q    ++         MAA +A  +FN+D NWYPDS
Subjt:  IKCYSRVQ-----------MPGVF---STQCVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS

XP_022136883.1 dr1-associated corepressor homolog isoform X2 [Momordica charantia] (e value: 1.5e-49; % identity: 42.6)
Query:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT
        M E IL ++IHC +A E+W  L  ++ SR+LA +M++KSKL+NI+KG   L +YF K+K  +D+LAA GK+V VEDHI++IL+GL +E+++ VSVI+A+T
Subjt:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT

Query:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAPKENDNQRGLNQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCNKFGHTA
         TQT+QEV +LLL+HE R E + ++N DGTLP+ NL+ Q    K +++ + ++ Q+ Y QN    +  G  NF +N     WN+ NRPQCQ+  KFGHTA
Subjt:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAPKENDNQRGLNQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCNKFGHTA

Query:  IKCYSRVQ-----------MPGVF---STQCVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS
        ++CY R +           M   F   S   +  N +   +Q    ++         MAA +A  +FN+D NWYPDS
Subjt:  IKCYSRVQ-----------MPGVF---STQCVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia] (e value: 3.3e-49; % identity: 43.06)
Query:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT
        M+E IL Q++ CKSA EIW  L  +F SR LA +M++K KL+N +KG  SL +YF KIK  +D+LA  GK++  EDHI++IL+GLG E+D ++SVITA+ 
Subjt:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT

Query:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAPKENDNQRGL--NQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCNKFGH
          QT+QEV +LLL  E R E +  +N DG+LP+ NL++ + + K N +Q       Q NYSQ     RGRG  N   NR  R+W   N+PQCQ+C +FGH
Subjt:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAPKENDNQRGL--NQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCNKFGH

Query:  TAIKCYSRVQM--------PGVFSTQ--------CVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS
        TA++CY R +         P  FS            P + +F +      + +     P QM A+M A +FN+D NWY DS
Subjt:  TAIKCYSRVQM--------PGVFSTQ--------CVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS

XP_022158089.1 uncharacterized protein LOC111024658 [Momordica charantia] (e value: 3.7e-48; % identity: 42.22)
Query:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT
        M+E IL  ++HC +A EIW  L ++F +++L  +M++K++LQN++KGG SL EY  +IK  +D+L A GK +  EDHI++ILSGLG+EY++ VSVIT K 
Subjt:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT

Query:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAPKENDNQRGL----NQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCNKF
        G  T+Q+V ALLL+H+ RIE + +   D TLP+A++++ +    +N+    +    N    Y Q++     RGR  F +N GGR WN+RN+ QCQ+C++F
Subjt:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAPKENDNQRGL----NQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCNKF

Query:  GHTAIKCY---SRVQMPGVFSTQCVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS
        GHTA + Y   S VQ    +ST+             Y  SS+ Y Q     AAM+ + + N+D NWYPDS
Subjt:  GHTAIKCY---SRVQMPGVFSTQCVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 9.8e-47; % identity: 46.1)
Query:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT
        MSE IL Q++HCKSA EIW  L  IF+SR+LA  M+ K+KL NI+KG   L EYF KI + +DALA+I K V  +DHILYIL+GLG++Y +M+SVI+A+T
Subjt:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT

Query:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAP------KENDNQRGLNQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCN
         + +VQEV++LLL  ES+ ESK  +  +  LP+ N+  Q          + N N    N   N+S N   GRG GR     NRG R   NRN+PQCQ+C 
Subjt:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAP------KENDNQRGLNQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCN

Query:  KFGHTAIKCYSRVQMPGVFSTQCVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS
        K G++A +C+ R            P + S   S N  N+S       PQM+AM+AA + N D NWYPDS
Subjt:  KFGHTAIKCYSRVQMPGVFSTQCVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS

A0A6J1C6N9 dr1-associated corepressor homolog isoform X1 (e value: 7.3e-50; % identity: 42.6)
Query:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT
        M E IL ++IHC +A E+W  L  ++ SR+LA +M++KSKL+NI+KG   L +YF K+K  +D+LAA GK+V VEDHI++IL+GL +E+++ VSVI+A+T
Subjt:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT

Query:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAPKENDNQRGLNQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCNKFGHTA
         TQT+QEV +LLL+HE R E + ++N DGTLP+ NL+ Q    K +++ + ++ Q+ Y QN    +  G  NF +N     WN+ NRPQCQ+  KFGHTA
Subjt:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAPKENDNQRGLNQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCNKFGHTA

Query:  IKCYSRVQ-----------MPGVF---STQCVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS
        ++CY R +           M   F   S   +  N +   +Q    ++         MAA +A  +FN+D NWYPDS
Subjt:  IKCYSRVQ-----------MPGVF---STQCVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS

A0A6J1C8R2 dr1-associated corepressor homolog isoform X2 (e value: 7.3e-50; % identity: 42.6)
Query:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT
        M E IL ++IHC +A E+W  L  ++ SR+LA +M++KSKL+NI+KG   L +YF K+K  +D+LAA GK+V VEDHI++IL+GL +E+++ VSVI+A+T
Subjt:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT

Query:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAPKENDNQRGLNQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCNKFGHTA
         TQT+QEV +LLL+HE R E + ++N DGTLP+ NL+ Q    K +++ + ++ Q+ Y QN    +  G  NF +N     WN+ NRPQCQ+  KFGHTA
Subjt:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAPKENDNQRGLNQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCNKFGHTA

Query:  IKCYSRVQ-----------MPGVF---STQCVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS
        ++CY R +           M   F   S   +  N +   +Q    ++         MAA +A  +FN+D NWYPDS
Subjt:  IKCYSRVQ-----------MPGVF---STQCVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS

A0A6J1DLT9 uncharacterized protein LOC111021757 (e value: 1.6e-49; % identity: 43.06)
Query:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT
        M+E IL Q++ CKSA EIW  L  +F SR LA +M++K KL+N +KG  SL +YF KIK  +D+LA  GK++  EDHI++IL+GLG E+D ++SVITA+ 
Subjt:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT

Query:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAPKENDNQRGL--NQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCNKFGH
          QT+QEV +LLL  E R E +  +N DG+LP+ NL++ + + K N +Q       Q NYSQ     RGRG  N   NR  R+W   N+PQCQ+C +FGH
Subjt:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAPKENDNQRGL--NQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCNKFGH

Query:  TAIKCYSRVQM--------PGVFSTQ--------CVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS
        TA++CY R +         P  FS            P + +F +      + +     P QM A+M A +FN+D NWY DS
Subjt:  TAIKCYSRVQM--------PGVFSTQ--------CVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS

A0A6J1DYD5 uncharacterized protein LOC111024658 (e value: 1.8e-48; % identity: 42.22)
Query:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT
        M+E IL  ++HC +A EIW  L ++F +++L  +M++K++LQN++KGG SL EY  +IK  +D+L A GK +  EDHI++ILSGLG+EY++ VSVIT K 
Subjt:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT

Query:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAPKENDNQRGL----NQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCNKF
        G  T+Q+V ALLL+H+ RIE + +   D TLP+A++++ +    +N+    +    N    Y Q++     RGR  F +N GGR WN+RN+ QCQ+C++F
Subjt:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAPKENDNQRGL----NQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCNKF

Query:  GHTAIKCY---SRVQMPGVFSTQCVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS
        GHTA + Y   S VQ    +ST+             Y  SS+ Y Q     AAM+ + + N+D NWYPDS
Subjt:  GHTAIKCY---SRVQMPGVFSTQCVPPNQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDS

SwissProt top hits (columns: e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.3e-11; % identity: 27.62)
Query:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT
        +S ++   V    +A +IW  L +I+ +    H+ +++++L+   KG  ++++Y   +    D LA +GK +  ++ +  +L  L  EY  ++  I AK 
Subjt:  MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKT

Query:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLP-TAN-LSVQNPAPKENDNQRGLNQQQNYSQNFGAGR--GRGRFNFGQNRGGRSWNNRNRP---QCQLC
           T+ E+   LLNHES+I    AV+    +P TAN +S +N     N+N    N + +   N    +   +   NF  N      NN+++P   +CQ+C
Subjt:  GTQTVQEVIALLLNHESRIESKAAVNPDGTLP-TAN-LSVQNPAPKENDNQRGLNQQQNYSQNFGAGR--GRGRFNFGQNRGGRSWNNRNRP---QCQLC

Query:  NKFGHTAIKC
           GH+A +C
Subjt:  NKFGHTAIKC

Arabidopsis top hits (columns: e value, % identity, alignment)
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value: 4.0e-08; % identity: 24.06)
Query:  MSEAILEQVIHCK-SANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAK
        +++++L+ +I    +A ++W  L  +F     A  ++ +++L+       S++EY  K+K   D L  +   +     ++++L+GL  +YD +++VI  K
Subjt:  MSEAILEQVIHCK-SANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAK

Query:  TGTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAPKENDNQRGLNQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRN
        +   +  E  ++LL  ESR+ +K+  +   T   +  +V    P++   +R   +  N + N G GR + +   G +  GR  NN N
Subjt:  TGTQTVQEVIALLLNHESRIESKAAVNPDGTLPTANLSVQNPAPKENDNQRGLNQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRN


Sequences
CDS sequence
ATGTCAGAAGCTATCTTAGAGCAAGTTATTCATTGTAAAAGTGCAAATGAAATTTGGCATTGCTTGTCAGAGATTTTTAATTCCAGACATTTAGCTCACATGATGAAGAT
AAAATCAAAATTACAAAATATTCAAAAAGGAGGTTCTAGTCTAAATGAGTATTTCTCTAAAATTAAGAAGTACATAGATGCCCTAGCTGCTATAGGGAAGCAAGTCCCAG
TGGAGGATCACATACTGTATATACTATCTGGACTTGGAACTGAGTATGATACCATGGTGTCTGTGATAACTGCTAAGACAGGAACTCAAACAGTCCAAGAAGTCATAGCT
CTTCTTTTGAATCATGAAAGTCGCATTGAAAGCAAAGCTGCTGTTAATCCTGATGGAACACTACCAACGGCAAATCTTTCAGTTCAAAATCCCGCTCCTAAGGAAAACGA
TAATCAAAGAGGTTTGAATCAACAACAAAATTATTCTCAAAATTTTGGTGCAGGCAGAGGACGGGGGAGGTTTAATTTTGGACAAAACAGAGGAGGGAGGTCTTGGAATA
ATCGAAATAGGCCTCAATGTCAACTGTGCAATAAGTTTGGGCATACAGCTATCAAGTGTTACTCTCGTGTTCAAATGCCAGGGGTCTTCTCTACTCAATGTGTGCCTCCC
AATCAGTCATTCATTGCTAGTCAGAATTATGGTAATTCATCCAATCAATATGGTCAAATTCCCCCCCAGATGGCTGCAATGATGGCAGCACATAACTTCAACCAAGACTG
CAATTGGTATCCCGACTCAGAAATCAAATGTAGATCCAATCTAAAGTGGGGCATTCTGAGAATTGAATTGGGGACTTTCTTGCAAACTGGGGACTTCTTGCATCCTATCA
AGAAGCATAACATCGGGTCAAATGCCCAGTGA
mRNA sequence
ATGTCAGAAGCTATCTTAGAGCAAGTTATTCATTGTAAAAGTGCAAATGAAATTTGGCATTGCTTGTCAGAGATTTTTAATTCCAGACATTTAGCTCACATGATGAAGAT
AAAATCAAAATTACAAAATATTCAAAAAGGAGGTTCTAGTCTAAATGAGTATTTCTCTAAAATTAAGAAGTACATAGATGCCCTAGCTGCTATAGGGAAGCAAGTCCCAG
TGGAGGATCACATACTGTATATACTATCTGGACTTGGAACTGAGTATGATACCATGGTGTCTGTGATAACTGCTAAGACAGGAACTCAAACAGTCCAAGAAGTCATAGCT
CTTCTTTTGAATCATGAAAGTCGCATTGAAAGCAAAGCTGCTGTTAATCCTGATGGAACACTACCAACGGCAAATCTTTCAGTTCAAAATCCCGCTCCTAAGGAAAACGA
TAATCAAAGAGGTTTGAATCAACAACAAAATTATTCTCAAAATTTTGGTGCAGGCAGAGGACGGGGGAGGTTTAATTTTGGACAAAACAGAGGAGGGAGGTCTTGGAATA
ATCGAAATAGGCCTCAATGTCAACTGTGCAATAAGTTTGGGCATACAGCTATCAAGTGTTACTCTCGTGTTCAAATGCCAGGGGTCTTCTCTACTCAATGTGTGCCTCCC
AATCAGTCATTCATTGCTAGTCAGAATTATGGTAATTCATCCAATCAATATGGTCAAATTCCCCCCCAGATGGCTGCAATGATGGCAGCACATAACTTCAACCAAGACTG
CAATTGGTATCCCGACTCAGAAATCAAATGTAGATCCAATCTAAAGTGGGGCATTCTGAGAATTGAATTGGGGACTTTCTTGCAAACTGGGGACTTCTTGCATCCTATCA
AGAAGCATAACATCGGGTCAAATGCCCAGTGA
Protein sequence
MSEAILEQVIHCKSANEIWHCLSEIFNSRHLAHMMKIKSKLQNIQKGGSSLNEYFSKIKKYIDALAAIGKQVPVEDHILYILSGLGTEYDTMVSVITAKTGTQTVQEVIA
LLLNHESRIESKAAVNPDGTLPTANLSVQNPAPKENDNQRGLNQQQNYSQNFGAGRGRGRFNFGQNRGGRSWNNRNRPQCQLCNKFGHTAIKCYSRVQMPGVFSTQCVPP
NQSFIASQNYGNSSNQYGQIPPQMAAMMAAHNFNQDCNWYPDSEIKCRSNLKWGILRIELGTFLQTGDFLHPIKKHNIGSNAQ