; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001789 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001789
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:35381586..35384752
RNA-Seq ExpressionLag0001789
SyntenyLag0001789
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMO69928.1 reverse transcriptase [Corchorus capsularis]1.4e-4446.45Show/hide
Query:  FNATLKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDG----FLAL----------FFFR-------AYKIVAKVIVKRMKWILQDIISENQ
        FN  L++I  +V+TEMN  LL+ F  EE+  A+ QMHP+KAP PD      +AL            FR        YKI++KV+V R+K  L + ISENQ
Subjt:  FNATLKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDG----FLAL----------FFFR-------AYKIVAKVIVKRMKWILQDIISENQ

Query:  SAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWVGLIMDCITTAKFSILINNKPTGAIIPQRGLRQGD
        SAF+  R I DNI++ +E LHT++S + G++G+   KLDMSKAYDRVEW FL+ +M++LGF  RWV +IM C+ +  FS+++N + T    P RGLRQGD
Subjt:  SAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWVGLIMDCITTAKFSILINNKPTGAIIPQRGLRQGD

Query:  PHSPHIY-FCS
        P S +++ FC+
Subjt:  PHSPHIY-FCS

XP_023874626.1 uncharacterized protein LOC111987155 [Quercus suber]6.8e-4439.34Show/hide
Query:  FNATLKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDGFLALFF-----------------------------------------------F
        F+ +L  I  +VS EMN  LLA F  EEV  A++QMHP+KAP PDG   +F+                                               F
Subjt:  FNATLKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDGFLALFF-----------------------------------------------F

Query:  R-------AYKIVAKVIVKRMKWILQDIISENQSAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWVG
        R        YKI++KV+  R+K +L+++I E+QSAF+  RSI DN+++  E +H I  R+ G++  +  KLDMSKAYDRVEWS+L+ ++ KLGFH +W+ 
Subjt:  R-------AYKIVAKVIVKRMKWILQDIISENQSAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWVG

Query:  LIMDCITTAKFSILINNKPTGAIIPQRGLRQGDPHSPHIYFCSA
        L+M C++T  +S+L+N +P G I+P RGLRQGDP SP+++   A
Subjt:  LIMDCITTAKFSILINNKPTGAIIPQRGLRQGDPHSPHIYFCSA

XP_024035599.1 uncharacterized protein LOC112096407 [Citrus clementina]1.5e-4635.8Show/hide
Query:  MFASIYPDEAIFNATLKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDGFLALFFFRAYK--------------------------------
        +F +  P     NA L+ +  +V   MN  L   F  EE+  A+ QM P+KAP PDG  A+FF + +K                                
Subjt:  MFASIYPDEAIFNATLKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDGFLALFFFRAYK--------------------------------

Query:  ----------------------IVAKVIVKRMKWILQDIISENQSAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLM
                              +VAK I  R+K  L  IIS  QSAF+  R I DNIIIG+ECLH I+  +  +   V  KLD+ KAYDRVEWSFLK ++
Subjt:  ----------------------IVAKVIVKRMKWILQDIISENQSAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLM

Query:  IKLGFHPRWVGLIMDCITTAKFSILINNKPTGAIIPQRGLRQGDPHSPHIYFCSAL--RHCLHYYQEHPIKEAS----------QAYNQV------WRLF
         +LGF  +W+ LIM+CITT  FS++IN    G I PQRGLRQG P SP+++   A   +  +H+ +   +  A             +NQ       WRL 
Subjt:  IKLGFHPRWVGLIMDCITTAKFSILINNKPTGAIIPQRGLRQGDPHSPHIYFCSAL--RHCLHYYQEHPIKEAS----------QAYNQV------WRLF

Query:  TNPSLLASRVITATYAHGTSLFFAPVKSNCSFFWRSCVWARGLLLQGMRKQV
         NP  L ++VI A Y   T    A V S+ SF WRS +W R +L +G R ++
Subjt:  TNPSLLASRVITATYAHGTSLFFAPVKSNCSFFWRSCVWARGLLLQGMRKQV

XP_030936038.1 uncharacterized protein LOC115961147 [Quercus lobata]8.0e-4540.23Show/hide
Query:  FASIYPDE--AIFNATLKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDGFLALFF------------------------------------
        F +IY  +  A F A+L  I  +VSTEMN+ LLA F  EEV  A+KQMHP+KAP PDG   +FF                                    
Subjt:  FASIYPDE--AIFNATLKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDGFLALFF------------------------------------

Query:  -----------FR-------AYKIVAKVIVKRMKWILQDIISENQSAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVL
                   FR        YK+++KV+  R+K +L  +I E+QS F+  R I DN+++  E +H I  R+ G++  +  KLDMS AYDRVEW +L+ +
Subjt:  -----------FR-------AYKIVAKVIVKRMKWILQDIISENQSAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVL

Query:  MIKLGFHPRWVGLIMDCITTAKFSILINNKPTGAIIPQRGLRQGDPHSPHIYFCSA
        M K+GFH RW+ LIM C+TT  +S+LIN +P G I P RGLRQGDP SP+++   A
Subjt:  MIKLGFHPRWVGLIMDCITTAKFSILINNKPTGAIIPQRGLRQGDPHSPHIYFCSA

XP_030942013.1 uncharacterized protein LOC115967068 [Quercus lobata]6.8e-4439.61Show/hide
Query:  MFASIYPDEAIFNATLKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDGFLALFF-------------------------------------
        +F++ YPDE  F  +L  I  +VS +MND LL  F  EE+ RA+KQMHP+K+P P+    +FF                                     
Subjt:  MFASIYPDEAIFNATLKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDGFLALFF-------------------------------------

Query:  ----------FR-------AYKIVAKVIVKRMKWILQDIISENQSAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLM
                  FR        YKIV+KV+  R+K +L  IISE QSAF+  R I DN+++  E +H I  +R G+KG +  KLDMSKAYDRVEW++L+ +M
Subjt:  ----------FR-------AYKIVAKVIVKRMKWILQDIISENQSAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLM

Query:  IKLGFHPRWVGLIMDCITTAKFSILINNKPTGAIIPQRGLRQGDPHSPHIYFCSA
         +LGF  RW+ L+M C+ +  +S+L+N +P G I+P RGLRQGDP SP+++   A
Subjt:  IKLGFHPRWVGLIMDCITTAKFSILINNKPTGAIIPQRGLRQGDPHSPHIYFCSA

TrEMBL top hitse value%identityAlignment
A0A1R3HHY9 Reverse transcriptase6.6e-4546.45Show/hide
Query:  FNATLKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDG----FLAL----------FFFR-------AYKIVAKVIVKRMKWILQDIISENQ
        FN  L++I  +V+TEMN  LL+ F  EE+  A+ QMHP+KAP PD      +AL            FR        YKI++KV+V R+K  L + ISENQ
Subjt:  FNATLKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDG----FLAL----------FFFR-------AYKIVAKVIVKRMKWILQDIISENQ

Query:  SAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWVGLIMDCITTAKFSILINNKPTGAIIPQRGLRQGD
        SAF+  R I DNI++ +E LHT++S + G++G+   KLDMSKAYDRVEW FL+ +M++LGF  RWV +IM C+ +  FS+++N + T    P RGLRQGD
Subjt:  SAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWVGLIMDCITTAKFSILINNKPTGAIIPQRGLRQGD

Query:  PHSPHIY-FCS
        P S +++ FC+
Subjt:  PHSPHIY-FCS

A0A2N9EWI9 Reverse transcriptase domain-containing protein2.1e-4343.61Show/hide
Query:  MFASIYPDEAIFNATLKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDGFL-------------------ALFFFR-------AYKIVAKVI
        +FAS  P    F  +L  +  K++  MN  L A +  EEV +A+ QMHPSK+P PDG +                   ++  FR        YKI++KV+
Subjt:  MFASIYPDEAIFNATLKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDGFL-------------------ALFFFR-------AYKIVAKVI

Query:  VKRMKWILQDIISENQSAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWVGLIMDCITTAKFSILINN
          R+K +L  ++SE+QSAF+  R I DN+ +  E LH +KS+R G+KG +  KLDMSKAYDRVEW FL+ LM KLGF   WV L+M CI T  +SI++N 
Subjt:  VKRMKWILQDIISENQSAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWVGLIMDCITTAKFSILINN

Query:  KPTGAIIPQRGLRQGDPHSPHIYFCSA
        +P G I P RG+RQGDP SP+++   A
Subjt:  KPTGAIIPQRGLRQGDPHSPHIYFCSA

A0A2N9F345 Uncharacterized protein9.5e-4446.23Show/hide
Query:  LKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDGFL-------------------ALFFFR-------AYKIVAKVIVKRMKWILQDIISEN
        + E+   V+  MN +LL  F  EEV +A+ QMHP+KAP PDG +                    +  FR        YKI +KV+V RMK +L  +ISE+
Subjt:  LKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDGFL-------------------ALFFFR-------AYKIVAKVIVKRMKWILQDIISEN

Query:  QSAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWVGLIMDCITTAKFSILINNKPTGAIIPQRGLRQG
        QSAF+  R I DN+II  E +H +K+ R G    +  KLDMSKAYDRVEW +L+ +M+KLGFHP WV LIM C+TTA ++I++N +P G + PQRGLRQG
Subjt:  QSAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWVGLIMDCITTAKFSILINNKPTGAIIPQRGLRQG

Query:  DPHSPHIYFCSA
        DP SP+++   A
Subjt:  DPHSPHIYFCSA

A0A2N9FT59 Reverse transcriptase domain-containing protein3.0e-4548.08Show/hide
Query:  LKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDGFL---------------ALFFFR-------AYKIVAKVIVKRMKWILQDIISENQSAF
        L E+   V+  MN+ LL  F  EE+ RA+ QMHPSKAP PDG +                +  FR        YKIV+KV+V RMK IL  +IS++QSAF
Subjt:  LKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDGFL---------------ALFFFR-------AYKIVAKVIVKRMKWILQDIISENQSAF

Query:  ITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWVGLIMDCITTAKFSILINNKPTGAIIPQRGLRQGDPHS
        +  R I DN+II  E +H +K+ R G    +  KLDMSKAYDRVEW +L+ +M+KLGFH +WV L+M+C+ +A +SIL+N +P G I PQRGLRQGDP S
Subjt:  ITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWVGLIMDCITTAKFSILINNKPTGAIIPQRGLRQGDPHS

Query:  PHIYFCSA
        P+++   A
Subjt:  PHIYFCSA

A0A7N2R0C3 Reverse transcriptase domain-containing protein1.6e-4337.65Show/hide
Query:  MFASIYPDEAIFNATLKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDGFLALFFFR-----------------------------------
        +++S +P E  F A L  +  +V+ +MN+ LL  F  EEV +A+ QMHP+K+P PDG   +FF +                                   
Subjt:  MFASIYPDEAIFNATLKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDGFLALFFFR-----------------------------------

Query:  -------------------AYKIVAKVIVKRMKWILQDIISENQSAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLM
                            YK+V+KV+  R+K +L D++ E QSAF+  R I DN+++  E +H I  RR G++G +  KLDMSKAYDRVEW +L+ +M
Subjt:  -------------------AYKIVAKVIVKRMKWILQDIISENQSAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLM

Query:  IKLGFHPRWVGLIMDCITTAKFSILINNKPTGAIIPQRGLRQGDPHSPHIYFCSA
         ++GF  RW+ L+M C+TT  FS+LIN +P G I+P RGLRQGDP SP+++   A
Subjt:  IKLGFHPRWVGLIMDCITTAKFSILINNKPTGAIIPQRGLRQGDPHSPHIYFCSA

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein4.9e-0525.19Show/hide
Query:  KIVAKVIVKRMKWILQDIISENQSAFITRRSIFDNIIIGHECLHTIKS-RRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWVGLIMDCITTA
        KI+ K++  R++  ++ +I  +Q  FI     + NI    + ++ I+   R   K +V   +D  KA+D+++  F+   + KLG    ++ +I       
Subjt:  KIVAKVIVKRMKWILQDIISENQSAFITRRSIFDNIIIGHECLHTIKS-RRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWVGLIMDCITTA

Query:  KFSILINNKPTGAIIPQRGLRQGDPHSPHIY
          +I++N +   A   + G RQG P SP ++
Subjt:  KFSILINNKPTGAIIPQRGLRQGDPHSPHIY

P14381 Transposon TX1 uncharacterized 149 kDa protein2.2e-1336.64Show/hide
Query:  YKIVAKVIVKRMKWILQDIISENQSAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWVGLIMDCITTA
        YKIVAK I  R+K +L ++I  +QS  +  R+IFDN+ +  + LH   +RRTG        LD  KA+DRV+  +L   +    F P++VG +     +A
Subjt:  YKIVAKVIVKRMKWILQDIISENQSAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWVGLIMDCITTA

Query:  KFSILINNKPTGAIIPQRGLRQGDPHSPHIY
        +  + IN   T  +   RG+RQG P S  +Y
Subjt:  KFSILINNKPTGAIIPQRGLRQGDPHSPHIY

Q05118 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)1.3e-0534.48Show/hide
Query:  HTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWVGLIMDCITTAKFSILINNKPTGAIIPQRGLRQGDPHSPHIY
        H IK RR   K Y    LD+ KA+D V    +   M   G        IM  IT A  +I++  + T  I  + G++QGDP SP ++
Subjt:  HTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWVGLIMDCITTAKFSILINNKPTGAIIPQRGLRQGDPHSPHIY

Arabidopsis top hitse value%identityAlignment
AT1G10000.1 Ribonuclease H-like superfamily protein5.6e-0432.61Show/hide
Query:  NPLAAEVQAIVQALRLIQRLQYTEAQVCSDSINAIWMITKEMDTNTDVAHWITSIQDMCRAFQDISFHYIPRNRNLRADAMVKHALAHTSNI
        +PLAAE  AI  A+    +L+ ++  V SDS + +  +   +  N ++   +  I+ +   F+ ISF +IPR  N  ADA  K +L  + NI
Subjt:  NPLAAEVQAIVQALRLIQRLQYTEAQVCSDSINAIWMITKEMDTNTDVAHWITSIQDMCRAFQDISFHYIPRNRNLRADAMVKHALAHTSNI

AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein6.6e-0533.68Show/hide
Query:  PLAAEVQAIVQALRLIQRLQYTEAQVCSDSINAIWMITKEMDTNTDVAHWITSIQDMCRAFQDISFHYIPRNRNLRADAMVKHALAHTSNIVQSS
        PL AE  A+  AL+  Q +  T+  + SDS   I  IT E   +T+    I  I ++   F D+SF ++PR+ N  AD + K +L   S +  S+
Subjt:  PLAAEVQAIVQALRLIQRLQYTEAQVCSDSINAIWMITKEMDTNTDVAHWITSIQDMCRAFQDISFHYIPRNRNLRADAMVKHALAHTSNIVQSS

AT4G20520.1 RNA binding;RNA-directed DNA polymerases3.6e-1137.35Show/hide
Query:  IVKRMKWILQDIISENQSAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWV
        +V+R+K ++ ++I   Q++FI  R   DNI+   E +H+++ R+ G KG++  KLD+ KAYDR+ W +L+  +I  GF   W+
Subjt:  IVKRMKWILQDIISENQSAFITRRSIFDNIIIGHECLHTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGCATCCATATACCCAGATGAGGCGATTTTCAATGCGACATTAAAGGAGATTCATTGTAAGGTGTCTACAGAGATGAATGATAAATTGTTAGCTCATTTCGATTG
CGAAGAGGTGATTCGAGCTGTTAAGCAAATGCATCCGTCCAAAGCACCTGACCCCGATGGTTTTCTTGCACTTTTTTTTTTCAGAGCATACAAGATTGTGGCCAAAGTCA
TTGTTAAACGAATGAAGTGGATTCTCCAAGATATAATCTCAGAGAATCAATCGGCCTTCATAACTAGGAGATCCATATTCGATAACATCATTATCGGTCATGAGTGTTTG
CACACCATAAAATCTAGGCGCACAGGGCGCAAGGGCTATGTGACCTTTAAATTGGATATGAGCAAAGCGTATGATCGCGTTGAATGGTCTTTTCTGAAGGTCTTAATGAT
CAAACTTGGTTTTCATCCTCGGTGGGTAGGTTTGATTATGGATTGTATCACGACTGCAAAGTTCTCAATCCTCATCAATAATAAGCCGACGGGTGCCATCATTCCTCAGC
GGGGATTACGCCAGGGAGATCCACACTCTCCTCATATTTATTTTTGCTCTGCTCTGAGGCATTGTCTTCATTACTATCAGGAGCATCCAATCAAGGAGGCATCACAGGCC
TACAACCAGGTATGGCGATTATTCACCAACCCTAGTCTTCTGGCCTCTAGAGTTATCACTGCCACGTACGCTCATGGAACTTCTTTATTTTTCGCTCCCGTTAAATCCAA
TTGCTCCTTTTTTTGGAGGAGTTGTGTTTGGGCTCGCGGTTTGCTCTTACAAGGAATGCGCAAACAGGTGGATGCGGCGGTGAATCTGAATCGTAGTGGTGTCAGATTTG
GAGTTGTCATAGTGGGGGACGGAAATGTTATTCGTTGTGCAATGGAGATGATTGAAGATGTGGATCTTAATCCATTAGCAGCAGAGGTACAAGCCATTGTCCAAGCACTT
CGTCTCATACAACGATTGCAATATACAGAAGCCCAAGTGTGTTCGGACTCTATCAATGCCATTTGGATGATTACTAAGGAGATGGATACAAACACGGATGTTGCTCATTG
GATTACCAGCATTCAGGATATGTGTAGGGCTTTTCAAGACATTTCTTTCCATTACATTCCTAGGAATAGGAATTTGAGAGCCGACGCTATGGTCAAACATGCTTTGGCAC
ATACAAGTAACATAGTACAATCTTCAAGCTACACTGCAACCATTCTCCTTGACGATTACACAGCTTCTAGGATGGAAACTCTCCTCATGCACACTTCTCCTCACAATAGG
GTACCATTCATACACGCGAATATTGGAGAAGCTGAGAGAATAACCATGTGCACGCAAGCCGTACACAATCCGTTGGAGAAGTTGTTATCAGCTGATTTGGAAGTAAACTT
CCAGCTCATCAGCAAATCTAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTGCATCCATATACCCAGATGAGGCGATTTTCAATGCGACATTAAAGGAGATTCATTGTAAGGTGTCTACAGAGATGAATGATAAATTGTTAGCTCATTTCGATTG
CGAAGAGGTGATTCGAGCTGTTAAGCAAATGCATCCGTCCAAAGCACCTGACCCCGATGGTTTTCTTGCACTTTTTTTTTTCAGAGCATACAAGATTGTGGCCAAAGTCA
TTGTTAAACGAATGAAGTGGATTCTCCAAGATATAATCTCAGAGAATCAATCGGCCTTCATAACTAGGAGATCCATATTCGATAACATCATTATCGGTCATGAGTGTTTG
CACACCATAAAATCTAGGCGCACAGGGCGCAAGGGCTATGTGACCTTTAAATTGGATATGAGCAAAGCGTATGATCGCGTTGAATGGTCTTTTCTGAAGGTCTTAATGAT
CAAACTTGGTTTTCATCCTCGGTGGGTAGGTTTGATTATGGATTGTATCACGACTGCAAAGTTCTCAATCCTCATCAATAATAAGCCGACGGGTGCCATCATTCCTCAGC
GGGGATTACGCCAGGGAGATCCACACTCTCCTCATATTTATTTTTGCTCTGCTCTGAGGCATTGTCTTCATTACTATCAGGAGCATCCAATCAAGGAGGCATCACAGGCC
TACAACCAGGTATGGCGATTATTCACCAACCCTAGTCTTCTGGCCTCTAGAGTTATCACTGCCACGTACGCTCATGGAACTTCTTTATTTTTCGCTCCCGTTAAATCCAA
TTGCTCCTTTTTTTGGAGGAGTTGTGTTTGGGCTCGCGGTTTGCTCTTACAAGGAATGCGCAAACAGGTGGATGCGGCGGTGAATCTGAATCGTAGTGGTGTCAGATTTG
GAGTTGTCATAGTGGGGGACGGAAATGTTATTCGTTGTGCAATGGAGATGATTGAAGATGTGGATCTTAATCCATTAGCAGCAGAGGTACAAGCCATTGTCCAAGCACTT
CGTCTCATACAACGATTGCAATATACAGAAGCCCAAGTGTGTTCGGACTCTATCAATGCCATTTGGATGATTACTAAGGAGATGGATACAAACACGGATGTTGCTCATTG
GATTACCAGCATTCAGGATATGTGTAGGGCTTTTCAAGACATTTCTTTCCATTACATTCCTAGGAATAGGAATTTGAGAGCCGACGCTATGGTCAAACATGCTTTGGCAC
ATACAAGTAACATAGTACAATCTTCAAGCTACACTGCAACCATTCTCCTTGACGATTACACAGCTTCTAGGATGGAAACTCTCCTCATGCACACTTCTCCTCACAATAGG
GTACCATTCATACACGCGAATATTGGAGAAGCTGAGAGAATAACCATGTGCACGCAAGCCGTACACAATCCGTTGGAGAAGTTGTTATCAGCTGATTTGGAAGTAAACTT
CCAGCTCATCAGCAAATCTAGCTGA
Protein sequenceShow/hide protein sequence
MFASIYPDEAIFNATLKEIHCKVSTEMNDKLLAHFDCEEVIRAVKQMHPSKAPDPDGFLALFFFRAYKIVAKVIVKRMKWILQDIISENQSAFITRRSIFDNIIIGHECL
HTIKSRRTGRKGYVTFKLDMSKAYDRVEWSFLKVLMIKLGFHPRWVGLIMDCITTAKFSILINNKPTGAIIPQRGLRQGDPHSPHIYFCSALRHCLHYYQEHPIKEASQA
YNQVWRLFTNPSLLASRVITATYAHGTSLFFAPVKSNCSFFWRSCVWARGLLLQGMRKQVDAAVNLNRSGVRFGVVIVGDGNVIRCAMEMIEDVDLNPLAAEVQAIVQAL
RLIQRLQYTEAQVCSDSINAIWMITKEMDTNTDVAHWITSIQDMCRAFQDISFHYIPRNRNLRADAMVKHALAHTSNIVQSSSYTATILLDDYTASRMETLLMHTSPHNR
VPFIHANIGEAERITMCTQAVHNPLEKLLSADLEVNFQLISKSS