; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038492 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038492
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H
Genome locationchr2:18692322..18697696
RNA-Seq ExpressionLag0038492
SyntenyLag0038492
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0030430 - host cell cytoplasm (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily
IPR041588 - Integrase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154103.1 uncharacterized protein LOC111021439 [Momordica charantia]1.2e-5461.82Show/hide
Query:  PSIETKPSPEAG-----EASWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIHEGVCGNHSGARSLCHK
        PSIE + + +       E +W   ++ YL  G LPTE+ EA+++Q+RA+ +VL++  LYKRGYS+PLL+CL P +A+YVMREIHEGVCG+HSGARSL HK
Subjt:  PSIETKPSPEAG-----EASWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIHEGVCGNHSGARSLCHK

Query:  IVRQGYYWPTMLQDARHFTRSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLPKVKGR
        I+RQGYYWPTML D + FT+ CD+CQRFA  PRQPPEPLTSI+S WPFAQWGI LI PLP  +G+
Subjt:  IVRQGYYWPTMLQDARHFTRSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLPKVKGR

XP_022156986.1 uncharacterized protein LOC111023816 [Momordica charantia]8.8e-5361.33Show/hide
Query:  AGEASWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIHEGVCGNHSGARSLCHKIVRQGYYWPTMLQDA
        + + +WMDPI D+L  G +P + ++A+K++R+A+H++++EG+L+KRGYS+PLLRCL P +A YVMREIHEGVCGNH GA+S+  K+VRQGYYW T+ +D 
Subjt:  AGEASWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIHEGVCGNHSGARSLCHKIVRQGYYWPTMLQDA

Query:  RHFTRSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLPKVKGR
        + F ++CD+CQRFA   RQPPE LT I SPWPFAQWGIDLIGPLP  KG+
Subjt:  RHFTRSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLPKVKGR

XP_022158215.1 uncharacterized protein LOC111024751 [Momordica charantia]1.4e-5363.7Show/hide
Query:  SWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIHEGVCGNHSGARSLCHKIVRQGYYWPTMLQDARHFT
        +WMDPI D+L  G +P + ++A+K++R+A+H++++EG+L+KRGYS+PLLRCL P +A YVMREIHEGVCGNH GARS+  K+VRQGYYWPT+ +D + F 
Subjt:  SWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIHEGVCGNHSGARSLCHKIVRQGYYWPTMLQDARHFT

Query:  RSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLPKVKGR
        ++CD+CQRFA   RQPPE LT I SPWPFAQWGIDLI PLP  KG+
Subjt:  RSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLPKVKGR

XP_022158579.1 uncharacterized protein LOC111025033 [Momordica charantia]1.1e-5864.85Show/hide
Query:  PSIETKPSPEAG-----EASWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIHEGVCGNHSGARSLCHK
        PSIE + + +       E +W  P++ YL  G LPTE+ EA+++QRRA+ +VL++  LYK GYS+PLL+CL P +A+YVMREIHEGVCG+HSGARSL HK
Subjt:  PSIETKPSPEAG-----EASWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIHEGVCGNHSGARSLCHK

Query:  IVRQGYYWPTMLQDARHFTRSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLPKVKGR
        I+RQGYYWPTML D + FT+ CD+CQRFA  PRQPPEPLTSI+SPWPFAQWGIDLIGPLP  KG+
Subjt:  IVRQGYYWPTMLQDARHFTRSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLPKVKGR

XP_024036848.1 uncharacterized protein LOC112096880 [Citrus clementina]2.0e-5255.56Show/hide
Query:  AEAEDPRLEVGDPADGEGIPSIETKPSPEAGE--ASWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIH
        A   DP++    P + + IPSIE        E   SWM+PI+ YL  G LP ++  A+K++ +AS + + +G LY+RGY++P L+CL   DA+YV+RE+H
Subjt:  AEAEDPRLEVGDPADGEGIPSIETKPSPEAGE--ASWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIH

Query:  EGVCGNHSGARSLCHKIVRQGYYWPTMLQDARHFTRSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLPKVKG
        EG+CGNHSG RSL HK++RQGY+WPTM QDA+  TRSC  CQ FA  P QPPE LTS++SPWPFAQWGIDLIGPLPK +G
Subjt:  EGVCGNHSGARSLCHKIVRQGYYWPTMLQDARHFTRSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLPKVKG

TrEMBL top hitse value%identityAlignment
A0A2N9H093 Ribonuclease H2.7e-5258.18Show/hide
Query:  EGIPSIETKPSPE-------AGEASWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIHEGVCGNHSGAR
        +G   I+ +PS E       AG  +WM PI +YL +G LP +RTEA K++ RASHF L  G LYK G+S P LRCL P +ANYV+RE+HEG+CGNHSGAR
Subjt:  EGIPSIETKPSPE-------AGEASWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIHEGVCGNHSGAR

Query:  SLCHKIVRQGYYWPTMLQDARHFTRSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLP
        SL HK+ R GYYWP++L DA  + ++CD+CQRFA  PR PPE +T I SPWPFAQWG+D++GP P
Subjt:  SLCHKIVRQGYYWPTMLQDARHFTRSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLP

A0A6J1DJC7 Ribonuclease H5.9e-5561.82Show/hide
Query:  PSIETKPSPEAG-----EASWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIHEGVCGNHSGARSLCHK
        PSIE + + +       E +W   ++ YL  G LPTE+ EA+++Q+RA+ +VL++  LYKRGYS+PLL+CL P +A+YVMREIHEGVCG+HSGARSL HK
Subjt:  PSIETKPSPEAG-----EASWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIHEGVCGNHSGARSLCHK

Query:  IVRQGYYWPTMLQDARHFTRSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLPKVKGR
        I+RQGYYWPTML D + FT+ CD+CQRFA  PRQPPEPLTSI+S WPFAQWGI LI PLP  +G+
Subjt:  IVRQGYYWPTMLQDARHFTRSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLPKVKGR

A0A6J1DV78 Ribonuclease H6.5e-5463.7Show/hide
Query:  SWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIHEGVCGNHSGARSLCHKIVRQGYYWPTMLQDARHFT
        +WMDPI D+L  G +P + ++A+K++R+A+H++++EG+L+KRGYS+PLLRCL P +A YVMREIHEGVCGNH GARS+  K+VRQGYYWPT+ +D + F 
Subjt:  SWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIHEGVCGNHSGARSLCHKIVRQGYYWPTMLQDARHFT

Query:  RSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLPKVKGR
        ++CD+CQRFA   RQPPE LT I SPWPFAQWGIDLI PLP  KG+
Subjt:  RSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLPKVKGR

A0A6J1DWM2 Ribonuclease H4.2e-5361.33Show/hide
Query:  AGEASWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIHEGVCGNHSGARSLCHKIVRQGYYWPTMLQDA
        + + +WMDPI D+L  G +P + ++A+K++R+A+H++++EG+L+KRGYS+PLLRCL P +A YVMREIHEGVCGNH GA+S+  K+VRQGYYW T+ +D 
Subjt:  AGEASWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIHEGVCGNHSGARSLCHKIVRQGYYWPTMLQDA

Query:  RHFTRSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLPKVKGR
        + F ++CD+CQRFA   RQPPE LT I SPWPFAQWGIDLIGPLP  KG+
Subjt:  RHFTRSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLPKVKGR

A0A6J1DZU0 Ribonuclease H5.2e-5964.85Show/hide
Query:  PSIETKPSPEAG-----EASWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIHEGVCGNHSGARSLCHK
        PSIE + + +       E +W  P++ YL  G LPTE+ EA+++QRRA+ +VL++  LYK GYS+PLL+CL P +A+YVMREIHEGVCG+HSGARSL HK
Subjt:  PSIETKPSPEAG-----EASWMDPILDYLGKGELPTERTEAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIHEGVCGNHSGARSLCHK

Query:  IVRQGYYWPTMLQDARHFTRSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLPKVKGR
        I+RQGYYWPTML D + FT+ CD+CQRFA  PRQPPEPLTSI+SPWPFAQWGIDLIGPLP  KG+
Subjt:  IVRQGYYWPTMLQDARHFTRSCDQCQRFAPTPRQPPEPLTSIISPWPFAQWGIDLIGPLPKVKGR

SwissProt top hitse value%identityAlignment
P08770 Putative AC transposase2.8e-0943.84Show/hide
Query:  FLSKKKDLDDGKT-EVARYLEEACVEDE-NFDILNWWKVNYARFKIISQVARDIYSIPISTVPSESAFSTGGR
        +L + KD D  ++ E+ +Y+ E  ++    FDIL+WW+   A + I++Q+ARD+ +I +STV SESAFS GGR
Subjt:  FLSKKKDLDDGKT-EVARYLEEACVEDE-NFDILNWWKVNYARFKIISQVARDIYSIPISTVPSESAFSTGGR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.5e-1039.05Show/hide
Query:  MFLSQSKYILDLLHKTKMAGANSIATPMVSGPLLSSR-------QGDNFSDVKLYRSVVGSLQYATI-TRPEISFSVNKACQFMHAPTVIHWQPVKRILR
        ++LSQ KYI  +L +  M  A  ++TP+     LS +       +  N + V  Y S VGSL YA + TRP+I+ +V    +F+  P   HW+ VK ILR
Subjt:  MFLSQSKYILDLLHKTKMAGANSIATPMVSGPLLSSR-------QGDNFSDVKLYRSVVGSLQYATI-TRPEISFSVNKACQFMHAPTVIHWQPVKRILR

Query:  YLKGT
        YL+GT
Subjt:  YLKGT

P92519 Uncharacterized mitochondrial protein AtMg008101.0e-1940.31Show/hide
Query:  YNEGVEVSYPPTGGMFLSQSKYILDLLHKTKMAGANSIATPMVSGPLLSSRQGDNFSDVKLYRSVVGSLQYATITRPEISFSVNKACQFMHAPTVIHWQP
        Y  G+++   P+ G+FLSQ+KY   +L+   M     ++TP+    L SS     + D   +RS+VG+LQY T+TRP+IS++VN  CQ MH PT+  +  
Subjt:  YNEGVEVSYPPTGGMFLSQSKYILDLLHKTKMAGANSIATPMVSGPLLSSRQGDNFSDVKLYRSVVGSLQYATITRPEISFSVNKACQFMHAPTVIHWQP

Query:  VKRILRYLKGTLSHGLMLYVPSSLLLHCY
        +KR+LRY+KGT+ HGL ++  S L +  +
Subjt:  VKRILRYLKGTLSHGLMLYVPSSLLLHCY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.3e-2750.39Show/hide
Query:  YNEGVEVSYPPTGGMFLSQSKYILDLLHKTKMAGANSIATPMVSGPLLSSRQGDNFSDVKLYRSVVGSLQYATITRPEISFSVNKACQFMHAPTVIHWQP
        Y  G+E    PT G+ LSQ +YILDLL +T M  A  + TPM   P LS   G   +D   YR +VGSLQY   TRP+IS++VN+  QFMH PT  H Q 
Subjt:  YNEGVEVSYPPTGGMFLSQSKYILDLLHKTKMAGANSIATPMVSGPLLSSRQGDNFSDVKLYRSVVGSLQYATITRPEISFSVNKACQFMHAPTVIHWQP

Query:  VKRILRYLKGTLSHGLMLYVPSSLLLHCY
        +KRILRYL GT +HG+ L   ++L LH Y
Subjt:  VKRILRYLKGTLSHGLMLYVPSSLLLHCY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.9e-2646.51Show/hide
Query:  YNEGVEVSYPPTGGMFLSQSKYILDLLHKTKMAGANSIATPMVSGPLLSSRQGDNFSDVKLYRSVVGSLQYATITRPEISFSVNKACQFMHAPTVIHWQP
        Y  G+E    P  G+ LSQ +Y LDLL +T M  A  +ATPM + P L+   G    D   YR +VGSLQY   TRP++S++VN+  Q+MH PT  HW  
Subjt:  YNEGVEVSYPPTGGMFLSQSKYILDLLHKTKMAGANSIATPMVSGPLLSSRQGDNFSDVKLYRSVVGSLQYATITRPEISFSVNKACQFMHAPTVIHWQP

Query:  VKRILRYLKGTLSHGLMLYVPSSLLLHCY
        +KR+LRYL GT  HG+ L   ++L LH Y
Subjt:  VKRILRYLKGTLSHGLMLYVPSSLLLHCY

Arabidopsis top hitse value%identityAlignment
AT1G18560.1 BED zinc finger ;hAT family dimerisation domain1.2e-0435.59Show/hide
Query:  EVARYLEEACVEDENFDILNWWKVNYARFKIISQVARDIYSIPISTVPSESAFSTGGRE
        E+ +YL E+ V  +  D+L+WWKVN  R+  +S +ARD  ++  ++   E  F   G E
Subjt:  EVARYLEEACVEDENFDILNWWKVNYARFKIISQVARDIYSIPISTVPSESAFSTGGRE

AT3G42170.1 BED zinc finger ;hAT family dimerisation domain1.0e-0638.71Show/hide
Query:  KTEVARYLEEACV-EDENFDILNWWKVNYARFKIISQVARDIYSIPISTVPSESAFSTGGRE
        K+E+ +YL+E  +   + FD+L+WWK N  ++  +S++ARDI SIP+S    +  F    RE
Subjt:  KTEVARYLEEACV-EDENFDILNWWKVNYARFKIISQVARDIYSIPISTVPSESAFSTGGRE

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.7e-2035.26Show/hide
Query:  YNEGVEVSYPPTGGMFLSQSKYILDLLHKTKMAGANSIATPMVSGPLLSSRQGDNFSDVKLYRSVVGSLQYATITRPEISFSVNKACQFMHAPTVIHWQP
        Y  G+E++     G+ + Q KY LDLL +T + G    + PM      S+  G +F D K YR ++G L Y  ITR +ISF+VNK  QF  AP + H Q 
Subjt:  YNEGVEVSYPPTGGMFLSQSKYILDLLHKTKMAGANSIATPMVSGPLLSSRQGDNFSDVKLYRSVVGSLQYATITRPEISFSVNKACQFMHAPTVIHWQP

Query:  VKRILRYLKGTLSHGLMLYVPSSLLLHCYMAMQMRTGLQTRMTANPPLDFASSLLL
        V +IL Y+KGT+  GL     + + L  +     ++   TR + N    F  + L+
Subjt:  VKRILRYLKGTLSHGLMLYVPSSLLLHCYMAMQMRTGLQTRMTANPPLDFASSLLL

ATMG00240.1 Gag-Pol-related retrotransposon family protein3.8e-0640.68Show/hide
Query:  YATITRPEISFSVNKACQFMHAPTVIHWQPVKRILRYLKGTLSHGLMLYVPSSLLLHCY
        Y TITRP+++F+VN+  QF  A      Q V ++L Y+KGT+  GL     S L L  +
Subjt:  YATITRPEISFSVNKACQFMHAPTVIHWQPVKRILRYLKGTLSHGLMLYVPSSLLLHCY

ATMG00810.1 DNA/RNA polymerases superfamily protein7.2e-2140.31Show/hide
Query:  YNEGVEVSYPPTGGMFLSQSKYILDLLHKTKMAGANSIATPMVSGPLLSSRQGDNFSDVKLYRSVVGSLQYATITRPEISFSVNKACQFMHAPTVIHWQP
        Y  G+++   P+ G+FLSQ+KY   +L+   M     ++TP+    L SS     + D   +RS+VG+LQY T+TRP+IS++VN  CQ MH PT+  +  
Subjt:  YNEGVEVSYPPTGGMFLSQSKYILDLLHKTKMAGANSIATPMVSGPLLSSRQGDNFSDVKLYRSVVGSLQYATITRPEISFSVNKACQFMHAPTVIHWQP

Query:  VKRILRYLKGTLSHGLMLYVPSSLLLHCY
        +KR+LRY+KGT+ HGL ++  S L +  +
Subjt:  VKRILRYLKGTLSHGLMLYVPSSLLLHCY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTCGAGTGGAACTGGATCTTATTAAGGTACGTGCTGCTGTCCATGATAGATATATGTTTTTAAGTAAGAAAAAAGATCTAGATGATGGTAAAACAGAGGTGGCTCG
TTATTTAGAAGAAGCTTGTGTAGAGGATGAAAATTTTGATATATTAAATTGGTGGAAGGTAAATTATGCTCGATTCAAGATTATCAGCCAAGTAGCCAGGGACATCTATA
GCATTCCTATATCTACCGTGCCATCTGAATCAGCTTTTAGCACTGGAGGACGGGAGAAATTTCAAGCGATCACAAGAAAAGGAGGCTGCTGCGTTTTCGTTCGTGGGGCG
TCGTTGGCGAAGAACGGTCAAGTCTACAACGAAGGTGTGGAAGTTTCGTACCCTCCAACTGGGGGCATGTTTTTGTCCCAATCAAAGTACATACTTGATCTTCTGCATAA
GACAAAAATGGCAGGGGCTAACAGCATAGCCACACCAATGGTGAGTGGCCCTTTACTGTCAAGCCGACAAGGTGACAATTTTTCTGATGTCAAACTGTATCGCAGTGTTG
TAGGCTCTCTACAATATGCTACCATTACACGCCCTGAGATATCTTTTAGTGTAAACAAAGCATGTCAATTCATGCATGCTCCCACGGTTATTCATTGGCAACCGGTGAAG
CGAATCTTGCGATACTTAAAGGGTACATTAAGCCATGGTCTTATGCTATATGTTCCATCTTCCCTATTGTTACATTGTTACATGGCTATGCAGATGCGGACTGGGCTTCA
GACCCGGATGACCGCAAATCCACCTCTGGATTTTGCATCTTCTTTGCTGCTACGGAACTCGTATGGTTACATACTCTGTTTTGTGAACTCCGCATATCCTTATCCCAAAA
ACCGATACTTTGAAGAAAATCCTGTTGCAACATCTACCGGTTGTAGAACAGATTGCGGACATATTAACAAAGCCTCTATCTGCTACATCCTTCCTACAGTTAAAGAACAA
GCTCAATGTCCGAGATCCTCTCTCCATTGGCTTGCCGGGGGGGGGGTGATTTTGGACCACACAGATGGACAAGGAGCTGACGAGGACAATCGGGCAGAGGTAGGATCAAG
AGACCGACCCAGGGGAAGACCGGACCAAAGGGTCGAGCCAAAATGGCCCGACCCATATGGTCGGCCTCGGCCTTTTGCCGAGGCCGAGCATATGGTCGGCCTTGGCAAAA
TGCCGAGGCCGACCATCCGGCCCGTTTGCACGGGCCGAGCCCGCTTCTCCTCAGTTTTCTTACTTAGGCATCGGAGGCGGTGTGGCCTACACCACGCCGGTGTGCAGCGG
TTTTTGCTGGTCTTGCAGGTCACGTCTTCCCCAGTTCCTACAAATTCACTGTTGGTGTCACGTGAAGGTCGGGTGAGTCTTCTGTCCGGATTTTGGCATCAACAGTTGGC
GCCATCTGTGGGGGGGGAGCCTGAGGATAAGTCCGAGGCCGAGCCTGAGGATAAGTCCGAGGCCGAGGCGGAAGATCCGAGGCTTGAGGTCGGTGATCCTGCAGATGGGG
AGGGTATCCCATCAATTGAAACTAAGCCCTCCCCAGAAGCAGGGGAAGCTTCATGGATGGATCCGATCCTTGATTACTTAGGGAAAGGCGAGCTGCCAACAGAGAGGACA
GAGGCTAAAAAGGTTCAGAGGCGAGCATCGCATTTTGTGTTAAGGGAAGGGAGACTGTATAAAAGAGGCTACTCGATGCCGCTGTTGAGGTGCCTTCCCCCTAGTGATGC
TAACTACGTTATGAGAGAGATACATGAGGGGGTGTGTGGAAATCACTCAGGAGCAAGGTCGTTGTGTCACAAGATAGTCAGGCAAGGTTACTATTGGCCAACCATGTTGC
AAGACGCTAGGCACTTCACTAGATCTTGTGATCAATGCCAAAGGTTTGCACCCACTCCAAGGCAACCACCTGAGCCTTTAACGAGCATTATCAGCCCGTGGCCGTTCGCG
CAATGGGGAATCGATCTGATCGGACCTCTCCCCAAGGTAAAGGGCAGACGAAGTATGCAGTTGTGGCTGTGGATTACTTCACAAAATGGGCAGAGGCAGAACCACTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGTCGAGTGGAACTGGATCTTATTAAGGTACGTGCTGCTGTCCATGATAGATATATGTTTTTAAGTAAGAAAAAAGATCTAGATGATGGTAAAACAGAGGTGGCTCG
TTATTTAGAAGAAGCTTGTGTAGAGGATGAAAATTTTGATATATTAAATTGGTGGAAGGTAAATTATGCTCGATTCAAGATTATCAGCCAAGTAGCCAGGGACATCTATA
GCATTCCTATATCTACCGTGCCATCTGAATCAGCTTTTAGCACTGGAGGACGGGAGAAATTTCAAGCGATCACAAGAAAAGGAGGCTGCTGCGTTTTCGTTCGTGGGGCG
TCGTTGGCGAAGAACGGTCAAGTCTACAACGAAGGTGTGGAAGTTTCGTACCCTCCAACTGGGGGCATGTTTTTGTCCCAATCAAAGTACATACTTGATCTTCTGCATAA
GACAAAAATGGCAGGGGCTAACAGCATAGCCACACCAATGGTGAGTGGCCCTTTACTGTCAAGCCGACAAGGTGACAATTTTTCTGATGTCAAACTGTATCGCAGTGTTG
TAGGCTCTCTACAATATGCTACCATTACACGCCCTGAGATATCTTTTAGTGTAAACAAAGCATGTCAATTCATGCATGCTCCCACGGTTATTCATTGGCAACCGGTGAAG
CGAATCTTGCGATACTTAAAGGGTACATTAAGCCATGGTCTTATGCTATATGTTCCATCTTCCCTATTGTTACATTGTTACATGGCTATGCAGATGCGGACTGGGCTTCA
GACCCGGATGACCGCAAATCCACCTCTGGATTTTGCATCTTCTTTGCTGCTACGGAACTCGTATGGTTACATACTCTGTTTTGTGAACTCCGCATATCCTTATCCCAAAA
ACCGATACTTTGAAGAAAATCCTGTTGCAACATCTACCGGTTGTAGAACAGATTGCGGACATATTAACAAAGCCTCTATCTGCTACATCCTTCCTACAGTTAAAGAACAA
GCTCAATGTCCGAGATCCTCTCTCCATTGGCTTGCCGGGGGGGGGGTGATTTTGGACCACACAGATGGACAAGGAGCTGACGAGGACAATCGGGCAGAGGTAGGATCAAG
AGACCGACCCAGGGGAAGACCGGACCAAAGGGTCGAGCCAAAATGGCCCGACCCATATGGTCGGCCTCGGCCTTTTGCCGAGGCCGAGCATATGGTCGGCCTTGGCAAAA
TGCCGAGGCCGACCATCCGGCCCGTTTGCACGGGCCGAGCCCGCTTCTCCTCAGTTTTCTTACTTAGGCATCGGAGGCGGTGTGGCCTACACCACGCCGGTGTGCAGCGG
TTTTTGCTGGTCTTGCAGGTCACGTCTTCCCCAGTTCCTACAAATTCACTGTTGGTGTCACGTGAAGGTCGGGTGAGTCTTCTGTCCGGATTTTGGCATCAACAGTTGGC
GCCATCTGTGGGGGGGGAGCCTGAGGATAAGTCCGAGGCCGAGCCTGAGGATAAGTCCGAGGCCGAGGCGGAAGATCCGAGGCTTGAGGTCGGTGATCCTGCAGATGGGG
AGGGTATCCCATCAATTGAAACTAAGCCCTCCCCAGAAGCAGGGGAAGCTTCATGGATGGATCCGATCCTTGATTACTTAGGGAAAGGCGAGCTGCCAACAGAGAGGACA
GAGGCTAAAAAGGTTCAGAGGCGAGCATCGCATTTTGTGTTAAGGGAAGGGAGACTGTATAAAAGAGGCTACTCGATGCCGCTGTTGAGGTGCCTTCCCCCTAGTGATGC
TAACTACGTTATGAGAGAGATACATGAGGGGGTGTGTGGAAATCACTCAGGAGCAAGGTCGTTGTGTCACAAGATAGTCAGGCAAGGTTACTATTGGCCAACCATGTTGC
AAGACGCTAGGCACTTCACTAGATCTTGTGATCAATGCCAAAGGTTTGCACCCACTCCAAGGCAACCACCTGAGCCTTTAACGAGCATTATCAGCCCGTGGCCGTTCGCG
CAATGGGGAATCGATCTGATCGGACCTCTCCCCAAGGTAAAGGGCAGACGAAGTATGCAGTTGTGGCTGTGGATTACTTCACAAAATGGGCAGAGGCAGAACCACTAG
Protein sequenceShow/hide protein sequence
MRRVELDLIKVRAAVHDRYMFLSKKKDLDDGKTEVARYLEEACVEDENFDILNWWKVNYARFKIISQVARDIYSIPISTVPSESAFSTGGREKFQAITRKGGCCVFVRGA
SLAKNGQVYNEGVEVSYPPTGGMFLSQSKYILDLLHKTKMAGANSIATPMVSGPLLSSRQGDNFSDVKLYRSVVGSLQYATITRPEISFSVNKACQFMHAPTVIHWQPVK
RILRYLKGTLSHGLMLYVPSSLLLHCYMAMQMRTGLQTRMTANPPLDFASSLLLRNSYGYILCFVNSAYPYPKNRYFEENPVATSTGCRTDCGHINKASICYILPTVKEQ
AQCPRSSLHWLAGGGVILDHTDGQGADEDNRAEVGSRDRPRGRPDQRVEPKWPDPYGRPRPFAEAEHMVGLGKMPRPTIRPVCTGRARFSSVFLLRHRRRCGLHHAGVQR
FLLVLQVTSSPVPTNSLLVSREGRVSLLSGFWHQQLAPSVGGEPEDKSEAEPEDKSEAEAEDPRLEVGDPADGEGIPSIETKPSPEAGEASWMDPILDYLGKGELPTERT
EAKKVQRRASHFVLREGRLYKRGYSMPLLRCLPPSDANYVMREIHEGVCGNHSGARSLCHKIVRQGYYWPTMLQDARHFTRSCDQCQRFAPTPRQPPEPLTSIISPWPFA
QWGIDLIGPLPKVKGRRSMQLWLWITSQNGQRQNH