CuGenDBv2

Lag0031938 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0031938
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr11:19695215..19707553
RNA-Seq Expression: Lag0031938
Synteny: Lag0031938
Gene Ontology terms:
  GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
  GO:0016787 - hydrolase activity (molecular function)
InterPro domains:
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
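
The genome location above uses a "chromosome:start..end" notation. As a minimal sketch, and assuming the coordinates are 1-based and inclusive (the record itself does not state the convention), the string can be split into its parts and the genomic span computed as follows. The function name `parse_location` is a hypothetical helper, not part of any CuGenDB tooling.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr11:19695215..19707553")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, span)  # chr11 12339
```

Under a 0-based, half-open convention the span would be one base shorter, so the convention is worth confirming before using the coordinates downstream.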


Homology
GenBank top hits (columns: hit, E value, % identity, alignment)
KAA0037581.1 reverse transcriptase [Cucumis melo var. makuwa] | E value: 3.4e-29 | Identity: 39.24%
Query:  ESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCRTPVCWNK--------------------------------
        ++DGQS+RTIQTLEDMLRACVLQFKG+WDTHL LMEFAYNN+YQ SIGM PFEALYGR CRTPVCWN+                                
Subjt:  ESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCRTPVCWNK--------------------------------

Query:  ---------GEEDL-----------------------------WFSLVFQAFWGVQVESEAQFG---DFWTNLLGTSFS---------------------
                  EE+L                             WF +       +Q +     G   + W     T FS                     
Subjt:  ---------GEEDL-----------------------------WFSLVFQAFWGVQVESEAQFG---DFWTNLLGTSFS---------------------

Query:  ---FLQVEIELLVPDTLPTSAESSRSSSNTWVKLYIE
              VEIEL VPDTL TSAESSRS+S+TW++LY E
Subjt:  ---FLQVEIELLVPDTLPTSAESSRSSSNTWVKLYIE

KAA0064296.1 putative DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] | E value: 3.5e-34 | Identity: 45.81%
Query:  SDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCRTPVCWNKGEED------------------LWF--------
        +DGQSERTIQTLEDMLRACVLQFKG+WDT+LSLMEFAYNN+YQ SIGM PFEALYGRPCRTPVCWN+ E+D                  LWF        
Subjt:  SDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCRTPVCWNKGEED------------------LWF--------

Query:  --------------SLVFQAFWGV------------QVES-----------EAQFGDFWTNLLG--TSFSFLQ-----VEIELLVPDTLPTSAESSRSSS
                       LVF     +             +ES             Q G+    ++   ++FS L      VEIEL VPDTLPTSAES  S+S
Subjt:  --------------SLVFQAFWGV------------QVES-----------EAQFGDFWTNLLG--TSFSFLQ-----VEIELLVPDTLPTSAESSRSSS

Query:  NTWVK-----LYIEFSDSTGSTGIVRG
        + +       L++ F+ S GSTGIVRG
Subjt:  NTWVK-----LYIEFSDSTGSTGIVRG

KAA0066351.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] | E value: 9.9e-29 | Identity: 60.87%
Query:  IAEFGFPQAI-SQRPKRIVGDSSGGVRWKSQERDPWSIVQCSS----ESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFE
        ++++G P +I S R  R          W S ++   + ++ S+    ++DGQSERTIQTLEDMLRACVLQ KGSWDTHL LMEFAYNNNYQ SIGMTP+E
Subjt:  IAEFGFPQAI-SQRPKRIVGDSSGGVRWKSQERDPWSIVQCSS----ESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFE

Query:  ALYGRPCRTPVCWNK
        ALYGRPCRTPVCWN+
Subjt:  ALYGRPCRTPVCWNK

TYJ97418.1 putative DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] | E value: 5.8e-29 | Identity: 87.67%
Query:  SDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCRTPVCWNK-GEEDL
        +DGQSERTIQTLEDMLRACVLQFKG+WDTHLSLMEFAYNN+YQ SIGM PFEALYGRPCRTPVCWN+ GE  L
Subjt:  SDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCRTPVCWNK-GEEDL

TYK03091.1 reverse transcriptase [Cucumis melo var. makuwa] | E value: 3.7e-31 | Identity: 45.77%
Query:  ESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCRTPVCWNKGEED--------LWF-------------SLVF
        ++DGQS+RTIQTLEDMLRACVLQFKG+WDTHL LMEFAYNN+YQ SIGM PFEALYGR CRTPVCWN+  E         L F             S V 
Subjt:  ESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCRTPVCWNKGEED--------LWF-------------SLVF

Query:  QAFWGVQVESEAQFGDFWTNLL----------------GTSFS------------------------FLQVEIELLVPDTLPTSAESSRSSSNTWVKLYI
        Q    ++++ +  + +    +L                 T FS                           VEIEL VPDTL TSAESSRS+S+TW++LY 
Subjt:  QAFWGVQVESEAQFGDFWTNLL----------------GTSFS------------------------FLQVEIELLVPDTLPTSAESSRSSSNTWVKLYI

Query:  E
        E
Subjt:  E
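
The % identity values listed for each hit can be checked against the alignment midline, where a letter marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap. The sketch below assumes the usual BLAST convention that identity is the number of identical positions divided by the alignment length (gaps included), and uses the TYJ97418.1 alignment from this record as input. Only the query row and the midline are used, because the Subjt rows on this page repeat the query text. The function name `percent_identity` is a hypothetical helper.

```python
def percent_identity(midline, aligned_length):
    """Percent of alignment columns whose midline character is a letter."""
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / aligned_length


# Query row and midline copied from the TYJ97418.1 alignment in this record.
query = "SDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCRTPVCWNK-GEEDL"
midline = "+DGQSERTIQTLEDMLRACVLQFKG+WDTHLSLMEFAYNN+YQ SIGM PFEALYGRPCRTPVCWN+ GE  L"

# The alignment length is the length of the gapped query row (73 columns).
print(round(percent_identity(midline, len(query)), 2))  # 87.67
```

The result, 64 identical positions out of 73 columns, agrees with the 87.67 reported for this hit.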

TrEMBL top hits (columns: hit, E value, % identity, alignment)
A0A5A7T7M6 Reverse transcriptase | E value: 1.7e-29 | Identity: 39.24%
Query:  ESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCRTPVCWNK--------------------------------
        ++DGQS+RTIQTLEDMLRACVLQFKG+WDTHL LMEFAYNN+YQ SIGM PFEALYGR CRTPVCWN+                                
Subjt:  ESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCRTPVCWNK--------------------------------

Query:  ---------GEEDL-----------------------------WFSLVFQAFWGVQVESEAQFG---DFWTNLLGTSFS---------------------
                  EE+L                             WF +       +Q +     G   + W     T FS                     
Subjt:  ---------GEEDL-----------------------------WFSLVFQAFWGVQVESEAQFG---DFWTNLLGTSFS---------------------

Query:  ---FLQVEIELLVPDTLPTSAESSRSSSNTWVKLYIE
              VEIEL VPDTL TSAESSRS+S+TW++LY E
Subjt:  ---FLQVEIELLVPDTLPTSAESSRSSSNTWVKLYIE

A0A5A7VD33 Putative DNA/RNA polymerases superfamily protein | E value: 1.7e-34 | Identity: 45.81%
Query:  SDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCRTPVCWNKGEED------------------LWF--------
        +DGQSERTIQTLEDMLRACVLQFKG+WDT+LSLMEFAYNN+YQ SIGM PFEALYGRPCRTPVCWN+ E+D                  LWF        
Subjt:  SDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCRTPVCWNKGEED------------------LWF--------

Query:  --------------SLVFQAFWGV------------QVES-----------EAQFGDFWTNLLG--TSFSFLQ-----VEIELLVPDTLPTSAESSRSSS
                       LVF     +             +ES             Q G+    ++   ++FS L      VEIEL VPDTLPTSAES  S+S
Subjt:  --------------SLVFQAFWGV------------QVES-----------EAQFGDFWTNLLG--TSFSFLQ-----VEIELLVPDTLPTSAESSRSSS

Query:  NTWVK-----LYIEFSDSTGSTGIVRG
        + +       L++ F+ S GSTGIVRG
Subjt:  NTWVK-----LYIEFSDSTGSTGIVRG

A0A5A7VL47 DNA/RNA polymerases superfamily protein | E value: 4.8e-29 | Identity: 60.87%
Query:  IAEFGFPQAI-SQRPKRIVGDSSGGVRWKSQERDPWSIVQCSS----ESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFE
        ++++G P +I S R  R          W S ++   + ++ S+    ++DGQSERTIQTLEDMLRACVLQ KGSWDTHL LMEFAYNNNYQ SIGMTP+E
Subjt:  IAEFGFPQAI-SQRPKRIVGDSSGGVRWKSQERDPWSIVQCSS----ESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFE

Query:  ALYGRPCRTPVCWNK
        ALYGRPCRTPVCWN+
Subjt:  ALYGRPCRTPVCWNK

A0A5D3BGW7 Putative DNA/RNA polymerases superfamily protein | E value: 2.8e-29 | Identity: 87.67%
Query:  SDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCRTPVCWNK-GEEDL
        +DGQSERTIQTLEDMLRACVLQFKG+WDTHLSLMEFAYNN+YQ SIGM PFEALYGRPCRTPVCWN+ GE  L
Subjt:  SDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCRTPVCWNK-GEEDL

A0A5D3BTP3 Reverse transcriptase | E value: 1.8e-31 | Identity: 45.77%
Query:  ESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCRTPVCWNKGEED--------LWF-------------SLVF
        ++DGQS+RTIQTLEDMLRACVLQFKG+WDTHL LMEFAYNN+YQ SIGM PFEALYGR CRTPVCWN+  E         L F             S V 
Subjt:  ESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCRTPVCWNKGEED--------LWF-------------SLVF

Query:  QAFWGVQVESEAQFGDFWTNLL----------------GTSFS------------------------FLQVEIELLVPDTLPTSAESSRSSSNTWVKLYI
        Q    ++++ +  + +    +L                 T FS                           VEIEL VPDTL TSAESSRS+S+TW++LY 
Subjt:  QAFWGVQVESEAQFGDFWTNLL----------------GTSFS------------------------FLQVEIELLVPDTLPTSAESSRSSSNTWVKLYI

Query:  E
        E
Subjt:  E

SwissProt top hits (columns: hit, E value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein | E value: 3.6e-05 | Identity: 41.07%
Query:  ESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALY
        ++DGQ+ERT QT+E +LR        +W  H+SL++ +YNN    +  MTPFE ++
Subjt:  ESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALY

P0CT41 Transposon Tf2-12 polyprotein | E value: 3.6e-05 | Identity: 41.07%
Query:  ESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALY
        ++DGQ+ERT QT+E +LR        +W  H+SL++ +YNN    +  MTPFE ++
Subjt:  ESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALY

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | E value: 2.5e-06 | Identity: 36.27%
Query:  GFPQAI-SQRPKRIVGDSSGGVRWKSQERDPWSIVQCSSESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCR
        GFP+ I S R  R+  D    +  +   +   S      ++DGQSERTIQTL  +LRA V     +W  +L  +EF YN+    ++G +PFE   G    
Subjt:  GFPQAI-SQRPKRIVGDSSGGVRWKSQERDPWSIVQCSSESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCR

Query:  TP
        TP
Subjt:  TP

Q99315 Transposon Ty3-G Gag-Pol polyprotein | E value: 7.2e-06 | Identity: 35.29%
Query:  GFPQAI-SQRPKRIVGDSSGGVRWKSQERDPWSIVQCSSESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCR
        GFP+ I S R  R+  D    +  +   +   S      ++DGQSERTIQTL  +LRA       +W  +L  +EF YN+    ++G +PFE   G    
Subjt:  GFPQAI-SQRPKRIVGDSSGGVRWKSQERDPWSIVQCSSESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALYGRPCR

Query:  TP
        TP
Subjt:  TP

Q9UR07 Transposon Tf2-11 polyprotein | E value: 3.6e-05 | Identity: 41.07%
Query:  ESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALY
        ++DGQ+ERT QT+E +LR        +W  H+SL++ +YNN    +  MTPFE ++
Subjt:  ESDGQSERTIQTLEDMLRACVLQFKGSWDTHLSLMEFAYNNNYQFSIGMTPFEALY

Arabidopsis top hits (columns: hit, E value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCATCAACAAAAGTTGTTGATAAGACTGGTTATTCAACAAGAGTTGTTGATAAGGCCAGTGCTTTTGGTTACAAGTGGATTTACAAGAGGAAACGAGACATTTAAGGT
TATCTTACTGACATTAAAAGAGGCTAGCAGCTCATTTCCAAATGAAAGTCTGGGAGAAGCTGAGTTAGTCTTGTCTCAATTGTCTTATGTCTACAAGATGAATGGAATTC
TGTTGTCTAAGGAACTGTGTCCTCAGACACCTCAAGAAGTTGAGGACATGAGACATATGCCCTATGCAGTAGGGTTGTCATATCTTAGGAGAACGAGGTTTTATATGCTC
GTGTATGGCGCTAAGGATTTGATCCTTAAAGGATACACTGACACAGATGTTTTAACTGATAAGGATTTGATGAAATCTACATCAGTGTCTGTCTTCACTCTTAATGGAGG
AGCAATAGCTGAATTTGGGTTCCCACAAGCCATATCTCAGCGACCCAAGAGAATAGTGGGTGATTCGAGTGGTGGTGTCCGTTGGAAGTCCCAAGAACGAGATCCTTGGA
GCATTGTGCAGTGTTCTTCGGAATCCGATGGCCAGTCAGAAAGGACCATCCAGACCTTAGAAGACATGTTGAGAGCATGTGTCCTTCAGTTTAAGGGAAGTTGGGATACC
CACTTATCACTTATGGAGTTTGCTTATAATAACAACTATCAGTTTAGTATCGGCATGACACCATTTGAGGCTTTGTATGGCAGACCATGCAGAACTCCTGTGTGCTGGAA
TAAGGGGGAGGAGGATCTATGGTTTTCTCTGGTTTTTCAAGCATTCTGGGGTGTACAAGTTGAATCAGAAGCTCAATTTGGAGATTTCTGGACAAATCTGTTGGGAACCT
CGTTTTCATTCCTTCAGGTAGAGATCGAGCTCCTGGTGCCTGATACACTGCCAACGTCTGCTGAAAGTTCCAGATCAAGCTCCAATACGTGGGTGAAGTTGTACATTGAG
TTCAGCGATTCAACAGGGTCAACAGGTATTGTTAGAGGAGGACGATGTCTGTTGGCTTCACGCCATCTCCCTATTAAGCTAGCAGAGGCTTCAGATAAGAAGAATATCTT
CCAAATTGAAGAGAGAGACCTTGTTGGTTACAACCGGTACAACAGCCGCCCAACCACCGCGCCGCCGCCTGGCCGTTCGTGTCGCCGGCTCCTCGCGCCGCGCCGCCGTT
CGTCTCCCTCTCTTTTCTTCATCTGCGCGCGTGTAGGTAGGTGTCACAGTCGTGGGTCTCTTCCTCTCTCAGTGCGTCGCTGCTGCACGTGGGTTGGCCTCCGTCGAGCT
CTTTCTCCCTCTCGCGTCCTGTTTTGCCCGAGCAGCAAGACCCACGCAAGAAGCCACGACCTTTTCGTTTCAGCAGCCTTGTACTCGCGTTTGGCATCTTCTCCGAGCGT
GTTTGCTCGCTACAAGCAGTCGTGGGTCATTCTCTCTCTCTCTCGCACGTGGTTGGTCAGTTTGAGCAGCAAGCCACACGACTCTAGTCCCTTTTTGTATCGTTTGGCCG
TTTCGCGCGTGTTCAAAGGAGTTTGGTGGTTGTCAAATTCCAGCAAGCTTTTGGACTCATTAGTGTGGTTCGCTTGGGATCAAGTAGAGCTGACTCGAGTTATTTTCCAG
CGAGGACTCATCCTGTCTGATGGAGCTTTAGAGGATTCGAACTTGTTCCGAGTTGTCGATCTTGCCGTCGAACCTGAATCTAAGTTTGGTGTTGGAGCTTTTCGAGTTTT
TAGGAGCTCTTTGTTTGGTTATATGAGCGCTCGTGAATGTCAGGTGCCTCGAGTAAAAATGGTCGAGGGGTGGTATGTCACCAATGGAGTAGGAGGAGCATCAGGTGCCT
TGGGTAATATGGCCAAGGGGCGATGTGATGAGTCTATATATTGTGAAAGTCATCGGCCAAGGGGCGATGCACAGTTCGAGGCCTTGGGTAAAAATGGTCAAGGGTCGAAT
GCCGAGCTCCGTAGAGAGCATTGTGATCCTGGGTACAAATGGTTAGGGGACAGTGCGGCTCGAAGGGTCAGTCTTGGAGAGCTTTATGAAGGCTATGTGAAATTGAAGTC
ATCGGGTGCCTTGGGTAATATGGCCAAGGGGCGATGCACAGTTCGAGGCCTTGGGACTCAAAGGAGTCGAATTGAGAGAACTCAGAGGAGTCAGATAGAGCTTGATAGGG
CTCGAGTTAGTGGTTATGGATTATGTGCCACTTACTTAGTACCGCTGTATTGTACTGATCCACCACCAGATTTTTTTCCAGGTTATGAGACCATATTTGGGCTTGGTGAT
GAGGAGGAGGCTTGGAAGAGGAGACCATTAGATAGTAGCTAG
mRNA sequence
ATGCATCAACAAAAGTTGTTGATAAGACTGGTTATTCAACAAGAGTTGTTGATAAGGCCAGTGCTTTTGGTTACAAGTGGATTTACAAGAGGAAACGAGACATTTAAGGT
TATCTTACTGACATTAAAAGAGGCTAGCAGCTCATTTCCAAATGAAAGTCTGGGAGAAGCTGAGTTAGTCTTGTCTCAATTGTCTTATGTCTACAAGATGAATGGAATTC
TGTTGTCTAAGGAACTGTGTCCTCAGACACCTCAAGAAGTTGAGGACATGAGACATATGCCCTATGCAGTAGGGTTGTCATATCTTAGGAGAACGAGGTTTTATATGCTC
GTGTATGGCGCTAAGGATTTGATCCTTAAAGGATACACTGACACAGATGTTTTAACTGATAAGGATTTGATGAAATCTACATCAGTGTCTGTCTTCACTCTTAATGGAGG
AGCAATAGCTGAATTTGGGTTCCCACAAGCCATATCTCAGCGACCCAAGAGAATAGTGGGTGATTCGAGTGGTGGTGTCCGTTGGAAGTCCCAAGAACGAGATCCTTGGA
GCATTGTGCAGTGTTCTTCGGAATCCGATGGCCAGTCAGAAAGGACCATCCAGACCTTAGAAGACATGTTGAGAGCATGTGTCCTTCAGTTTAAGGGAAGTTGGGATACC
CACTTATCACTTATGGAGTTTGCTTATAATAACAACTATCAGTTTAGTATCGGCATGACACCATTTGAGGCTTTGTATGGCAGACCATGCAGAACTCCTGTGTGCTGGAA
TAAGGGGGAGGAGGATCTATGGTTTTCTCTGGTTTTTCAAGCATTCTGGGGTGTACAAGTTGAATCAGAAGCTCAATTTGGAGATTTCTGGACAAATCTGTTGGGAACCT
CGTTTTCATTCCTTCAGGTAGAGATCGAGCTCCTGGTGCCTGATACACTGCCAACGTCTGCTGAAAGTTCCAGATCAAGCTCCAATACGTGGGTGAAGTTGTACATTGAG
TTCAGCGATTCAACAGGGTCAACAGGTATTGTTAGAGGAGGACGATGTCTGTTGGCTTCACGCCATCTCCCTATTAAGCTAGCAGAGGCTTCAGATAAGAAGAATATCTT
CCAAATTGAAGAGAGAGACCTTGTTGGTTACAACCGGTACAACAGCCGCCCAACCACCGCGCCGCCGCCTGGCCGTTCGTGTCGCCGGCTCCTCGCGCCGCGCCGCCGTT
CGTCTCCCTCTCTTTTCTTCATCTGCGCGCGTGTAGGTAGGTGTCACAGTCGTGGGTCTCTTCCTCTCTCAGTGCGTCGCTGCTGCACGTGGGTTGGCCTCCGTCGAGCT
CTTTCTCCCTCTCGCGTCCTGTTTTGCCCGAGCAGCAAGACCCACGCAAGAAGCCACGACCTTTTCGTTTCAGCAGCCTTGTACTCGCGTTTGGCATCTTCTCCGAGCGT
GTTTGCTCGCTACAAGCAGTCGTGGGTCATTCTCTCTCTCTCTCGCACGTGGTTGGTCAGTTTGAGCAGCAAGCCACACGACTCTAGTCCCTTTTTGTATCGTTTGGCCG
TTTCGCGCGTGTTCAAAGGAGTTTGGTGGTTGTCAAATTCCAGCAAGCTTTTGGACTCATTAGTGTGGTTCGCTTGGGATCAAGTAGAGCTGACTCGAGTTATTTTCCAG
CGAGGACTCATCCTGTCTGATGGAGCTTTAGAGGATTCGAACTTGTTCCGAGTTGTCGATCTTGCCGTCGAACCTGAATCTAAGTTTGGTGTTGGAGCTTTTCGAGTTTT
TAGGAGCTCTTTGTTTGGTTATATGAGCGCTCGTGAATGTCAGGTGCCTCGAGTAAAAATGGTCGAGGGGTGGTATGTCACCAATGGAGTAGGAGGAGCATCAGGTGCCT
TGGGTAATATGGCCAAGGGGCGATGTGATGAGTCTATATATTGTGAAAGTCATCGGCCAAGGGGCGATGCACAGTTCGAGGCCTTGGGTAAAAATGGTCAAGGGTCGAAT
GCCGAGCTCCGTAGAGAGCATTGTGATCCTGGGTACAAATGGTTAGGGGACAGTGCGGCTCGAAGGGTCAGTCTTGGAGAGCTTTATGAAGGCTATGTGAAATTGAAGTC
ATCGGGTGCCTTGGGTAATATGGCCAAGGGGCGATGCACAGTTCGAGGCCTTGGGACTCAAAGGAGTCGAATTGAGAGAACTCAGAGGAGTCAGATAGAGCTTGATAGGG
CTCGAGTTAGTGGTTATGGATTATGTGCCACTTACTTAGTACCGCTGTATTGTACTGATCCACCACCAGATTTTTTTCCAGGTTATGAGACCATATTTGGGCTTGGTGAT
GAGGAGGAGGCTTGGAAGAGGAGACCATTAGATAGTAGCTAG
Protein sequence
MHQQKLLIRLVIQQELLIRPVLLVTSGFTRGNETFKVILLTLKEASSSFPNESLGEAELVLSQLSYVYKMNGILLSKELCPQTPQEVEDMRHMPYAVGLSYLRRTRFYML
VYGAKDLILKGYTDTDVLTDKDLMKSTSVSVFTLNGGAIAEFGFPQAISQRPKRIVGDSSGGVRWKSQERDPWSIVQCSSESDGQSERTIQTLEDMLRACVLQFKGSWDT
HLSLMEFAYNNNYQFSIGMTPFEALYGRPCRTPVCWNKGEEDLWFSLVFQAFWGVQVESEAQFGDFWTNLLGTSFSFLQVEIELLVPDTLPTSAESSRSSSNTWVKLYIE
FSDSTGSTGIVRGGRCLLASRHLPIKLAEASDKKNIFQIEERDLVGYNRYNSRPTTAPPPGRSCRRLLAPRRRSSPSLFFICARVGRCHSRGSLPLSVRRCCTWVGLRRA
LSPSRVLFCPSSKTHARSHDLFVSAALYSRLASSPSVFARYKQSWVILSLSRTWLVSLSSKPHDSSPFLYRLAVSRVFKGVWWLSNSSKLLDSLVWFAWDQVELTRVIFQ
RGLILSDGALEDSNLFRVVDLAVEPESKFGVGAFRVFRSSLFGYMSARECQVPRVKMVEGWYVTNGVGGASGALGNMAKGRCDESIYCESHRPRGDAQFEALGKNGQGSN
AELRREHCDPGYKWLGDSAARRVSLGELYEGYVKLKSSGALGNMAKGRCTVRGLGTQRSRIERTQRSQIELDRARVSGYGLCATYLVPLYCTDPPPDFFPGYETIFGLGD
EEEAWKRRPLDSS
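
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame at its first base. The record does not state which table was used, and `translate` is a hypothetical helper. For brevity it translates only the first 60 and last 42 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 nt and last 42 nt of the CDS in this record.
cds_start = "ATGCATCAACAAAAGTTGTTGATAAGACTGGTTATTCAACAAGAGTTGTTGATAAGGCCA"
cds_end = "GAGGAGGAGGCTTGGAAGAGGAGACCATTAGATAGTAGCTAG"

print(translate(cds_start))  # MHQQKLLIRLVIQQELLIRP
print(translate(cds_end))    # EEEAWKRRPLDSS
```

Both fragments match the start and end of the protein sequence above. Passing the full CDS with line breaks removed applies the same check to the whole sequence.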