CuGenDBv2

Lag0042135 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0042135
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr13:37034122..37038637
RNA-Seq Expression: Lag0042135
Synteny: Lag0042135
Gene Ontology terms:
GO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
GO:0140097 - catalytic activity, acting on DNA (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR004808 - AP endonuclease 1
IPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
CAN75802.1 hypothetical protein VITISV_016976 [Vitis vinifera] | e value: 2.1e-30 | % identity: 30.74
Query:  ESMETKQAAIDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWV-SNVYCPNDYKERRFLWSELRSLSYYC
        E +ETK   +    ++SL S +  +W+ ++A G  G +L+ WD+  L V+E   G +S+S +   +   + W+ + VY P   ++R  LW EL ++    
Subjt:  ESMETKQAAIDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWV-SNVYCPNDYKERRFLWSELRSLSYYC

Query:  TDPWCTGGDFNITRWVHERFPVGR--QGMRRFNKFIEDLGLMEIHLSN----------------------------------------------------
         DPWC GGDFN+     ER   GR    MR F + ++DL L+++ +                                                      
Subjt:  TDPWCTGGDFNITRWVHERFPVGR--QGMRRFNKFIEDLGLMEIHLSN----------------------------------------------------

Query:  ----DLEKAFDRVDWDFLEYVLNLKGFCEKWIEWINGCVRDPKFSIFINGRPRGRVCASRGLRQGDPLSP
            D+EKA+D ++W FL  VL   GF  +W++W+  C+   KFSI ING P G    S+GLRQGDPLSP
Subjt:  ----DLEKAFDRVDWDFLEYVLNLKGFCEKWIEWINGCVRDPKFSIFINGRPRGRVCASRGLRQGDPLSP

KAA0063088.1 uncharacterized protein E6C27_scaffold623G00050 [Cucumis melo var. makuwa] | e value: 3.0e-37 | % identity: 59.17
Query:  ETKQAAIDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWVSNVYCPNDYKERRFLWSELRSLSYYCTDPW
        E+K+   D+ FIKSLWSSK+  W   E +G  GG+L +WD SKL V+E LKGGYSLS+  +T+CK+ CW++NVY PND+KERR +W EL SLS YCT  W
Subjt:  ETKQAAIDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWVSNVYCPNDYKERRFLWSELRSLSYYCTDPW

Query:  CTGGDFNITRWVHERFPVGR
        C  GD NI RW HERFP  R
Subjt:  CTGGDFNITRWVHERFPVGR

TYJ98683.1 hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa] | e value: 1.7e-40 | % identity: 57.97
Query:  IDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWVSNVYCPNDYKERRFLWSELRSLSYYCTDPWCTGGDF
        ID+  IKSLWSSK+I W  VE++G  GG+L MWD SK+ V+E LKGGYSLS+  +T CK+ CW++NVY P DY+ERRF+W  L SLS YCT  WC GG  
Subjt:  IDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWVSNVYCPNDYKERRFLWSELRSLSYYCTDPWCTGGDF

Query:  NITRWVHERFPVGRQ--GMRRFNKFIEDLGLMEIHLSN
        NITRW HE FP+ +Q  GMR+FN  I+ L + E+ L N
Subjt:  NITRWVHERFPVGRQ--GMRRFNKFIEDLGLMEIHLSN

TYK31266.1 hypothetical protein E5676_scaffold455G005560 [Cucumis melo var. makuwa] | e value: 4.4e-33 | % identity: 30.03
Query:  IDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWVSNVYCPNDYKERRFLWSELRSLSYYCTDPWCTGGDF
        ++ +FIKSLWSSK+I W FV + G  GG+L MWD SK+SV E +K  +SLS+KCL+LCK+V                        +  Y           
Subjt:  IDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWVSNVYCPNDYKERRFLWSELRSLSYYCTDPWCTGGDF

Query:  NITRWVHERFPVGR--QGMRRFNKFIEDLGLMEIHLSN--------------------------------------------------------------
            W HERFP+GR  +GM  FNKFI+ + LMEI L N                                                              
Subjt:  NITRWVHERFPVGR--QGMRRFNKFIEDLGLMEIHLSN--------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------DLEKAFDRVDWDF
                                                                                               DLEKAFDRVDW+F
Subjt:  ---------------------------------------------------------------------------------------DLEKAFDRVDWDF

Query:  LEYVLNLKGFCEKWIEWINGCVRDPKFSIFINGRPRGRVCASRGLRQGDPLSP
        LE  L+LK F  KWI WI GC+++PKFSIF+NG+PRGR+ ASRG+RQGDP SP
Subjt:  LEYVLNLKGFCEKWIEWINGCVRDPKFSIFINGRPRGRVCASRGLRQGDPLSP

XP_038876676.1 uncharacterized protein LOC120069076 [Benincasa hispida] | e value: 6.0e-38 | % identity: 56.25
Query:  ETKQAAIDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWVSNVYCPNDYKERRFLWSELRSLSYYCTDPW
        ETK+  I+ +FIKSLWSSKE+  +FVEA G+ GGLL +WD+SK+ V    K  +SLS+KC T+ K++CW++NVY P DY+ERR LW+EL SL+    DPW
Subjt:  ETKQAAIDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWVSNVYCPNDYKERRFLWSELRSLSYYCTDPW

Query:  CTGGDFNITRWVHERFPVGR--QGMRRFNKFIEDLGLMEIHLSN
        C GGDFN  R  HER+PVG+  + M  FNKFI    L+EI LSN
Subjt:  CTGGDFNITRWVHERFPVGR--QGMRRFNKFIEDLGLMEIHLSN

TrEMBL top hits (e value, % identity, alignment)
A0A438IMY2 Auxin transport protein BIG | e value: 3.8e-30 | % identity: 30.71
Query:  ETKQAAIDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWV-SNVYCPNDYKERRFLWSELRSLSYYCTDP
        ETK   +    ++SL S +  +W+ ++A G  G +L+ WD+  L V+E   G +S+S +   +   + W+ + VY P   ++R  LW EL ++     DP
Subjt:  ETKQAAIDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWV-SNVYCPNDYKERRFLWSELRSLSYYCTDP

Query:  WCTGGDFNITRWVHERFPVGR--QGMRRFNKFIEDLGLMEIHLSN-------------------------------------------------------
        WC GGDFN+     ER   GR    MR F + ++DL L+++ +                                                         
Subjt:  WCTGGDFNITRWVHERFPVGR--QGMRRFNKFIEDLGLMEIHLSN-------------------------------------------------------

Query:  -DLEKAFDRVDWDFLEYVLNLKGFCEKWIEWINGCVRDPKFSIFINGRPRGRVCASRGLRQGDPLSP
         D+EKA+D ++W FL  VL   GF  +W++W+  C+   KFSI ING P G    S+GLRQGDPLSP
Subjt:  -DLEKAFDRVDWDFLEYVLNLKGFCEKWIEWINGCVRDPKFSIFINGRPRGRVCASRGLRQGDPLSP

A0A5A7V639 Uncharacterized protein | e value: 1.4e-37 | % identity: 59.17
Query:  ETKQAAIDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWVSNVYCPNDYKERRFLWSELRSLSYYCTDPW
        E+K+   D+ FIKSLWSSK+  W   E +G  GG+L +WD SKL V+E LKGGYSLS+  +T+CK+ CW++NVY PND+KERR +W EL SLS YCT  W
Subjt:  ETKQAAIDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWVSNVYCPNDYKERRFLWSELRSLSYYCTDPW

Query:  CTGGDFNITRWVHERFPVGR
        C  GD NI RW HERFP  R
Subjt:  CTGGDFNITRWVHERFPVGR

A0A5D3BHE3 Uncharacterized protein | e value: 8.2e-41 | % identity: 57.97
Query:  IDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWVSNVYCPNDYKERRFLWSELRSLSYYCTDPWCTGGDF
        ID+  IKSLWSSK+I W  VE++G  GG+L MWD SK+ V+E LKGGYSLS+  +T CK+ CW++NVY P DY+ERRF+W  L SLS YCT  WC GG  
Subjt:  IDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWVSNVYCPNDYKERRFLWSELRSLSYYCTDPWCTGGDF

Query:  NITRWVHERFPVGRQ--GMRRFNKFIEDLGLMEIHLSN
        NITRW HE FP+ +Q  GMR+FN  I+ L + E+ L N
Subjt:  NITRWVHERFPVGRQ--GMRRFNKFIEDLGLMEIHLSN

A0A5D3E6J9 Reverse transcriptase domain-containing protein | e value: 2.2e-33 | % identity: 30.03
Query:  IDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWVSNVYCPNDYKERRFLWSELRSLSYYCTDPWCTGGDF
        ++ +FIKSLWSSK+I W FV + G  GG+L MWD SK+SV E +K  +SLS+KCL+LCK+V                        +  Y           
Subjt:  IDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWVSNVYCPNDYKERRFLWSELRSLSYYCTDPWCTGGDF

Query:  NITRWVHERFPVGR--QGMRRFNKFIEDLGLMEIHLSN--------------------------------------------------------------
            W HERFP+GR  +GM  FNKFI+ + LMEI L N                                                              
Subjt:  NITRWVHERFPVGR--QGMRRFNKFIEDLGLMEIHLSN--------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------DLEKAFDRVDWDF
                                                                                               DLEKAFDRVDW+F
Subjt:  ---------------------------------------------------------------------------------------DLEKAFDRVDWDF

Query:  LEYVLNLKGFCEKWIEWINGCVRDPKFSIFINGRPRGRVCASRGLRQGDPLSP
        LE  L+LK F  KWI WI GC+++PKFSIF+NG+PRGR+ ASRG+RQGDP SP
Subjt:  LEYVLNLKGFCEKWIEWINGCVRDPKFSIFINGRPRGRVCASRGLRQGDPLSP

A5ALD2 Uncharacterized protein | e value: 1.0e-30 | % identity: 30.74
Query:  ESMETKQAAIDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWV-SNVYCPNDYKERRFLWSELRSLSYYC
        E +ETK   +    ++SL S +  +W+ ++A G  G +L+ WD+  L V+E   G +S+S +   +   + W+ + VY P   ++R  LW EL ++    
Subjt:  ESMETKQAAIDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWV-SNVYCPNDYKERRFLWSELRSLSYYC

Query:  TDPWCTGGDFNITRWVHERFPVGR--QGMRRFNKFIEDLGLMEIHLSN----------------------------------------------------
         DPWC GGDFN+     ER   GR    MR F + ++DL L+++ +                                                      
Subjt:  TDPWCTGGDFNITRWVHERFPVGR--QGMRRFNKFIEDLGLMEIHLSN----------------------------------------------------

Query:  ----DLEKAFDRVDWDFLEYVLNLKGFCEKWIEWINGCVRDPKFSIFINGRPRGRVCASRGLRQGDPLSP
            D+EKA+D ++W FL  VL   GF  +W++W+  C+   KFSI ING P G    S+GLRQGDPLSP
Subjt:  ----DLEKAFDRVDWDFLEYVLNLKGFCEKWIEWINGCVRDPKFSIFINGRPRGRVCASRGLRQGDPLSP

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGTAAGAGATTTTTTCCTTAAATATGGAAAACCAGAGTTTTCTGAGTTGGATTTATTCGATCGAACTGAGAAAGTTGATACGACTCAGGAAGATGTGGTGAAATCCTC
TGAGATTTCCTCCTTTTGGTGGAGTCTTGTTGGTGACCTTCATTTGAAAATTGAGCGTTGGTCTGATGAAAAACATTCTCATATGGAATTCATTAAGAGTTATGGTGAAA
ATTATTTTCGGACTGGAATTTCAATTAATCCTTTTATGGATGATAAAGCGTTGATTAAGTTTGATGTGGGTCTTTCTGATTTGAATTTTGATGGAAATTGGAGTCTTGTT
GGTGACCTTCATTTGAAAATTGAGCGTTGGTCTGATGAAAAGCATTCTCATATGGAATTCATTAAGAGTTATGGAGTTGAGATTTTGCAAAATCCTGTTTCTTCTTTCTT
TCATGAGAAAATCAAGGAAGAATTTGGAGTCATTAGGTTGATTAAGTTATTCTCGGATAATGAATGGTTCTTTGAATGTGTTGTTTGGCCTTCCACGGGTGGAAGAAGGA
TTATTCAAGTTCCTGCAGGCTTGAATAAGAAAGGATGGTATGTCTTTTGGGAAATGATTAGTGATTTCATTCTTAAAATTCATTCTTATGAGAATCAACCTATTCGGTCA
TTGTTAAGCAAAGAGGAGTGTCTTCCGGTTTTTGATAAAGTTTCAGCAGGTCAAGCCTTTCCCAATTCATATGCTGAGGTGGGTATTAATGAAGAAGCTTACTGGGTTCG
CAAGAATTGTGATGTGCTGGAAATAGATTTGGAAAGATCAATTGTTGTTTCTAGATTGATGGCCCATTATTCTTGGAAGGATGTTAAGATTGCCCTTGAGAATTTCTTTA
AATCTTCTGTCTTGGTTAATCCCTTCATGGATGATAAAGCTTTGATTCATGCAGCAGATGGTGGCTTGGAATTTTCTGCAAATGGCAAGTGGAAGAAATTTGGAAACTTA
CATTTGAAATTGGAATTTTGGTCCTATGAAATTCATTCCCAGTCGAAGTCTATAAAAAGTTATGGAGGTTGGCTTGCAATTAGAAATCTTCCATTGAACTTATGGCATCG
TGACTCCTTTGAAGCTATTGGAATGAACCTTGGAGGGTTGGTTATTATTTCTTCCAATACGCTTAATTTGTTAGATTGTTCTGAAGCCTTCATTGAAGTAGAAAAGAATT
TTTGTGGATTTATTCTTGCTGATATTAATGTTAAGATTGATGCCAATGACTTTTCAAATTCCCTGCAGGTGATTTTGGATGAAGAATCTGATATTGTTAATAAAGAGAAT
AGGATGAGTGAGCTGCCAGCTATCTCTCGGTTTCAAGAGGCATTTAATGAGGATTTGGATATTTCAAAGGATGTCTCGACACAAGATGAATGTATTAAATACAGTGGATG
TATTATTCCTTCAACCAAGTTGATTAATGATGATAGTTGTTTTTTGAATAATGAAGATTTGAATGGGGATTTGGTTCTTTCAAAGGATGCCTCGGTGCAAGATGTAGGTA
TTAATTGCAGTGGTTGCTTTATTCCTTCAACCAAGATGATTAATGATGATAGCTGTTTTTTGACTAATGAAGTGCAACAGATTTTAAAAGAGAGAGGCCAAGTTAATGAG
ATGTTGGGTTCTCCAAAAGGTGCTTCATTGCATGACAAAAGTATTAATAATGCTGGTTGTAAAGGTTTTAATGCCAGCATTAATGAGCCGACATTAGCTCTCTCTCCTTC
ATTAAATGACAATGAATTTAATGAGTACAGTCCTCAGGAAGCCCAACAGTTTCAGACCAACTTTAATGCTGATCATTTGGAGTCTGAATGTACTCATTTAATTGCTGGAA
ATAAGGCTTCGGGATCTGTTATAATCAGTGCTGGAAACGGGTTGATTCAGGCCAAGGTATTTAAGGAATCTTCTATTCAAATTCCAGGGGGAAGTGATGTTTTTGTCAGA
GGTATTGGTAGTTCCTTCAATCATAGTATTCATTCCCCGGTGGATTCAGATGACGAGTCTATGGAGACTAAACAAGCAGCAATTGATTTGAATTTCATTAAATCCTTATG
GAGTTCCAAGGAAATCGATTGGTCGTTTGTGGAAGCTTATGGAGAACCAGGTGGACTTCTTATTATGTGGGATGAAAGTAAATTGTCAGTGCTGGAATTTTTAAAGGGTG
GTTATTCTCTTTCAGTCAAATGTCTCACTCTTTGTAAAAGAGTTTGTTGGGTATCAAATGTTTATTGTCCAAATGACTACAAAGAAAGGAGATTCCTTTGGTCTGAATTA
CGCTCTCTCTCTTATTATTGCACGGATCCTTGGTGTACTGGTGGCGACTTTAATATTACTCGATGGGTTCATGAACGATTTCCAGTAGGAAGGCAAGGGATGCGTAGATT
TAATAAATTCATTGAAGACTTGGGTCTTATGGAAATTCATTTATCAAATGATCTTGAGAAAGCCTTTGATAGGGTCGATTGGGATTTCCTTGAGTATGTTCTCAACTTGA
AAGGATTTTGTGAAAAATGGATTGAATGGATAAATGGATGTGTTAGGGATCCAAAATTTTCCATTTTCATTAATGGTCGGCCAAGAGGGAGAGTTTGTGCTTCTAGAGGT
CTTAGGCAAGGAGATCCTCTTTCTCCTTCCTCTCTCTTTTAG
mRNA sequence
ATGGTAAGAGATTTTTTCCTTAAATATGGAAAACCAGAGTTTTCTGAGTTGGATTTATTCGATCGAACTGAGAAAGTTGATACGACTCAGGAAGATGTGGTGAAATCCTC
TGAGATTTCCTCCTTTTGGTGGAGTCTTGTTGGTGACCTTCATTTGAAAATTGAGCGTTGGTCTGATGAAAAACATTCTCATATGGAATTCATTAAGAGTTATGGTGAAA
ATTATTTTCGGACTGGAATTTCAATTAATCCTTTTATGGATGATAAAGCGTTGATTAAGTTTGATGTGGGTCTTTCTGATTTGAATTTTGATGGAAATTGGAGTCTTGTT
GGTGACCTTCATTTGAAAATTGAGCGTTGGTCTGATGAAAAGCATTCTCATATGGAATTCATTAAGAGTTATGGAGTTGAGATTTTGCAAAATCCTGTTTCTTCTTTCTT
TCATGAGAAAATCAAGGAAGAATTTGGAGTCATTAGGTTGATTAAGTTATTCTCGGATAATGAATGGTTCTTTGAATGTGTTGTTTGGCCTTCCACGGGTGGAAGAAGGA
TTATTCAAGTTCCTGCAGGCTTGAATAAGAAAGGATGGTATGTCTTTTGGGAAATGATTAGTGATTTCATTCTTAAAATTCATTCTTATGAGAATCAACCTATTCGGTCA
TTGTTAAGCAAAGAGGAGTGTCTTCCGGTTTTTGATAAAGTTTCAGCAGGTCAAGCCTTTCCCAATTCATATGCTGAGGTGGGTATTAATGAAGAAGCTTACTGGGTTCG
CAAGAATTGTGATGTGCTGGAAATAGATTTGGAAAGATCAATTGTTGTTTCTAGATTGATGGCCCATTATTCTTGGAAGGATGTTAAGATTGCCCTTGAGAATTTCTTTA
AATCTTCTGTCTTGGTTAATCCCTTCATGGATGATAAAGCTTTGATTCATGCAGCAGATGGTGGCTTGGAATTTTCTGCAAATGGCAAGTGGAAGAAATTTGGAAACTTA
CATTTGAAATTGGAATTTTGGTCCTATGAAATTCATTCCCAGTCGAAGTCTATAAAAAGTTATGGAGGTTGGCTTGCAATTAGAAATCTTCCATTGAACTTATGGCATCG
TGACTCCTTTGAAGCTATTGGAATGAACCTTGGAGGGTTGGTTATTATTTCTTCCAATACGCTTAATTTGTTAGATTGTTCTGAAGCCTTCATTGAAGTAGAAAAGAATT
TTTGTGGATTTATTCTTGCTGATATTAATGTTAAGATTGATGCCAATGACTTTTCAAATTCCCTGCAGGTGATTTTGGATGAAGAATCTGATATTGTTAATAAAGAGAAT
AGGATGAGTGAGCTGCCAGCTATCTCTCGGTTTCAAGAGGCATTTAATGAGGATTTGGATATTTCAAAGGATGTCTCGACACAAGATGAATGTATTAAATACAGTGGATG
TATTATTCCTTCAACCAAGTTGATTAATGATGATAGTTGTTTTTTGAATAATGAAGATTTGAATGGGGATTTGGTTCTTTCAAAGGATGCCTCGGTGCAAGATGTAGGTA
TTAATTGCAGTGGTTGCTTTATTCCTTCAACCAAGATGATTAATGATGATAGCTGTTTTTTGACTAATGAAGTGCAACAGATTTTAAAAGAGAGAGGCCAAGTTAATGAG
ATGTTGGGTTCTCCAAAAGGTGCTTCATTGCATGACAAAAGTATTAATAATGCTGGTTGTAAAGGTTTTAATGCCAGCATTAATGAGCCGACATTAGCTCTCTCTCCTTC
ATTAAATGACAATGAATTTAATGAGTACAGTCCTCAGGAAGCCCAACAGTTTCAGACCAACTTTAATGCTGATCATTTGGAGTCTGAATGTACTCATTTAATTGCTGGAA
ATAAGGCTTCGGGATCTGTTATAATCAGTGCTGGAAACGGGTTGATTCAGGCCAAGGTATTTAAGGAATCTTCTATTCAAATTCCAGGGGGAAGTGATGTTTTTGTCAGA
GGTATTGGTAGTTCCTTCAATCATAGTATTCATTCCCCGGTGGATTCAGATGACGAGTCTATGGAGACTAAACAAGCAGCAATTGATTTGAATTTCATTAAATCCTTATG
GAGTTCCAAGGAAATCGATTGGTCGTTTGTGGAAGCTTATGGAGAACCAGGTGGACTTCTTATTATGTGGGATGAAAGTAAATTGTCAGTGCTGGAATTTTTAAAGGGTG
GTTATTCTCTTTCAGTCAAATGTCTCACTCTTTGTAAAAGAGTTTGTTGGGTATCAAATGTTTATTGTCCAAATGACTACAAAGAAAGGAGATTCCTTTGGTCTGAATTA
CGCTCTCTCTCTTATTATTGCACGGATCCTTGGTGTACTGGTGGCGACTTTAATATTACTCGATGGGTTCATGAACGATTTCCAGTAGGAAGGCAAGGGATGCGTAGATT
TAATAAATTCATTGAAGACTTGGGTCTTATGGAAATTCATTTATCAAATGATCTTGAGAAAGCCTTTGATAGGGTCGATTGGGATTTCCTTGAGTATGTTCTCAACTTGA
AAGGATTTTGTGAAAAATGGATTGAATGGATAAATGGATGTGTTAGGGATCCAAAATTTTCCATTTTCATTAATGGTCGGCCAAGAGGGAGAGTTTGTGCTTCTAGAGGT
CTTAGGCAAGGAGATCCTCTTTCTCCTTCCTCTCTCTTTTAG
Protein sequence
MVRDFFLKYGKPEFSELDLFDRTEKVDTTQEDVVKSSEISSFWWSLVGDLHLKIERWSDEKHSHMEFIKSYGENYFRTGISINPFMDDKALIKFDVGLSDLNFDGNWSLV
GDLHLKIERWSDEKHSHMEFIKSYGVEILQNPVSSFFHEKIKEEFGVIRLIKLFSDNEWFFECVVWPSTGGRRIIQVPAGLNKKGWYVFWEMISDFILKIHSYENQPIRS
LLSKEECLPVFDKVSAGQAFPNSYAEVGINEEAYWVRKNCDVLEIDLERSIVVSRLMAHYSWKDVKIALENFFKSSVLVNPFMDDKALIHAADGGLEFSANGKWKKFGNL
HLKLEFWSYEIHSQSKSIKSYGGWLAIRNLPLNLWHRDSFEAIGMNLGGLVIISSNTLNLLDCSEAFIEVEKNFCGFILADINVKIDANDFSNSLQVILDEESDIVNKEN
RMSELPAISRFQEAFNEDLDISKDVSTQDECIKYSGCIIPSTKLINDDSCFLNNEDLNGDLVLSKDASVQDVGINCSGCFIPSTKMINDDSCFLTNEVQQILKERGQVNE
MLGSPKGASLHDKSINNAGCKGFNASINEPTLALSPSLNDNEFNEYSPQEAQQFQTNFNADHLESECTHLIAGNKASGSVIISAGNGLIQAKVFKESSIQIPGGSDVFVR
GIGSSFNHSIHSPVDSDDESMETKQAAIDLNFIKSLWSSKEIDWSFVEAYGEPGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKRVCWVSNVYCPNDYKERRFLWSEL
RSLSYYCTDPWCTGGDFNITRWVHERFPVGRQGMRRFNKFIEDLGLMEIHLSNDLEKAFDRVDWDFLEYVLNLKGFCEKWIEWINGCVRDPKFSIFINGRPRGRVCASRG
LRQGDPLSPSSLF