CuGenDBv2

Lag0006225 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0006225
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr6:39618550..39621866
RNA-Seq Expression: Lag0006225
Synteny: Lag0006225
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica] (e value: 5.5e-48; % identity: 39.11)
Query:  YLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKD
        YLGL  +  +G+ + F+ L DK+W  + GWK +  S+ GKE+LIK+++QAIPTY+MSCFRIPKG+  +++ + A+FWW    DKR +HW +WE LCK K 
Subjt:  YLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKD

Query:  IGGLNFRDLVNFNQALLAKQA-------------------------------------WKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPIPTTFKVV
         GGL FRDL  FNQALLAKQ                                      W+   WG +LL  GLR  +G+G SI ++ D WLP P+ FK++
Subjt:  IGGLNFRDLVNFNQALLAKQA-------------------------------------WKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPIPTTFKVV

Query:  SPP-------------SSEVENVAI-----------------LISSSAPDGWIWHYDARGEYNVKSGYKIS
        SPP             SS   NV +                 L S +  D  IWHY+  G Y+VKSGY+++
Subjt:  SPP-------------SSEVENVAI-----------------LISSSAPDGWIWHYDARGEYNVKSGYKIS

ONI09819.1 hypothetical protein PRUPE_4G011200 [Prunus persica] (e value: 2.7e-47; % identity: 38.1)
Query:  YLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKD
        YLGL  +  +G+ + F+ L DK+W  + GWK +  S+ GKE+LIK+++QAIPTY+MSCF+IPKG+  +++ + A+FWW    DKR +HW +WE LCK K 
Subjt:  YLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKD

Query:  IGGLNFRDLVNFNQALLAKQAWK-------------------------------------GFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPIPTTFKVV
         GGL FRDL  FNQALLAKQ W+                                        WG +LL  G+R  +G+G SI ++ D WLP P+ FK++
Subjt:  IGGLNFRDLVNFNQALLAKQAWK-------------------------------------GFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPIPTTFKVV

Query:  SPP-------------SSEVENVAI-----------------LISSSAPDGWIWHYDARGEYNVKSGYKISMM
        SPP             SS   NV +                 L S +  D  IWHY+  G Y+VKSGY+++ +
Subjt:  SPP-------------SSEVENVAI-----------------LISSSAPDGWIWHYDARGEYNVKSGYKISMM

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis] (e value: 3.2e-48; % identity: 38.83)
Query:  YLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKD
        YLGL  +  +G+ + F+ L DK+W  + GWK +  S+ GKE+L+K+++QAIPTY+MSCFRIPKG+  +++ + A+FWW    DKR +HW +WE LCK K 
Subjt:  YLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKD

Query:  IGGLNFRDLVNFNQALLAKQA-------------------------------------WKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPIPTTFKVV
         GGL FRDL  FNQALLAKQ                                      W+   WG +LL  GLR  +GNG SI ++ D WLP P+ FK++
Subjt:  IGGLNFRDLVNFNQALLAKQA-------------------------------------WKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPIPTTFKVV

Query:  SPP-------------SSEVENVAI-----------------LISSSAPDGWIWHYDARGEYNVKSGYKISMM
        SPP             SS   NV +                 L S +  D  IWHY+  G Y+VKSGY+++ +
Subjt:  SPP-------------SSEVENVAI-----------------LISSSAPDGWIWHYDARGEYNVKSGYKISMM

XP_022131662.1 uncharacterized protein LOC111004787 [Momordica charantia] (e value: 4.7e-47; % identity: 42.08)
Query:  LQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKDIGGLNFRDLVNFNQALLAKQA-----
        +QGWK  FFS  GKEVLIKS+ QAIP YAMS FR+PKG   ++S + A+FWWGS  D +K+HW  WE +C PK++GGLNFRDL  FNQAL+AKQ      
Subjt:  LQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKDIGGLNFRDLVNFNQALLAKQA-----

Query:  --------------------------------WKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPIPTTFKVVSPPSSEVE-NVAILISS---------
                                        WKGF+WG DLL  GLR  +GNG +I +F+DPW+P P +F+ ++ P    +  VA LI+          
Subjt:  --------------------------------WKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPIPTTFKVVSPPSSEVE-NVAILISS---------

Query:  --------------------SAPDGWIWHYDARGEYNVKS
                            ++ D WIWH+D RG YNVKS
Subjt:  --------------------SAPDGWIWHYDARGEYNVKS

XP_030505522.1 uncharacterized protein LOC115720515 [Cannabis sativa] (e value: 4.7e-47; % identity: 35.15)
Query:  MSASESLGFYLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKR
        M+ +E +  YLGL  V  + K   F+ + DK+WS L  W ++ FSQ GKE+L+K+++Q++P Y MSCF IP+G   +I  L A++WWGS   KRK+HW+ 
Subjt:  MSASESLGFYLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKR

Query:  WEGLCKPKDIGGLNFRDLVNFNQALLAKQA-------------------------------------WKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWL
        W+ LC  K  GGL FR  V +NQALLAKQA                                     W+G VWG +LL  GLR+ +GNG +  +F DPW+
Subjt:  WEGLCKPKDIGGLNFRDLVNFNQALLAKQA-------------------------------------WKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWL

Query:  PIPTTFKVVSPPSSEVENVAILISSSA-----------------------------PDGWIWHYDARGEYNVKSGYKISM--MNCQTASMSEI
        P P +F  ++     +  V+ LI  S                               D W+WH+ + G Y+VKSGY +++   N Q +S +E+
Subjt:  PIPTTFKVVSPPSSEVENVAILISSSA-----------------------------PDGWIWHYDARGEYNVKSGYKISM--MNCQTASMSEI

TrEMBL top hits (e value, % identity, alignment)
A0A251NPF0 Reverse transcriptase domain-containing protein (e value: 2.7e-48; % identity: 39.11)
Query:  YLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKD
        YLGL  +  +G+ + F+ L DK+W  + GWK +  S+ GKE+LIK+++QAIPTY+MSCFRIPKG+  +++ + A+FWW    DKR +HW +WE LCK K 
Subjt:  YLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKD

Query:  IGGLNFRDLVNFNQALLAKQA-------------------------------------WKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPIPTTFKVV
         GGL FRDL  FNQALLAKQ                                      W+   WG +LL  GLR  +G+G SI ++ D WLP P+ FK++
Subjt:  IGGLNFRDLVNFNQALLAKQA-------------------------------------WKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPIPTTFKVV

Query:  SPP-------------SSEVENVAI-----------------LISSSAPDGWIWHYDARGEYNVKSGYKIS
        SPP             SS   NV +                 L S +  D  IWHY+  G Y+VKSGY+++
Subjt:  SPP-------------SSEVENVAI-----------------LISSSAPDGWIWHYDARGEYNVKSGYKIS

A0A5E4FZN9 PREDICTED: retrotransposon (e value: 1.6e-48; % identity: 38.83)
Query:  YLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKD
        YLGL  +  +G+ + F+ L DK+W  + GWK +  S+ GKE+L+K+++QAIPTY+MSCFRIPKG+  +++ + A+FWW    DKR +HW +WE LCK K 
Subjt:  YLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKD

Query:  IGGLNFRDLVNFNQALLAKQA-------------------------------------WKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPIPTTFKVV
         GGL FRDL  FNQALLAKQ                                      W+   WG +LL  GLR  +GNG SI ++ D WLP P+ FK++
Subjt:  IGGLNFRDLVNFNQALLAKQA-------------------------------------WKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPIPTTFKVV

Query:  SPP-------------SSEVENVAI-----------------LISSSAPDGWIWHYDARGEYNVKSGYKISMM
        SPP             SS   NV +                 L S +  D  IWHY+  G Y+VKSGY+++ +
Subjt:  SPP-------------SSEVENVAI-----------------LISSSAPDGWIWHYDARGEYNVKSGYKISMM

A0A803NXV2 Uncharacterized protein (e value: 6.0e-48; % identity: 36.53)
Query:  YLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKD
        YLGL  V  + K R F  + DKV   ++GWK  FFS  G+E+LIK+I+QA+P Y MS F++P     K+ ++  +FWWG++ DKRK+ W +WE +C+PK 
Subjt:  YLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKD

Query:  IGGLNFRDLVNFNQALLAKQAWK-------------------------------------GFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPIPTTFKVV
         GGL F+DLV FNQA++AKQ W+                                       +WG+ LL +GLRK +G+G+S+  F DPW+P P +F+ +
Subjt:  IGGLNFRDLVNFNQALLAKQAWK-------------------------------------GFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPIPTTFKVV

Query:  SPPSSEVENVAILISS-----------------------------SAPDGWIWHYDARGEYNVKSGYKISM
        SP   +   V+ LI S                                DGW WHY++ G Y+VKSGYK+++
Subjt:  SPPSSEVENVAILISS-----------------------------SAPDGWIWHYDARGEYNVKSGYKISM

A0A803P9R9 Uncharacterized protein (e value: 4.6e-48; % identity: 38.16)
Query:  SESLGFYLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEG
        +E+   YLGL     + K   F  + +KVW  LQGWK   FSQ G+EVLIKSIIQAIP Y MSCFRI KG+LS+I AL A+FWWGS   K ++HW  WE 
Subjt:  SESLGFYLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEG

Query:  LCKPKDIGGLNFRDLVNFNQALLAKQ----------AWKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPIPTTF----KVVSPPSSEV----------
        LCK K  GG+ FR+L +FNQALLAKQ           W+G +WG +LL  G+R  + N   + +  D W+P    F    KV  PP + +          
Subjt:  LCKPKDIGGLNFRDLVNFNQALLAKQ----------AWKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPIPTTF----KVVSPPSSEV----------

Query:  -----------ENVAILISSSA-----PDGWIWHYDARGEYNVKSGYKISMMNC------QTASMSEINLRNHHVPINGSCPVCHEEMETTDHAFFSAQG
                   E++  ++   A      D  +WH    GEY V SGY   M+NC      +T++     L    + I  +C  C  + ET  HA ++   
Subjt:  -----------ENVAILISSSA-----PDGWIWHYDARGEYNVKSGYKISMMNC------QTASMSEINLRNHHVPINGSCPVCHEEMETTDHAFFSAQG

Query:  LGIL
        L ++
Subjt:  LGIL

M5W5F3 Reverse transcriptase domain-containing protein (Fragment) (e value: 2.7e-48; % identity: 39.11)
Query:  YLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKD
        YLGL  +  +G+ + F+ L DK+W  + GWK +  S+ GKE+LIK+++QAIPTY+MSCFRIPKG+  +++ + A+FWW    DKR +HW +WE LCK K 
Subjt:  YLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKD

Query:  IGGLNFRDLVNFNQALLAKQA-------------------------------------WKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPIPTTFKVV
         GGL FRDL  FNQALLAKQ                                      W+   WG +LL  GLR  +G+G SI ++ D WLP P+ FK++
Subjt:  IGGLNFRDLVNFNQALLAKQA-------------------------------------WKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPIPTTFKVV

Query:  SPP-------------SSEVENVAI-----------------LISSSAPDGWIWHYDARGEYNVKSGYKIS
        SPP             SS   NV +                 L S +  D  IWHY+  G Y+VKSGY+++
Subjt:  SPP-------------SSEVENVAI-----------------LISSSAPDGWIWHYDARGEYNVKSGYKIS

SwissProt top hits (e value, % identity, alignment)
P93295 Uncharacterized mitochondrial protein AtMg00310 (e value: 7.3e-19; % identity: 36)
Query:  AIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPK-DIGGLNFRDLVNFNQALLAKQ----------------------------
        A+P YAMSCFR+ K +  K+++   +FWW S  +KRK+ W  W+ LCK K D GGL FRDL  FNQALLAKQ                            
Subjt:  AIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPK-DIGGLNFRDLVNFNQALLAKQ----------------------------

Query:  ---------AWKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWL----PIP
                 AW+  + G +LL  GL + +G+G    ++ D W+    P+P
Subjt:  ---------AWKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWL----PIP

Arabidopsis top hits (e value, % identity, alignment)
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 1.5e-14; % identity: 26.67)
Query:  YLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKD
        YLGL  +  +  + D+  L++K+   +  W ++  S  G+  LI S+I ++  + MS FR+P   + +I ++C+ F W       K     W  +C PKD
Subjt:  YLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKD

Query:  IGGLNFRDLVNFNQ---------ALLAKQAWKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPI
         GGL  R L   N+           L    WK  +    L    ++ ++ NG +   + D W  I
Subjt:  IGGLNFRDLVNFNQ---------ALLAKQAWKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPI

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 1.8e-20; % identity: 26.32)
Query:  AIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKDIGGLNFRDLVNFNQALLAKQAWK--------------------------
        A+PTY M+CF +PK +  +I ++ A FWW +  + + MHWK W+ L   K  GG+ F+D+  FN ALL KQ W+                          
Subjt:  AIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKDIGGLNFRDLVNFNQALLAKQAWK--------------------------

Query:  ------GFVW-----GMDLLKVGLRKNLGNGRSIFMFNDPWL---PIPTTFKVVSPPSSEVENVAILISSS-----------------------------
               FVW       ++L+ G R  +GNG  I ++   WL   P     ++   P  E  +V+ ++  S                             
Subjt:  ------GFVW-----GMDLLKVGLRKNLGNGRSIFMFNDPWL---PIPTTFKVVSPPSSEVENVAILISSS-----------------------------

Query:  -------APDGWIWHYDARGEYNVKSGY
                 D + W Y + G+Y VKSGY
Subjt:  -------APDGWIWHYDARGEYNVKSGY

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 5.2e-20; % identity: 36)
Query:  AIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPK-DIGGLNFRDLVNFNQALLAKQ----------------------------
        A+P YAMSCFR+ K +  K+++   +FWW S  +KRK+ W  W+ LCK K D GGL FRDL  FNQALLAKQ                            
Subjt:  AIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPK-DIGGLNFRDLVNFNQALLAKQ----------------------------

Query:  ---------AWKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWL----PIP
                 AW+  + G +LL  GL + +G+G    ++ D W+    P+P
Subjt:  ---------AWKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWL----PIP


Sequences
CDS sequence
ATGAGTGCGTCTGAGTCTCTGGGATTTTATCTTGGCTTGTCGTTCGTCTTTCATCAGGGTAAATCCCGTGACTTTAAATTTTTACTAGATAAAGTTTGGTCTTTATTACA
AGGATGGAAGAGTCAATTCTTTTCTCAAAGGGGCAAAGAGGTGTTGATCAAAAGCATTATTCAGGCTATCCCTACCTATGCAATGAGTTGTTTTCGGATTCCCAAAGGCA
TATTATCCAAAATATCGGCACTATGTGCTAAATTCTGGTGGGGTTCGCATGGGGATAAGCGTAAAATGCATTGGAAAAGATGGGAGGGCCTGTGTAAGCCAAAGGATATT
GGTGGTTTAAATTTTCGAGATCTAGTAAATTTCAACCAGGCACTACTGGCAAAACAGGCTTGGAAGGGCTTTGTGTGGGGAATGGACCTCCTGAAGGTTGGTTTAAGGAA
AAATCTAGGCAACGGACGATCAATTTTTATGTTCAACGATCCATGGCTTCCCATACCTACTACTTTTAAGGTGGTTTCTCCACCCTCTTCTGAGGTTGAGAATGTTGCTA
TTCTGATTAGTAGTTCTGCTCCTGATGGATGGATATGGCACTATGATGCTAGAGGAGAGTATAACGTCAAGAGTGGCTACAAGATTAGTATGATGAATTGTCAAACGGCT
TCTATGTCAGAAATTAACCTCCGAAATCACCATGTGCCGATTAATGGGTCCTGCCCAGTATGTCACGAAGAAATGGAGACTACAGATCATGCCTTTTTCAGTGCTCAAGG
GCTAGGGATTTTGGAGAGAACAAGTGGCATTGGGTTAGTCATTCGAGATAAATCTGGGAATCTACAGGCAACTCAGAGTTTATGTTCTCCGGTTTGTTCCTTCCCGCTAG
GAGCAGAAGCGATAGTAGTGCTTGAAAGTCTTCGTCTGGCAAAAATCCTTGAATGTGCATCATTTAACGAAACCAAAGCTACCGCCGCCACCGCTCGCCGCTGCACCGTC
GCCGCCGCCACCGTGCCGTCGCTTTCCGTTGATTCCAACAAAGCTCCTAAGTGA
mRNA sequence
ATGAGTGCGTCTGAGTCTCTGGGATTTTATCTTGGCTTGTCGTTCGTCTTTCATCAGGGTAAATCCCGTGACTTTAAATTTTTACTAGATAAAGTTTGGTCTTTATTACA
AGGATGGAAGAGTCAATTCTTTTCTCAAAGGGGCAAAGAGGTGTTGATCAAAAGCATTATTCAGGCTATCCCTACCTATGCAATGAGTTGTTTTCGGATTCCCAAAGGCA
TATTATCCAAAATATCGGCACTATGTGCTAAATTCTGGTGGGGTTCGCATGGGGATAAGCGTAAAATGCATTGGAAAAGATGGGAGGGCCTGTGTAAGCCAAAGGATATT
GGTGGTTTAAATTTTCGAGATCTAGTAAATTTCAACCAGGCACTACTGGCAAAACAGGCTTGGAAGGGCTTTGTGTGGGGAATGGACCTCCTGAAGGTTGGTTTAAGGAA
AAATCTAGGCAACGGACGATCAATTTTTATGTTCAACGATCCATGGCTTCCCATACCTACTACTTTTAAGGTGGTTTCTCCACCCTCTTCTGAGGTTGAGAATGTTGCTA
TTCTGATTAGTAGTTCTGCTCCTGATGGATGGATATGGCACTATGATGCTAGAGGAGAGTATAACGTCAAGAGTGGCTACAAGATTAGTATGATGAATTGTCAAACGGCT
TCTATGTCAGAAATTAACCTCCGAAATCACCATGTGCCGATTAATGGGTCCTGCCCAGTATGTCACGAAGAAATGGAGACTACAGATCATGCCTTTTTCAGTGCTCAAGG
GCTAGGGATTTTGGAGAGAACAAGTGGCATTGGGTTAGTCATTCGAGATAAATCTGGGAATCTACAGGCAACTCAGAGTTTATGTTCTCCGGTTTGTTCCTTCCCGCTAG
GAGCAGAAGCGATAGTAGTGCTTGAAAGTCTTCGTCTGGCAAAAATCCTTGAATGTGCATCATTTAACGAAACCAAAGCTACCGCCGCCACCGCTCGCCGCTGCACCGTC
GCCGCCGCCACCGTGCCGTCGCTTTCCGTTGATTCCAACAAAGCTCCTAAGTGA
Protein sequence
MSASESLGFYLGLSFVFHQGKSRDFKFLLDKVWSLLQGWKSQFFSQRGKEVLIKSIIQAIPTYAMSCFRIPKGILSKISALCAKFWWGSHGDKRKMHWKRWEGLCKPKDI
GGLNFRDLVNFNQALLAKQAWKGFVWGMDLLKVGLRKNLGNGRSIFMFNDPWLPIPTTFKVVSPPSSEVENVAILISSSAPDGWIWHYDARGEYNVKSGYKISMMNCQTA
SMSEINLRNHHVPINGSCPVCHEEMETTDHAFFSAQGLGILERTSGIGLVIRDKSGNLQATQSLCSPVCSFPLGAEAIVVLESLRLAKILECASFNETKATAATARRCTV
AAATVPSLSVDSNKAPK
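
The relationship between the CDS and protein sequences above can be checked with a short translation script. The following is a minimal sketch, not part of the database record: it assumes the standard genetic code (NCBI translation table 1), and the function name `translate` and the variables `cds_start` and `cds_end` are hypothetical names chosen for illustration. The two fragments are the first 30 and last 15 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a wrapped sequence
    can be pasted in as-is. Codons with unknown bases become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS shown above.
cds_start = "ATGAGTGCGTCTGAGTCTCTGGGATTTTAT"
cds_end = "AAAGCTCCTAAGTGA"

print(translate(cds_start))  # MSASESLGFY
print(translate(cds_end))    # KAPK
```

The two outputs match the first ten and last four residues of the protein sequence above. Pasting the full 1,044-nucleotide CDS, line breaks included, is expected to yield the full 347-residue protein, since the function strips whitespace before reading codons.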