; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001254 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001254
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:28029439..28031876
RNA-Seq ExpressionLag0001254
SyntenyLag0001254
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5443558.1 hypothetical protein F2P56_036105, partial [Juglans regia]9.0e-2642.28Show/hide
Query:  TWQLICRLHNFCDDAWVIGGDLNEILWDNENSKGPTRDHRLMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNSNFCSLFPNHLVLH
        TW L+ RL++  D  W++GGD NE+L  NE   G  R    M  FR V+  C+L DLGF G  YTWCN R     +S RLDRF+GN+ FC+LFP  +V H
Subjt:  TWQLICRLHNFCDDAWVIGGDLNEILWDNENSKGPTRDHRLMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNSNFCSLFPNHLVLH

Query:  QDWAKSDHRPIELNLLGSHFSSSPRRGNLSFKFEEWWTHHVECKEIIAR
           A SDH P+  +   S       +    F+FE  W    +C +II R
Subjt:  QDWAKSDHRPIELNLLGSHFSSSPRRGNLSFKFEEWWTHHVECKEIIAR

XP_027118730.1 uncharacterized protein LOC113735973 [Coffea arabica]7.9e-3028.05Show/hide
Query:  AEVIWRTFKATWKLERGLQVETLGKNIFIFRLTVEEDRTRVVRQSPCHFDKFLLVLEFPIRSQKPSDYKFLFSAFWIHVYDLPLDWYNDVMVERIGNAIA
        AE +  T K +W   +GL+   +G N+F+F+     D+ +V+   P  FD  LLV+   + + +PS  K    +FW+ VY+LPL W N    + IG+ + 
Subjt:  AEVIWRTFKATWKLERGLQVETLGKNIFIFRLTVEEDRTRVVRQSPCHFDKFLLVLEFPIRSQKPSDYKFLFSAFWIHVYDLPLDWYNDVMVERIGNAIA

Query:  DYE-------VTTTSNTGPRVPPMTSDSRGDLQ------TVADVSLK-SNLPK---------KAKGDGSDPLASQWSSDCSLPAAGIGDNEKKQNP----
         YE       ++       RV    ++    L        V +V  +   LP            + D  D L    +SD     + +G  +  Q P    
Subjt:  DYE-------VTTTSNTGPRVPPMTSDSRGDLQ------TVADVSLK-SNLPK---------KAKGDGSDPLASQWSSDCSLPAAGIGDNEKKQNP----

Query:  TSVVIATEKKQTKRFGYVPIGLSVQRLEEFQKRKDGPIMLSFDNIKCPKMEEESDKEAGTTWQLICRLHNFCDDAWVIGGDLNEILWDNENSKGPTRDHR
         S ++AT+++        PI LS   + +        +  S   +       E+ K    TW ++ +L   C   WV  GD NE+L   E      R   
Subjt:  TSVVIATEKKQTKRFGYVPIGLSVQRLEEFQKRKDGPIMLSFDNIKCPKMEEESDKEAGTTWQLICRLHNFCDDAWVIGGDLNEILWDNENSKGPTRDHR

Query:  LMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNSNFCSLFPNHLVLHQDWAKSDHRPIELNLLGSH---FSSSPRRGNLSFKFEEWW
         + NFR+ L  CNL DLG  G+ +TWC  R   D    RLDR   + +F +LFP+  + H     SDH PI L L   H    + S  R +  F FE  W
Subjt:  LMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNSNFCSLFPNHLVLHQDWAKSDHRPIELNLLGSH---FSSSPRRGNLSFKFEEWW

Query:  THHVECKEII
            +C+ II
Subjt:  THHVECKEII

XP_030498017.1 uncharacterized protein LOC115713672 [Cannabis sativa]1.4e-2640.49Show/hide
Query:  PKMEEESDKEAGTTWQLICRLHNFCDDAWVIGGDLNEILWDNENSKGPTRDHRLMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNS
        PK    +D     +W+L+CRL    D  W+ GGD NEIL  NE   G  R    +T F++ LD C L+D+GF G  +TW N+R     V  RLDR+  N 
Subjt:  PKMEEESDKEAGTTWQLICRLHNFCDDAWVIGGDLNEILWDNENSKGPTRDHRLMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNS

Query:  NFCSLFPNHLVLHQDWAKSDHRPIELNLLGSHFSSSPRRGNLSFKFEEWWTHHVECKEIIARA
         + SLFP+  VL+ D+  SDHRPI + +L +   S P     SF+FE  W    EC +I+ +A
Subjt:  NFCSLFPNHLVLHQDWAKSDHRPIELNLLGSHFSSSPRRGNLSFKFEEWWTHHVECKEIIARA

XP_040987679.1 uncharacterized protein LOC121235397 [Juglans microcarpa x Juglans regia]9.0e-2639.75Show/hide
Query:  TWQLICRLHNFCDDAWVIGGDLNEILWDNENSKGPTRDHRLMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNSNFCSLFPNHLVLH
        TW LI  L    D  W++ GD NE+L ++E S G     R M  F+ V+D C LMDLGF G+ +TWCN RE    +S RLDR + +  + + FP   V+H
Subjt:  TWQLICRLHNFCDDAWVIGGDLNEILWDNENSKGPTRDHRLMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNSNFCSLFPNHLVLH

Query:  QDWAKSDHRPIELNLLGSHFSSSPRRGNLSFKFEEWWTHHVECKEIIARARVGVVEERSLQ
             SDH PI+ ++   HF+S  RRG   F+FE  W     C+E+I  A +    ER ++
Subjt:  QDWAKSDHRPIELNLLGSHFSSSPRRGNLSFKFEEWWTHHVECKEIIARARVGVVEERSLQ

XP_041011336.1 uncharacterized protein LOC121255118 [Juglans microcarpa x Juglans regia]4.8e-2742.24Show/hide
Query:  TWQLICRLHNFCDDAWVIGGDLNEILWDNENSKGPTRDHRLMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNSNFCSLFPNHLVLH
        TW LI  L    D  W++ GD NE+L + E S G  R    M +FR V+D C L+DLGF G+ +TWCN RE    +S RLDR + N  + +LFP   V+H
Subjt:  TWQLICRLHNFCDDAWVIGGDLNEILWDNENSKGPTRDHRLMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNSNFCSLFPNHLVLH

Query:  QDWAKSDHRPIELNLLGSHFSSSPRRGNLSFKFEEWWTHHVECKEIIARARVGVVEERSLQ
         + A SDH PI+L L  + F    R+G   F FE  W     C+EII  A V    ER+++
Subjt:  QDWAKSDHRPIELNLLGSHFSSSPRRGNLSFKFEEWWTHHVECKEIIARARVGVVEERSLQ

TrEMBL top hitse value%identityAlignment
A0A2N9GA96 Uncharacterized protein7.4e-2626.2Show/hide
Query:  TFKATWKLERGLQVETLGKNIFIFRLTVEEDRTRVVRQSPCHFDKFLLVLEFPIRSQKPSDYKFLFSAFWIHVYDLPLDWYNDVMVERIGNAIAD-----
        T ++ W    G+ +  + KN+F+      +D  RV  QSP  FDK L+++       +P+   F  SAFWI VY+LP+      M+  +G  I+      
Subjt:  TFKATWKLERGLQVETLGKNIFIFRLTVEEDRTRVVRQSPCHFDKFLLVLEFPIRSQKPSDYKFLFSAFWIHVYDLPLDWYNDVMVERIGNAIAD-----

Query:  YEVTTTSNTGP-----------RVPPMTSDSR-GDLQTVADVSLKSNL--------PKKAKGDGSDPLASQWSSDCSLPAAGIGDNEKK-----QNPTSV
         EV    N  P           + PP + D R     T+A+  + + L             GD   P +++ SS  S P   + + +       +   S+
Subjt:  YEVTTTSNTGP-----------RVPPMTSDSR-GDLQTVADVSLKSNL--------PKKAKGDGSDPLASQWSSDCSLPAAGIGDNEKK-----QNPTSV

Query:  VI-------------------ATEKKQTKRFGYVPIGLSVQRLEEFQKRKDGPIMLSFDNIK----CPKM------------EEESDKEAGTT-------
        V+                    T            +G S  +   ++KR   P +   D  K    C K             +E S K+   T       
Subjt:  VI-------------------ATEKKQTKRFGYVPIGLSVQRLEEFQKRKDGPIMLSFDNIK----CPKM------------EEESDKEAGTT-------

Query:  -------WQLICRLHNFCDDAWVIGGDLNEILWDNENSKGPTRDHRLMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNSNFCSLFP
               W+L+  L +  +  W++ GD NEI    E      R+   M  FRE L  C+L DLGF G+ +TW N+R+    V  RLDR + +  + SLFP
Subjt:  -------WQLICRLHNFCDDAWVIGGDLNEILWDNENSKGPTRDHRLMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNSNFCSLFP

Query:  NHLVLHQDWAKSDHRPIELNLLGSH--FSSSPRRGNLSFKFEEWWTHHVECKEIIARA
        N  V H   A SDH  + ++ L S      + RR  L F+FE+ W   + C+++IA A
Subjt:  NHLVLHQDWAKSDHRPIELNLLGSH--FSSSPRRGNLSFKFEEWWTHHVECKEIIARA

A0A2N9I0P4 Reverse transcriptase domain-containing protein1.2e-2826.93Show/hide
Query:  EVIWRTFKATWKLERGLQVETLGKNIFIFRLTVEEDRTRVVRQSPCHFDKFLLVLEFPIRSQKPSDYKFLFSAFWIHVYDLPLDWYNDVMVERIGN----
        E I RTF+  W+ E+G  V+ LG N  +     + D  RV+   P  +DKFL+VL+        +      + FW+ ++ LPL      M   IG+    
Subjt:  EVIWRTFKATWKLERGLQVETLGKNIFIFRLTVEEDRTRVVRQSPCHFDKFLLVLEFPIRSQKPSDYKFLFSAFWIHVYDLPLDWYNDVMVERIGN----

Query:  ------------AIADY---------EVTTTSNTGPRVP--PMTSDSRG----DLQTVADVSLK--SNL--------PKKAKGDGSDPLASQWSSDCSLP
                    A AD          E  T  +    +P  P+ +DS      ++Q  +  +LK  +NL        PK  K      +    +    LP
Subjt:  ------------AIADY---------EVTTTSNTGPRVP--PMTSDSRG----DLQTVADVSLK--SNL--------PKKAKGDGSDPLASQWSSDCSLP

Query:  AAGIGDNEKKQNPTSVVIATEKKQTKRFGYVPIGLSVQRLEEFQKRKDGPIMLSFDNIKCPKMEEESDKEAGTTWQLICRLHNFCDDAWVIG----GDLN
        A   G      +          K+           +VQ L    + KD P++L    ++    E   +K  GT    IC +  F +    +G    GD N
Subjt:  AAGIGDNEKKQNPTSVVIATEKKQTKRFGYVPIGLSVQRLEEFQKRKDGPIMLSFDNIKCPKMEEESDKEAGTTWQLICRLHNFCDDAWVIG----GDLN

Query:  EILWDNENSKGPTRDHRLMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNSNFCSLFPNHLVLHQDWAKSDHRPIELNLLGSHFSSS
        E+   +E   G     R M +FR+ +D C  +DLGF+G  +TWCN R     +  RLDR +  + + S+FPN  + H D   SDH P+ LNL+ ++ + S
Subjt:  EILWDNENSKGPTRDHRLMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNSNFCSLFPNHLVLHQDWAKSDHRPIELNLLGSHFSSS

Query:  PRRGNLSFKFEEWWTHHVECKEIIARA
         +R    F+FEE W   + C++++ +A
Subjt:  PRRGNLSFKFEEWWTHHVECKEIIARA

A0A2N9IMU2 Reverse transcriptase domain-containing protein3.0e-2726Show/hide
Query:  EVIWRTFKATWKLERGLQVETLGKNIFIFRLTVEEDRTRVVRQSPCHFDKFLLVLEFPIRSQKPSDYKFLFSAFWIHVYDLPLDWYNDVMVERIGNAIAD
        E I RTF+  W+ E+G  V+ LG N  +     + D  RV+   P  +DKFL+VL+        +      + FW+ ++ LPL                 
Subjt:  EVIWRTFKATWKLERGLQVETLGKNIFIFRLTVEEDRTRVVRQSPCHFDKFLLVLEFPIRSQKPSDYKFLFSAFWIHVYDLPLDWYNDVMVERIGNAIAD

Query:  YEVTTTSNTGPRVP-PMTSDSRGDLQTVADVSLKSNLPKKAKGDGSDPLASQWSSDCSLPAAGIGDNEKKQNPTSVV-----IATEKKQTKRFGYVPIGL
                 GP  P P    +R  +    + +L  +LP  AK  G + + S  +       A    +E++    + V     +  EK     F  V  GL
Subjt:  YEVTTTSNTGPRVP-PMTSDSRGDLQTVADVSLKSNLPKKAKGDGSDPLASQWSSDCSLPAAGIGDNEKKQNPTSVV-----IATEKKQTKRFGYVPIGL

Query:  SVQRLEEFQKR-----------------------KDGPIMLS------FDNIKCPKMEEE----------SDKEAGTTWQLICRLHNFCDDAWVIGGDLN
           RLE  + R                       K+  I ++       D+I    ME                   +W L+  LH      W   GD N
Subjt:  SVQRLEEFQKR-----------------------KDGPIMLS------FDNIKCPKMEEE----------SDKEAGTTWQLICRLHNFCDDAWVIGGDLN

Query:  EILWDNENSKGPTRDHRLMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNSNFCSLFPNHLVLHQDWAKSDHRPIELNLLGSHFSSS
        E+   +E   G     R M +FR+ +D C  +DLGF+G  +TWCN R     +  RLDR +  + + S+FPN  + H D   SDH P+ LNL+ ++ + S
Subjt:  EILWDNENSKGPTRDHRLMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNSNFCSLFPNHLVLHQDWAKSDHRPIELNLLGSHFSSS

Query:  PRRGNLSFKFEEWWTHHVECKEIIARA
         +R    F+FEE W   + C++++ +A
Subjt:  PRRGNLSFKFEEWWTHHVECKEIIARA

A0A2Z6P0M1 DUF4283 domain-containing protein5.7e-2623.76Show/hide
Query:  WKLERGLQVETLGKNIFIFRLTVEEDRTRVVRQSPCHFDKFLLVLEFPIRSQKPSDYKFLFSAFWIHVYDLPLDWYNDVMVERIGNAIADYEVTTTSNTG
        W L+  ++++ L KN+F+FR   + D   V++     FD+ L+VL+     ++PSD +   + FW  +YDLPL   +D M E++GN I  + V      G
Subjt:  WKLERGLQVETLGKNIFIFRLTVEEDRTRVVRQSPCHFDKFLLVLEFPIRSQKPSDYKFLFSAFWIHVYDLPLDWYNDVMVERIGNAIADYEVTTTSNTG

Query:  PRVPPMTSDSRGDLQTVADV--SLKSNLPKKAKGDGSDPLASQ----WSSDCSLPAAGI-GDNEKKQNPTSV------VIATEKKQTKRFGYVPIGLSVQ
         R+          L+T  D+   LK       +G        +    W    S P   +  D +K+Q+P+S         +T K ++K       GL ++
Subjt:  PRVPPMTSDSRGDLQTVADV--SLKSNLPKKAKGDGSDPLASQ----WSSDCSLPAAGI-GDNEKKQNPTSV------VIATEKKQTKRFGYVPIGLSVQ

Query:  RLEEFQKRKDG----------PIMLSFD------------------------------NIKCPKMEEESD-------------KEAGTTWQLICRLHNFC
         + E +K K            P    ++                               + C +  +  D                  TW+LI + H+  
Subjt:  RLEEFQKRKDG----------PIMLSFD------------------------------NIKCPKMEEESD-------------KEAGTTWQLICRLHNFC

Query:  DDAWVIGGDLNEILWDNENSKGPTRDHRLMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNSNFCSLFPNHLVLHQDWAKSDHRPIE
           W+  GDLN++L  ++   G  R    +    + ++ C L D+GF G  +TW N R+    V  RLD+ +G  +F + F    V H     SDH  I 
Subjt:  DDAWVIGGDLNEILWDNENSKGPTRDHRLMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNSNFCSLFPNHLVLHQDWAKSDHRPIE

Query:  LNLLGSHFSSSPRRGNLSFKFEEWWTHHVECKEIIARARVGV
        + L   H     ++    F+F++ WT    C+E + +   GV
Subjt:  LNLLGSHFSSSPRRGNLSFKFEEWWTHHVECKEIIARARVGV

A0A6P6WTP1 uncharacterized protein LOC1137359733.8e-3028.05Show/hide
Query:  AEVIWRTFKATWKLERGLQVETLGKNIFIFRLTVEEDRTRVVRQSPCHFDKFLLVLEFPIRSQKPSDYKFLFSAFWIHVYDLPLDWYNDVMVERIGNAIA
        AE +  T K +W   +GL+   +G N+F+F+     D+ +V+   P  FD  LLV+   + + +PS  K    +FW+ VY+LPL W N    + IG+ + 
Subjt:  AEVIWRTFKATWKLERGLQVETLGKNIFIFRLTVEEDRTRVVRQSPCHFDKFLLVLEFPIRSQKPSDYKFLFSAFWIHVYDLPLDWYNDVMVERIGNAIA

Query:  DYE-------VTTTSNTGPRVPPMTSDSRGDLQ------TVADVSLK-SNLPK---------KAKGDGSDPLASQWSSDCSLPAAGIGDNEKKQNP----
         YE       ++       RV    ++    L        V +V  +   LP            + D  D L    +SD     + +G  +  Q P    
Subjt:  DYE-------VTTTSNTGPRVPPMTSDSRGDLQ------TVADVSLK-SNLPK---------KAKGDGSDPLASQWSSDCSLPAAGIGDNEKKQNP----

Query:  TSVVIATEKKQTKRFGYVPIGLSVQRLEEFQKRKDGPIMLSFDNIKCPKMEEESDKEAGTTWQLICRLHNFCDDAWVIGGDLNEILWDNENSKGPTRDHR
         S ++AT+++        PI LS   + +        +  S   +       E+ K    TW ++ +L   C   WV  GD NE+L   E      R   
Subjt:  TSVVIATEKKQTKRFGYVPIGLSVQRLEEFQKRKDGPIMLSFDNIKCPKMEEESDKEAGTTWQLICRLHNFCDDAWVIGGDLNEILWDNENSKGPTRDHR

Query:  LMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNSNFCSLFPNHLVLHQDWAKSDHRPIELNLLGSH---FSSSPRRGNLSFKFEEWW
         + NFR+ L  CNL DLG  G+ +TWC  R   D    RLDR   + +F +LFP+  + H     SDH PI L L   H    + S  R +  F FE  W
Subjt:  LMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNSNFCSLFPNHLVLHQDWAKSDHRPIELNLLGSH---FSSSPRRGNLSFKFEEWW

Query:  THHVECKEII
            +C+ II
Subjt:  THHVECKEII

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCAGAAGTCATTTGGAGGACGTTTAAAGCTACTTGGAAACTAGAGCGGGGGCTACAAGTCGAGACGTTGGGTAAGAATATTTTCATTTTCAGACTCACGGTGGA
AGAGGATAGGACGAGGGTTGTGAGACAGAGCCCCTGCCATTTTGATAAATTTCTTTTGGTGTTAGAGTTCCCGATCAGGTCACAAAAGCCATCGGATTATAAATTTTTGT
TCTCGGCTTTTTGGATTCATGTATATGACCTTCCATTAGATTGGTACAATGACGTCATGGTAGAGCGAATAGGAAATGCCATTGCCGACTATGAAGTCACGACAACCTCC
AATACCGGTCCAAGAGTCCCACCGATGACAAGCGATTCGAGGGGCGATTTACAGACTGTTGCAGATGTTTCCTTGAAGTCGAATCTGCCTAAGAAGGCGAAAGGGGATGG
GTCCGATCCGTTGGCGAGTCAATGGTCGTCGGACTGTTCCTTGCCTGCGGCTGGCATTGGTGATAACGAGAAGAAGCAAAACCCCACAAGCGTTGTTATTGCTACTGAGA
AGAAGCAAACAAAAAGATTCGGTTATGTACCTATTGGGCTATCTGTGCAACGACTAGAGGAGTTCCAAAAGCGTAAGGATGGGCCAATAATGCTATCTTTTGATAACATA
AAGTGTCCAAAGATGGAAGAGGAGTCTGATAAGGAGGCGGGGACTACCTGGCAACTCATATGCCGCTTACATAATTTTTGTGATGATGCATGGGTTATTGGAGGTGATCT
TAATGAGATTCTTTGGGACAATGAGAATTCAAAAGGTCCTACTCGGGACCATCGCCTGATGACGAATTTTAGGGAGGTCTTGGATGGCTGCAATCTCATGGATTTGGGCT
TTTTAGGCAGCACTTATACTTGGTGCAATAGGAGAGAAGATGGTGATCAAGTGAGCTTACGCCTAGATCGTTTTATTGGAAACTCTAATTTCTGTTCTTTGTTTCCAAAT
CACCTGGTTCTGCATCAAGATTGGGCAAAATCGGACCATCGTCCAATTGAACTGAACTTACTAGGCTCGCACTTCAGTAGCAGCCCAAGAAGAGGAAACCTATCCTTTAA
ATTTGAGGAATGGTGGACGCACCATGTAGAGTGCAAGGAAATTATTGCTCGAGCTAGGGTTGGGGTTGTAGAGGAGCGAAGTCTGCAACCTACAAAACCGACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCAGAAGTCATTTGGAGGACGTTTAAAGCTACTTGGAAACTAGAGCGGGGGCTACAAGTCGAGACGTTGGGTAAGAATATTTTCATTTTCAGACTCACGGTGGA
AGAGGATAGGACGAGGGTTGTGAGACAGAGCCCCTGCCATTTTGATAAATTTCTTTTGGTGTTAGAGTTCCCGATCAGGTCACAAAAGCCATCGGATTATAAATTTTTGT
TCTCGGCTTTTTGGATTCATGTATATGACCTTCCATTAGATTGGTACAATGACGTCATGGTAGAGCGAATAGGAAATGCCATTGCCGACTATGAAGTCACGACAACCTCC
AATACCGGTCCAAGAGTCCCACCGATGACAAGCGATTCGAGGGGCGATTTACAGACTGTTGCAGATGTTTCCTTGAAGTCGAATCTGCCTAAGAAGGCGAAAGGGGATGG
GTCCGATCCGTTGGCGAGTCAATGGTCGTCGGACTGTTCCTTGCCTGCGGCTGGCATTGGTGATAACGAGAAGAAGCAAAACCCCACAAGCGTTGTTATTGCTACTGAGA
AGAAGCAAACAAAAAGATTCGGTTATGTACCTATTGGGCTATCTGTGCAACGACTAGAGGAGTTCCAAAAGCGTAAGGATGGGCCAATAATGCTATCTTTTGATAACATA
AAGTGTCCAAAGATGGAAGAGGAGTCTGATAAGGAGGCGGGGACTACCTGGCAACTCATATGCCGCTTACATAATTTTTGTGATGATGCATGGGTTATTGGAGGTGATCT
TAATGAGATTCTTTGGGACAATGAGAATTCAAAAGGTCCTACTCGGGACCATCGCCTGATGACGAATTTTAGGGAGGTCTTGGATGGCTGCAATCTCATGGATTTGGGCT
TTTTAGGCAGCACTTATACTTGGTGCAATAGGAGAGAAGATGGTGATCAAGTGAGCTTACGCCTAGATCGTTTTATTGGAAACTCTAATTTCTGTTCTTTGTTTCCAAAT
CACCTGGTTCTGCATCAAGATTGGGCAAAATCGGACCATCGTCCAATTGAACTGAACTTACTAGGCTCGCACTTCAGTAGCAGCCCAAGAAGAGGAAACCTATCCTTTAA
ATTTGAGGAATGGTGGACGCACCATGTAGAGTGCAAGGAAATTATTGCTCGAGCTAGGGTTGGGGTTGTAGAGGAGCGAAGTCTGCAACCTACAAAACCGACTTAA
Protein sequenceShow/hide protein sequence
MAAEVIWRTFKATWKLERGLQVETLGKNIFIFRLTVEEDRTRVVRQSPCHFDKFLLVLEFPIRSQKPSDYKFLFSAFWIHVYDLPLDWYNDVMVERIGNAIADYEVTTTS
NTGPRVPPMTSDSRGDLQTVADVSLKSNLPKKAKGDGSDPLASQWSSDCSLPAAGIGDNEKKQNPTSVVIATEKKQTKRFGYVPIGLSVQRLEEFQKRKDGPIMLSFDNI
KCPKMEEESDKEAGTTWQLICRLHNFCDDAWVIGGDLNEILWDNENSKGPTRDHRLMTNFREVLDGCNLMDLGFLGSTYTWCNRREDGDQVSLRLDRFIGNSNFCSLFPN
HLVLHQDWAKSDHRPIELNLLGSHFSSSPRRGNLSFKFEEWWTHHVECKEIIARARVGVVEERSLQPTKPT