CuGenDBv2

Lag0012034 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0012034
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr1:36626403..36627655
RNA-Seq Expression: Lag0012034
Synteny: Lag0012034
Gene Ontology terms:
    GO:0006281 - DNA repair (biological process)
    GO:0004518 - nuclease activity (molecular function)
InterPro domains:
    IPR004808 - AP endonuclease 1
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
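
The genome location in this record uses a `chr:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the string can be split into its parts and the genomic span computed as below. The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr1:36626403..36627655")
print(chrom, start, end, span_length(start, end))
```

If the coordinates turned out to be 0-based and half-open instead, the length would be `end - start`, one base shorter.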


Homology
GenBank top hits (e value, % identity, alignment)
KAG2725981.1 hypothetical protein I3760_01G090600 [Carya illinoinensis] | e value: 7.0e-43 | % identity: 32.49
Query:  MSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSLVSFNLLSYSRNHIDGWVEGNEGRWR
        M  + WN+RGLG+PR +R L  LV+ + P+VLF+ ETK+TS +M+ VR  +G++CCF VDC GR GG+ALLW   +            DG +E     W+
Subjt:  MSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSLVSFNLLSYSRNHIDGWVEGNEGRWR

Query:  LSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQDT-------DL----------------------------
         +G YG P  E  +++WSLL  LR   + PW + GDFN +LSQ EK GG+ +P   + A++         DL                            
Subjt:  LSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQDT-------DL----------------------------

Query:  -------FPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWTPSGSDPTSTRPQIVSSLASRCMQSMAVWGRSKLG
               FP+  V H   + SDH P   ++  P+S     ++ + R E  W+   E  D++ + W   G         I+  L S C + + VW R K G
Subjt:  -------FPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWTPSGSDPTSTRPQIVSSLASRCMQSMAVWGRSKLG

Query:  NFPRRIREANQRVQSTIAELSESNSRESLIQAETQLEGILQEEEVYWKQRSRELWLR
        +  ++I+ A + +Q        S +RE + +A+ QL+  L+ EE+ W QR++ LWL+
Subjt:  NFPRRIREANQRVQSTIAELSESNSRESLIQAETQLEGILQEEEVYWKQRSRELWLR

XP_012851712.1 PREDICTED: uncharacterized protein LOC105971405 [Erythranthe guttata] | e value: 7.0e-43 | % identity: 30.68
Query:  MSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSLVSFNLLSYSRNHIDGWV--EGNEGR
        MS +FWN +GLG+P  +  L  +++  RPL++F+SET+ T   +D V+     +   +VD +G SGGLAL+W   +   L+SYS NHID  V    +  +
Subjt:  MSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSLVSFNLLSYSRNHIDGWV--EGNEGR

Query:  WRLSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQDT-----------------------------------
        WR++GFYG+P  +    SW L+  LR     PW + GDFN ILS SEK GG  +  A + A+++T                                   
Subjt:  WRLSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQDT-----------------------------------

Query:  -------DLFPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWTPSGS-DPTSTRPQIVSSLASRCMQSMAVWGRS
                LFP A V HL+YSGSDH P++ +L    +     K    R E TW R  + +D++ Q W  +   DP     + +    + C  ++  W ++
Subjt:  -------DLFPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWTPSGS-DPTSTRPQIVSSLASRCMQSMAVWGRS

Query:  KLGNFPRRIREANQRVQSTIAELSESNSR-ESLIQAETQLEGILQEEEVYWKQRSRELWLREYER
         L      I+  +++++  + E  ++ +R E + +    LE   ++ ++YW+QRS+  W+RE +R
Subjt:  KLGNFPRRIREANQRVQSTIAELSESNSR-ESLIQAETQLEGILQEEEVYWKQRSRELWLREYER

XP_012857002.1 PREDICTED: uncharacterized protein LOC105976270 [Erythranthe guttata] | e value: 2.6e-45 | % identity: 32.14
Query:  MSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSLVSFNLLSYSRNHIDGWV--EGNEGR
        MS +FWN +GLG+P  +  L  +++  RPL++F+SETK T   ++ ++R    +  F +D VG SGGL L W   V  +L+SYS NHID  V  + +  +
Subjt:  MSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSLVSFNLLSYSRNHIDGWV--EGNEGR

Query:  WRLSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQD-------TDL--------------------------
        WR++GFYGFP      +SW++L +LR   + PW + GD+N ILS +EK+GG  +  A++ A+++       TDL                          
Subjt:  WRLSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQD-------TDL--------------------------

Query:  ---------FPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWTPSG-SDPTSTRPQIVSSLASRCMQSMAVWGRS
                 FP A+V+H+ YSGSDH PL+++L P  +   R K    R E  WLR  + +++V+Q W   G SDP     Q    L      ++  W ++
Subjt:  ---------FPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWTPSG-SDPTSTRPQIVSSLASRCMQSMAVWGRS

Query:  KLGNFPRRIREANQRVQSTIAELSESNSRESLIQAETQLEGILQEEEVYWKQRSRELWLREYER
         +    +RI + N R+           ++E +   + +LE   ++  +YW+QRS+  W++E +R
Subjt:  KLGNFPRRIREANQRVQSTIAELSESNSRESLIQAETQLEGILQEEEVYWKQRSRELWLREYER

XP_018816246.1 uncharacterized protein LOC108987722 [Juglans regia] | e value: 6.4e-44 | % identity: 30.68
Query:  MSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSLVSFNLLSYSRNHIDGWVEG---NEG
        M +  WNARGLG+PR +R L  L+Q + P VLF+ ET++++  ++  +  LG+  C  +   GR GG+ALLWD  +  ++++YS NH+D  ++     +G
Subjt:  MSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSLVSFNLLSYSRNHIDGWVEG---NEG

Query:  RWRLSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQDT-------DL-------------------------
         W L+  YGFP   L  QSWSLL  L   PD PW + GDFN +LS  EK GG  +P  +L+A+++        DL                         
Subjt:  RWRLSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQDT-------DL-------------------------

Query:  ----------FPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWTPSGSDPTSTRPQIVSSLASRCMQSMAVWGRS
                  FPNA V H   + SDH P+ + L   ++     K+   + E  W+   E +++++ VW    + P S    ++S L   C   +  W + 
Subjt:  ----------FPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWTPSGSDPTSTRPQIVSSLASRCMQSMAVWGRS

Query:  KLGNFPRRIREANQRVQSTIAELSESN----SRESLIQAETQLEGILQEEEVYWKQRSRELWLRE
          GN   ++ +A    Q ++  L + +    S ++L  A ++++  L+ +E+ W+QRS+ LWL+E
Subjt:  KLGNFPRRIREANQRVQSTIAELSESN----SRESLIQAETQLEGILQEEEVYWKQRSRELWLRE

XP_042988712.1 uncharacterized protein LOC122316247 [Carya illinoinensis] | e value: 5.5e-48 | % identity: 32.6
Query:  MSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSLVSFNLLSYSRNHIDGWVEG--NEGR
        M  + WN RGLG+PR +R L  LV+ + P+VLF+ ETK+   +M+ V+R LGY+CCF V   GRSGGLAL+W    + N+ SYS+NHID  +    ++G+
Subjt:  MSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSLVSFNLLSYSRNHIDGWVEG--NEGR

Query:  WRLSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQD------------------------------------
        W+ +G YG P  EL  ++W+ +  LRG    PW + GDFN +L   EK GGR++P  ++  ++                                     
Subjt:  WRLSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQD------------------------------------

Query:  ------TDLFPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWT--PSGSDPTSTRPQIVSSLASRCMQSMAVWGR
               D FP   V H   + SDH P+ +      S     +N + R E  W    E +++V   W    SG+   + + +I       C Q +A W +
Subjt:  ------TDLFPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWT--PSGSDPTSTRPQIVSSLASRCMQSMAVWGR

Query:  SKLGNFPRRIREANQRVQSTIAELSESNSRESLIQ-AETQLEGILQEEEVYWKQRSRELWLR
        +K GN  +RI++    +Q  + E    +S+  L Q A  QL+  L+ EE+ W QR++ LWL+
Subjt:  SKLGNFPRRIREANQRVQSTIAELSESNSRESLIQ-AETQLEGILQEEEVYWKQRSRELWLR

TrEMBL top hits (e value, % identity, alignment)
A0A2N9F086 Reverse transcriptase domain-containing protein | e value: 7.3e-46 | % identity: 30.03
Query:  TQKRGTPMSGDPGSEGG---APPRIMSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSL
        T +R   ++G  G  GG   APP  M ++ WN RGLG+P A+R L  LV+ + P VLF+ ETK+ +  M+  R +LG++  F V  +GRSGGLA+ W   
Subjt:  TQKRGTPMSGDPGSEGG---APPRIMSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSL

Query:  VSFNLLSYSRNHIDGWV-EGNEGRWRLSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQD------------
        ++  + +Y+ +HID ++ + N+  WRL+GFYG P      +SW+L+ +L G    PW   GDFN I+ Q+EK G   +PL  +  +++            
Subjt:  VSFNLLSYSRNHIDGWV-EGNEGRWRLSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQD------------

Query:  ------------------------------TDLFPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWTPSGSDPTS
                                      TDLFPN+ V+H+  S SDH P+ + +  P +   R K    R EE W+     +D V+++W+ +      
Subjt:  ------------------------------TDLFPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWTPSGSDPTS

Query:  TRPQIVSSLASRCMQSMAVWGRSKLGNFPRRIREANQRVQSTIAELSESNSRESLIQAETQLEGILQEEEVYWKQRSRELWLR
        +    V+     C   +  W R K G    +I+   + +++   +  E   +E +   + ++  +L  +E +W+QRSR +WL+
Subjt:  TRPQIVSSLASRCMQSMAVWGRSKLGNFPRRIREANQRVQSTIAELSESNSRESLIQAETQLEGILQEEEVYWKQRSRELWLR

A0A2N9FD73 Uncharacterized protein | e value: 8.6e-47 | % identity: 31.9
Query:  GSEGG---APPRIMSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSLVSFNLLSYSRNH
        GS GG   APP  M L+ WN RGLG+P A+R L  LV+++ P +LF+ ETK+    M+ +R  LGY+C FTV  VGRSGGLALLW   +   + ++S +H
Subjt:  GSEGG---APPRIMSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSLVSFNLLSYSRNH

Query:  IDGWVEGNEGR-WRLSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQD------------------------
        ID  +   +GR WRL+GFYG P      +SW+LL  L      PW   GDFN IL Q EK G   KPL  +  ++D                        
Subjt:  IDGWVEGNEGR-WRLSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQD------------------------

Query:  ------------------TDLFPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWTPSGSDPTSTRPQIVSSLASR
                          +  FPN++V H+  + SDH P+ + +        R K    R EE WL  PE + +VR++W     D  +T+   +  L  +
Subjt:  ------------------TDLFPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWTPSGSDPTSTRPQIVSSLASR

Query:  ---CMQSMAVWGRSKLGNFPRRIREANQRVQSTIAELSESNSRESLIQAETQLEGILQEEEVYWKQRSRELWL
           C   +  W +        ++R     +++   +  +   ++ +   + ++  +L  +E +WKQRSR++WL
Subjt:  ---CMQSMAVWGRSKLGNFPRRIREANQRVQSTIAELSESNSRESLIQAETQLEGILQEEEVYWKQRSRELWL

A0A2N9G8I6 Reverse transcriptase domain-containing protein | e value: 7.3e-46 | % identity: 32.98
Query:  SGDPGS-EGGAPPRIMSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSLVSFNLLSYSR
        +G+ GS +  APP  M L+ WN +GLG+P A+R L  +V+ K P VLF+ ETK+ + RM+V+R  LG+D  FTV  +GRSGGLALLW +     + +YS+
Subjt:  SGDPGS-EGGAPPRIMSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSLVSFNLLSYSR

Query:  NHIDGWVEGNEGR-WRLSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQDT---------------------
        +HID  V+  + + WRL+GFYG P      +SW+LL  L      PW   GDFN IL+ +EK GGR++ L ++  +Q+                      
Subjt:  NHIDGWVEGNEGR-WRLSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQDT---------------------

Query:  ---------------------DLFPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWTPSGSDPTSTRPQI--VSS
                             DLFP   V+H+  S SDH  L V +   TS   R K  + R EE W   P+ + L+++ W      P S    +  +  
Subjt:  ---------------------DLFPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWTPSGSDPTSTRPQI--VSS

Query:  LASRCMQSMAVWGRSKLGNFPRRIREANQRVQSTIAELSES-----NSRESLIQAETQLEGILQEEEVYWKQRSRELWL
          SRC  ++  W R     F     + N ++++  A  +++     N +  +++ E  +  +L ++E++W+QRSRE+WL
Subjt:  LASRCMQSMAVWGRSKLGNFPRRIREANQRVQSTIAELSES-----NSRESLIQAETQLEGILQEEEVYWKQRSRELWL

A0A2N9IBI9 Reverse transcriptase domain-containing protein | e value: 7.3e-46 | % identity: 32.98
Query:  SGDPGS-EGGAPPRIMSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSLVSFNLLSYSR
        +G+ GS +  APP  M L+ WN +GLG+P A+R L  +V+ K P VLF+ ETK+ + RM+V+R  LG+D  FTV  +GRSGGLALLW +     + +YS+
Subjt:  SGDPGS-EGGAPPRIMSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSLVSFNLLSYSR

Query:  NHIDGWVEGNEGR-WRLSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQDT---------------------
        +HID  V+  + + WRL+GFYG P      +SW+LL  L      PW   GDFN IL+ +EK GGR++ L ++  +Q+                      
Subjt:  NHIDGWVEGNEGR-WRLSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQDT---------------------

Query:  ---------------------DLFPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWTPSGSDPTSTRPQI--VSS
                             DLFP   V+H+  S SDH  L V +   TS   R K  + R EE W   P+ + L+++ W      P S    +  +  
Subjt:  ---------------------DLFPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWTPSGSDPTSTRPQI--VSS

Query:  LASRCMQSMAVWGRSKLGNFPRRIREANQRVQSTIAELSES-----NSRESLIQAETQLEGILQEEEVYWKQRSRELWL
          SRC  ++  W R     F     + N ++++  A  +++     N +  +++ E  +  +L ++E++W+QRSRE+WL
Subjt:  LASRCMQSMAVWGRSKLGNFPRRIREANQRVQSTIAELSES-----NSRESLIQAETQLEGILQEEEVYWKQRSRELWL

A0A803QNR5 Uncharacterized protein | e value: 4.3e-46 | % identity: 32.7
Query:  EGGAPPRIMSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSLVSFNLLSYSRNHIDGWV
        EG  PP IM+++ WN RGLG+ RA++ L ++V  K P  +F+ ETK    R++ V R L ++  F V+  G SGGLALLW       +L YS NHID  V
Subjt:  EGGAPPRIMSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSLVSFNLLSYSRNHIDGWV

Query:  E-GNEGRWRLSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQDT----------------------------
        E  ++G W+L+GFYG P   L  Q+W LL  L      PW + GD N I+ Q +K GGR  P   +  +Q                              
Subjt:  E-GNEGRWRLSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQDT----------------------------

Query:  --------------DLFPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWTPSGSDPTSTRPQIVSSLASRCMQSM
                      +LF  A + +L+ S SDH P+ +   P ++  V  K    + E  WL+ P   ++VR  W     D         SS  +RC   +
Subjt:  --------------DLFPNAVVNHLDYSGSDHRPLEVLLAPPTSCWVRGKNCIDRLEETWLRYPELQDLVRQVWTPSGSDPTSTRPQIVSSLASRCMQSM

Query:  AVWGRSKLGNFPRRIREANQRVQSTIAELSESNSRESLIQAETQLEGILQEEEVYWKQRSRELWLRE
        +VWG+   GNF  RI+   + +Q  +A   +  S +   +A+ +L  +L + E +WKQR+++LWL+E
Subjt:  AVWGRSKLGNFPRRIREANQRVQSTIAELSESNSRESLIQAETQLEGILQEEEVYWKQRSRELWLRE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGATAGTGTGGCTAGCACGCAGAAGAGAGGAACTCCTATGAGTGGTGATCCGGGTTCTGAGGGTGGTGCCCCGCCCAGAATTATGAGCCTCATGTTCTGGAACGCCCG
AGGTTTAGGGTCACCTCGTGCGCTTCGTCGCTTGACCAAATTGGTTCAGGCAAAACGACCCTTGGTGCTTTTTATTTCTGAAACTAAGGTTACGTCTTTCAGAATGGATG
TGGTGAGAAGAACGTTGGGGTATGATTGTTGTTTTACAGTGGATTGCGTGGGCAGGAGTGGGGGTTTGGCTCTTCTCTGGGACTCATTGGTCTCGTTCAACCTACTCTCC
TACTCGAGAAATCATATAGATGGATGGGTGGAGGGGAATGAGGGCAGATGGCGTCTGTCGGGTTTCTATGGTTTTCCTGCGGCAGAGCTTCATGATCAGTCTTGGTCTCT
TTTGAGCAGGTTGCGGGGTTGTCCTGATACCCCGTGGCATATCGAGGGAGACTTCAATGCAATTCTCAGTCAGAGTGAGAAGGATGGTGGTAGGGACAAGCCGCTGGCTG
AGCTGGCTGCATATCAAGACACTGATTTGTTTCCAAATGCGGTGGTGAATCATCTGGATTATAGTGGCTCTGACCACCGTCCTTTAGAGGTTTTACTAGCCCCTCCAACG
TCCTGTTGGGTGAGGGGAAAGAACTGTATTGATCGTCTTGAGGAGACTTGGTTACGCTACCCTGAGCTGCAGGACTTGGTTCGTCAGGTGTGGACCCCATCTGGCTCAGA
TCCCACCTCTACGAGGCCTCAGATCGTTAGCTCACTGGCTAGTAGATGTATGCAGTCTATGGCGGTCTGGGGTAGATCGAAGTTGGGAAATTTTCCCAGGCGAATCAGGG
AAGCCAACCAGAGGGTCCAATCTACCATTGCTGAGTTGAGCGAGTCCAACTCTCGTGAATCGTTGATACAGGCTGAGACTCAGTTAGAGGGGATCTTGCAGGAGGAGGAG
GTATACTGGAAGCAGAGATCTCGAGAGTTGTGGCTACGGGAATATGAGAGAATATGA
mRNA sequence
ATGGATAGTGTGGCTAGCACGCAGAAGAGAGGAACTCCTATGAGTGGTGATCCGGGTTCTGAGGGTGGTGCCCCGCCCAGAATTATGAGCCTCATGTTCTGGAACGCCCG
AGGTTTAGGGTCACCTCGTGCGCTTCGTCGCTTGACCAAATTGGTTCAGGCAAAACGACCCTTGGTGCTTTTTATTTCTGAAACTAAGGTTACGTCTTTCAGAATGGATG
TGGTGAGAAGAACGTTGGGGTATGATTGTTGTTTTACAGTGGATTGCGTGGGCAGGAGTGGGGGTTTGGCTCTTCTCTGGGACTCATTGGTCTCGTTCAACCTACTCTCC
TACTCGAGAAATCATATAGATGGATGGGTGGAGGGGAATGAGGGCAGATGGCGTCTGTCGGGTTTCTATGGTTTTCCTGCGGCAGAGCTTCATGATCAGTCTTGGTCTCT
TTTGAGCAGGTTGCGGGGTTGTCCTGATACCCCGTGGCATATCGAGGGAGACTTCAATGCAATTCTCAGTCAGAGTGAGAAGGATGGTGGTAGGGACAAGCCGCTGGCTG
AGCTGGCTGCATATCAAGACACTGATTTGTTTCCAAATGCGGTGGTGAATCATCTGGATTATAGTGGCTCTGACCACCGTCCTTTAGAGGTTTTACTAGCCCCTCCAACG
TCCTGTTGGGTGAGGGGAAAGAACTGTATTGATCGTCTTGAGGAGACTTGGTTACGCTACCCTGAGCTGCAGGACTTGGTTCGTCAGGTGTGGACCCCATCTGGCTCAGA
TCCCACCTCTACGAGGCCTCAGATCGTTAGCTCACTGGCTAGTAGATGTATGCAGTCTATGGCGGTCTGGGGTAGATCGAAGTTGGGAAATTTTCCCAGGCGAATCAGGG
AAGCCAACCAGAGGGTCCAATCTACCATTGCTGAGTTGAGCGAGTCCAACTCTCGTGAATCGTTGATACAGGCTGAGACTCAGTTAGAGGGGATCTTGCAGGAGGAGGAG
GTATACTGGAAGCAGAGATCTCGAGAGTTGTGGCTACGGGAATATGAGAGAATATGA
Protein sequence
MDSVASTQKRGTPMSGDPGSEGGAPPRIMSLMFWNARGLGSPRALRRLTKLVQAKRPLVLFISETKVTSFRMDVVRRTLGYDCCFTVDCVGRSGGLALLWDSLVSFNLLS
YSRNHIDGWVEGNEGRWRLSGFYGFPAAELHDQSWSLLSRLRGCPDTPWHIEGDFNAILSQSEKDGGRDKPLAELAAYQDTDLFPNAVVNHLDYSGSDHRPLEVLLAPPT
SCWVRGKNCIDRLEETWLRYPELQDLVRQVWTPSGSDPTSTRPQIVSSLASRCMQSMAVWGRSKLGNFPRRIREANQRVQSTIAELSESNSRESLIQAETQLEGILQEEE
VYWKQRSRELWLREYERI
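
The CDS and protein sequences listed in this record can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code; the record does not say which translation table was used. The function name `translate` and the two excerpt variables are hypothetical. The excerpts are copied from the first 30 and last 18 nucleotides of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Codons with ambiguous bases become 'X'; a trailing partial codon
    is ignored.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts from the CDS in this record (start and end only).
cds_start = "ATGGATAGTGTGGCTAGCACGCAGAAGAGA"
cds_end = "GAATATGAGAGAATATGA"

print(translate(cds_start))  # MDSVASTQKR
print(translate(cds_end))    # EYERI
```

Both results match the beginning and the end of the protein sequence listed in this record. Passing the full CDS, with line breaks removed, should give the full protein under the same assumption.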