CuGenDBv2

Lag0009027 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0009027
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr9:34259398..34261972
RNA-Seq Expression: Lag0009027
Synteny: Lag0009027
Gene Ontology terms: NA
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
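
The genome location above follows a `chrom:start..end` pattern. As a minimal sketch, the snippet below splits such a string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval in bp.

    Assumption: coordinates are 1-based and inclusive on both ends.
    """
    return abs(end - start) + 1


chrom, start, end = parse_location("chr9:34259398..34261972")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 2575 bp on chr9.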


Homology
GenBank top hits (e value, % identity, alignment)
CAB4309574.1 unnamed protein product [Prunus armeniaca] (e value: 4.8e-25; % identity: 31.2)
Query:  IQSAPSLPLASTVSELFSASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHGSFS------------DR---PSSSNSDRMRAWWSALWK
        + S P LPL++ V +LF++SG WN  +L+  F + + + ILRI +      DCL+W++E++G +S            D+    SS+ +D    +W  +W 
Subjt:  IQSAPSLPLASTVSELFSASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHGSFS------------DR---PSSSNSDRMRAWWSALWK

Query:  LNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVKRLDFELVVAFWWSVWNLRNSLSWGGRSDGRDLWAYSSDYLNAF
        L +P+K+KFFL R   D LP    L KR ++ +S+C  C    E  LH  W C + K             VW  RNS+ WG       +  ++ ++ +A 
Subjt:  LNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVKRLDFELVVAFWWSVWNLRNSLSWGGRSDGRDLWAYSSDYLNAF

Query:  HVGGRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMAA
        ++    +        Q S R    VW PP A   K+ VD ++ S     G G  +R   GE FMAA
Subjt:  HVGGRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMAA

KAF4364303.1 hypothetical protein G4B88_028423 [Cannabis sativa] (e value: 3.3e-26; % identity: 30.22)
Query:  VAREGQIQSAPSLPLASTVSELFSASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHGSFSDR--------------PSSSNSDRMRAWW
        ++R G++++ P  P  + VS L   +G WN  +LRA F +     IL +        D   W     G +S R              PSSSN++ +  WW
Subjt:  VAREGQIQSAPSLPLASTVSELFSASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHGSFSDR--------------PSSSNSDRMRAWW

Query:  SALWKLNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVKRL---DFELVVAFWWSVWNLRNSLSWGGR-SDGRDLWA
         +LW+L +P K++ F+ RL    LPT  NL  R    S +C  C    E   H  + C  +K+    +F L +   W  WN RN+  +  + S    +  
Subjt:  SALWKLNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVKRL---DFELVVAFWWSVWNLRNSLSWGGR-SDGRDLWA

Query:  YSSDYLNAFHVG-GRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMAAYF
         + DYL  +     +R+          SDR +  VW PPP   LKLN DA++ S    TGGG  +R   G++  A  F
Subjt:  YSSDYLNAFHVG-GRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMAAYF

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis] (e value: 2.0e-26; % identity: 28.57)
Query:  QIQSAPSLPLASTVSELFSASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHGSFS------------DRPSSSNSDRM---RAWWSALW
        +I S P LPL++ V +LF++SG WN  +L+  F + + +  L+I +    G DCLIW++E++G +S            D+ S   S R+     +W  +W
Subjt:  QIQSAPSLPLASTVSELFSASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHGSFS------------DRPSSSNSDRM---RAWWSALW

Query:  KLNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVKRL--------------------------------DFELVVAF
         L +P+K+KFFL R   D LP    L  R ++ + +C  C   AE  LH  W C   K +                                +  L    
Subjt:  KLNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVKRL--------------------------------DFELVVAF

Query:  WWSVWNLRNSLSWGGRSDGRDLWAYSSDYLNAFHVGGRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMAAY
         W +WN RNS  + G+S+      +    L A        L+      Q S +     W PPPA   K+NVD +V S     G G  +R A+GE   A  
Subjt:  WWSVWNLRNSLSWGGRSDGRDLWAYSSDYLNAFHVGGRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMAAY

Query:  FELQRCVG
          +Q   G
Subjt:  FELQRCVG

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia] (e value: 4.3e-34; % identity: 31.21)
Query:  QIQSAPSLPLASTVSELFS-ASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHGSFSDR---------------PSSSNSDRMRAWWSAL
        +I S+P LPL S VS L     GGW   ++R  F   + + IL I +  G  ED LIW +EK G +S R               PSSS+S+ +R WW+  
Subjt:  QIQSAPSLPLASTVSELFS-ASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHGSFSDR---------------PSSSNSDRMRAWWSAL

Query:  WKLNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVKRL-------------------------DFELVVAFWWSVWN
        WK+++P+K+K FL RL  D LPT  NL KRG+ +++ C  C  + ED +HLFW C   + L                         DFE +    W +WN
Subjt:  WKLNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVKRL-------------------------DFELVVAFWWSVWN

Query:  LRNSLSWGGRSD-----GRDLWAYSSDYLNAFHVGGRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMAA
         RN+ ++   +      G +L  +++ Y   F       + G     ++++  E  +W PP     K+N DAS ++  +  G G  +    G+V  AA
Subjt:  LRNSLSWGGRSD-----GRDLWAYSSDYLNAFHVGGRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMAA

XP_030483481.1 uncharacterized protein LOC115700065 [Cannabis sativa] (e value: 1.8e-24; % identity: 26.21)
Query:  PLASTVSELFSASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHG------------SFSDRPSSSNSDRMRAWWSALWKLNVPSKLKFF
        P  + V+ L +    WN+ +L   F+  D + IL I + +    D LIW++   G            S  D   SS S+    WW   WKL +P K+K F
Subjt:  PLASTVSELFSASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHG------------SFSDRPSSSNSDRMRAWWSALWKLNVPSKLKFF

Query:  LSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVK--------------------------------RLDFELVVAFWWSVWNLRNSL
          R+FHD LP   +L +R +   S C +C +  E   H  + C   K                                +L+ E ++   WS+W  RN +
Subjt:  LSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVK--------------------------------RLDFELVVAFWWSVWNLRNSL

Query:  SWGGRSDGRDLWA-YSSDYLNAFHVGGRRYLAGDCLRLQLSDRG--ERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMA
          G ++    L A ++++YL+ +H    +Y   +    Q           W PP    LK+NVDA++ + T + G G  +R   G V  A
Subjt:  SWGGRSDGRDLWA-YSSDYLNAFHVGGRRYLAGDCLRLQLSDRG--ERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMA

TrEMBL top hits (e value, % identity, alignment)
A0A5E4FZN9 PREDICTED: retrotransposon (e value: 9.5e-27; % identity: 28.57)
Query:  QIQSAPSLPLASTVSELFSASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHGSFS------------DRPSSSNSDRM---RAWWSALW
        +I S P LPL++ V +LF++SG WN  +L+  F + + +  L+I +    G DCLIW++E++G +S            D+ S   S R+     +W  +W
Subjt:  QIQSAPSLPLASTVSELFSASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHGSFS------------DRPSSSNSDRM---RAWWSALW

Query:  KLNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVKRL--------------------------------DFELVVAF
         L +P+K+KFFL R   D LP    L  R ++ + +C  C   AE  LH  W C   K +                                +  L    
Subjt:  KLNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVKRL--------------------------------DFELVVAF

Query:  WWSVWNLRNSLSWGGRSDGRDLWAYSSDYLNAFHVGGRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMAAY
         W +WN RNS  + G+S+      +    L A        L+      Q S +     W PPPA   K+NVD +V S     G G  +R A+GE   A  
Subjt:  WWSVWNLRNSLSWGGRSDGRDLWAYSSDYLNAFHVGGRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMAAY

Query:  FELQRCVG
          +Q   G
Subjt:  FELQRCVG

A0A6J1DAR4 uncharacterized protein LOC111018954 (e value: 2.1e-34; % identity: 31.21)
Query:  QIQSAPSLPLASTVSELFS-ASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHGSFSDR---------------PSSSNSDRMRAWWSAL
        +I S+P LPL S VS L     GGW   ++R  F   + + IL I +  G  ED LIW +EK G +S R               PSSS+S+ +R WW+  
Subjt:  QIQSAPSLPLASTVSELFS-ASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHGSFSDR---------------PSSSNSDRMRAWWSAL

Query:  WKLNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVKRL-------------------------DFELVVAFWWSVWN
        WK+++P+K+K FL RL  D LPT  NL KRG+ +++ C  C  + ED +HLFW C   + L                         DFE +    W +WN
Subjt:  WKLNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVKRL-------------------------DFELVVAFWWSVWN

Query:  LRNSLSWGGRSD-----GRDLWAYSSDYLNAFHVGGRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMAA
         RN+ ++   +      G +L  +++ Y   F       + G     ++++  E  +W PP     K+N DAS ++  +  G G  +    G+V  AA
Subjt:  LRNSLSWGGRSD-----GRDLWAYSSDYLNAFHVGGRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMAA

A0A7J6F0S9 Uncharacterized protein (e value: 1.6e-26; % identity: 30.22)
Query:  VAREGQIQSAPSLPLASTVSELFSASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHGSFSDR--------------PSSSNSDRMRAWW
        ++R G++++ P  P  + VS L   +G WN  +LRA F +     IL +        D   W     G +S R              PSSSN++ +  WW
Subjt:  VAREGQIQSAPSLPLASTVSELFSASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHGSFSDR--------------PSSSNSDRMRAWW

Query:  SALWKLNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVKRL---DFELVVAFWWSVWNLRNSLSWGGR-SDGRDLWA
         +LW+L +P K++ F+ RL    LPT  NL  R    S +C  C    E   H  + C  +K+    +F L +   W  WN RN+  +  + S    +  
Subjt:  SALWKLNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVKRL---DFELVVAFWWSVWNLRNSLSWGGR-SDGRDLWA

Query:  YSSDYLNAFHVG-GRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMAAYF
         + DYL  +     +R+          SDR +  VW PPP   LKLN DA++ S    TGGG  +R   G++  A  F
Subjt:  YSSDYLNAFHVG-GRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMAAYF

A0A803PBD1 Uncharacterized protein (e value: 8.0e-26; % identity: 29.81)
Query:  PLASTVSELFSASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHGSFS--------------DRPSSSNSDRMRAWWSALWKLNVPSKLK
        P  + VS L   +G WN  +LR +F +     IL +        D   W     G +S               +PSSSN++ +  WW +LW L +P K++
Subjt:  PLASTVSELFSASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHGSFS--------------DRPSSSNSDRMRAWWSALWKLNVPSKLK

Query:  FFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVKRL---DFELVVAFWWSVWNLRNSLSWGGR-SDGRDLWAYSSDYLNAFHVG-
         F+ RL    LPT  NL  R    S +C  C    E   H  + C  +K+    +F L +   W  WN RN+  +  + S    +   + DYL  +    
Subjt:  FFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVKRL---DFELVVAFWWSVWNLRNSLSWGGR-SDGRDLWAYSSDYLNAFHVG-

Query:  GRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMAAYF
         +R+        Q +  G+  VW PPP   LKLN DA+V S    TGGG  +R   G++  A  F
Subjt:  GRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMAAYF

A0A803QQT2 Uncharacterized protein (e value: 2.5e-27; % identity: 28.1)
Query:  QIQSAPSLPLASTVSELFSASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHG------------SFSDRPSSSNSDRMRAWWSALWKLN
        ++   PSLP    V++L  A G W+E  +R+ FN +D + IL I       ED ++W++ K+G            SF+     SN   +  WW  LW+L 
Subjt:  QIQSAPSLPLASTVSELFSASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHG------------SFSDRPSSSNSDRMRAWWSALWKLN

Query:  VPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHL-HLFWNCPMVK----------------------------------RLDFELVVAF
        +P K+K F+ ++ H+ LP  VNL KRG++ S +C  C    ++ + H  W C   K                                  +L+F L+V+ 
Subjt:  VPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHL-HLFWNCPMVK----------------------------------RLDFELVVAF

Query:  WWSVWNLRNSLSWGG-RSDGRDLWAYSSDYLNAFHVGGRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMAA
         W++WN+RN++  GG      ++  +  ++L  F         GD  R +     E   W PP   ++ +NVDA V      +G G  +R A G V  AA
Subjt:  WWSVWNLRNSLSWGG-RSDGRDLWAYSSDYLNAFHVGGRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMAA

Query:  YFELQR
           LQ+
Subjt:  YFELQR

SwissProt top hits (e value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750 (e value: 3.0e-06; % identity: 30.53)
Query:  DCLIWYFEKHGSFSDRPS-------SSNSDRMRAWWSALWKLNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCP
        D L W F + G FS R +             M ++++ LWK+ VP ++K FL  + +  + T+    +R LS S++C +C    E  LH+  +CP
Subjt:  DCLIWYFEKHGSFSDRPS-------SSNSDRMRAWWSALWKLNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCP

Arabidopsis top hits (e value, % identity, alignment)
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 9.1e-06; % identity: 29.47)
Query:  EDCLIWYFEKHGS---FSDRPSS---SNSDRMRAWWSALWKLNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCP
        +D  IW  + H     FS   +S      + +  W+ A+W  N   K  F    +  + L T+  L   GLS+ ++C+LC+   E   HLF+ CP
Subjt:  EDCLIWYFEKHGS---FSDRPSS---SNSDRMRAWWSALWKLNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCP

AT2G02650.1 Ribonuclease H-like superfamily protein (e value: 1.0e-09; % identity: 22.12)
Query:  ALWKLNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVKRL-----------------------------------DF
        A+WKL+V  K+K FL R     L T   L  R +    +C  C  + E   H+ +NCP  + +                                     
Subjt:  ALWKLNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVKRL-----------------------------------DF

Query:  ELVVAFW--WSVWNLRNSLSWGGRSDGRDLWAY-----SSDYLNAFHVGGRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCE
        +  + FW  W +W  RN   +  +    D  A      ++++LNA        +      +Q S R +   W+PPP   +K N D+     +  T  G  
Subjt:  ELVVAFW--WSVWNLRNSLSWGGRSDGRDLWAY-----SSDYLNAFHVGGRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCE

Query:  LRGADGEVFMAAYFELQ
        +R  +G + +    +LQ
Subjt:  LRGADGEVFMAAYFELQ

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 1.3e-04; % identity: 24.43)
Query:  SASGGWNEAMLRAHFNESDYETILRISVRH-GLGEDCLIWYFEKHGSFSDRPSSSNSDRMR---------AWWSALWKLNVPSKLKFFLSRLFHDHLPTK
        S +G W     R+  ++     +    V H   G+D  +W   ++ + S  PS S+ D             W   +W      +        F + LPT+
Subjt:  SASGGWNEAMLRAHFNESDYETILRISVRH-GLGEDCLIWYFEKHGSFSDRPSSSNSDRMR---------AWWSALWKLNVPSKLKFFLSRLFHDHLPTK

Query:  VNLPKRGLSVSSLCVLCDEDAEDHLHLFWNC
          L   G+++ S  VLC    E H HLF+ C
Subjt:  VNLPKRGLSVSSLCVLCDEDAEDHLHLFWNC


Sequences
CDS sequence
ATGGACCAAGGAGGCAAAACCGGCAAGTGGGACGGGCCAAGATCGAAGGGGTCGGGTTTTTGGCCCGACCCCCTGCTCGGCCCGCTCGCGCTGGCCGAGTCTGTTCGATG
CCGTTTGGTCCCCACCACCTTTGGCCGCCTCGGTTTCGCCTGGTTTGACCTAAAACGCCTCCGAATCCCTAAAAACCCTAGGAGGATGAGCAGATATTTATATCCCTCTT
CGCCACTGAAGAGGGAATCCCGAATTCTATCCCTAAACTCTATTCTATATTTTCTACTCTCTCCTCTTGCTCTTACTTTTCCACTCCCTACCATTCTGTTTGCTGACTTA
AGCATCGGAGCCGGTGTGGCGAGCATCACACCGGTGTGCAAGTTTACTGTCTTGCAGGCCACGTCTTCCCCCTCATCTACAAATTTATCGTTGGTGGCACGTGAAGGTCA
AATACAGTCGGCACCATCGCTCCCACTTGCTAGTACGGTTAGTGAGCTTTTTTCTGCGTCTGGTGGGTGGAATGAGGCTATGCTTAGAGCCCACTTTAATGAGTCTGACT
ATGAGACCATTTTGAGAATCTCAGTTCGGCATGGCTTGGGGGAGGATTGCTTAATTTGGTACTTTGAGAAACATGGATCCTTTTCTGACCGTCCCTCATCCTCAAATTCT
GATAGAATGCGTGCGTGGTGGTCCGCCCTTTGGAAGCTGAATGTGCCTAGCAAGCTCAAGTTTTTTCTCTCGCGGTTGTTTCATGACCATCTGCCTACCAAGGTAAACCT
TCCCAAGCGTGGACTCAGTGTCTCTAGCCTGTGTGTCTTATGCGACGAGGATGCTGAGGACCATCTCCACCTATTCTGGAATTGTCCTATGGTTAAGAGGCTGGATTTTG
AGCTTGTGGTCGCCTTTTGGTGGTCTGTGTGGAATCTCCGGAACAGCTTGAGTTGGGGTGGCCGATCAGATGGTCGGGATTTATGGGCATACTCGAGTGACTACCTCAAT
GCCTTCCATGTTGGGGGAAGACGTTACCTTGCAGGGGATTGCCTACGGCTGCAACTGAGTGACCGGGGGGAGCGTTGTGTATGGAGTCCGCCCCCTGCTAGGAAACTGAA
GCTTAATGTTGATGCTTCGGTCATGTCTGATACAAGGGAAACGGGGGGTGGCTGTGAGCTGCGTGGGGCTGATGGTGAGGTTTTTATGGCTGCCTATTTTGAACTACAGA
GGTGTGTTGGAGTGTGGATTTGGCTGAGGGATGGGCTGTGTATGAAGGGGTCCAACTTGCTTGGCAGCTGTGGTTCGTGGAGTTTGTGGTAG
mRNA sequence
ATGGACCAAGGAGGCAAAACCGGCAAGTGGGACGGGCCAAGATCGAAGGGGTCGGGTTTTTGGCCCGACCCCCTGCTCGGCCCGCTCGCGCTGGCCGAGTCTGTTCGATG
CCGTTTGGTCCCCACCACCTTTGGCCGCCTCGGTTTCGCCTGGTTTGACCTAAAACGCCTCCGAATCCCTAAAAACCCTAGGAGGATGAGCAGATATTTATATCCCTCTT
CGCCACTGAAGAGGGAATCCCGAATTCTATCCCTAAACTCTATTCTATATTTTCTACTCTCTCCTCTTGCTCTTACTTTTCCACTCCCTACCATTCTGTTTGCTGACTTA
AGCATCGGAGCCGGTGTGGCGAGCATCACACCGGTGTGCAAGTTTACTGTCTTGCAGGCCACGTCTTCCCCCTCATCTACAAATTTATCGTTGGTGGCACGTGAAGGTCA
AATACAGTCGGCACCATCGCTCCCACTTGCTAGTACGGTTAGTGAGCTTTTTTCTGCGTCTGGTGGGTGGAATGAGGCTATGCTTAGAGCCCACTTTAATGAGTCTGACT
ATGAGACCATTTTGAGAATCTCAGTTCGGCATGGCTTGGGGGAGGATTGCTTAATTTGGTACTTTGAGAAACATGGATCCTTTTCTGACCGTCCCTCATCCTCAAATTCT
GATAGAATGCGTGCGTGGTGGTCCGCCCTTTGGAAGCTGAATGTGCCTAGCAAGCTCAAGTTTTTTCTCTCGCGGTTGTTTCATGACCATCTGCCTACCAAGGTAAACCT
TCCCAAGCGTGGACTCAGTGTCTCTAGCCTGTGTGTCTTATGCGACGAGGATGCTGAGGACCATCTCCACCTATTCTGGAATTGTCCTATGGTTAAGAGGCTGGATTTTG
AGCTTGTGGTCGCCTTTTGGTGGTCTGTGTGGAATCTCCGGAACAGCTTGAGTTGGGGTGGCCGATCAGATGGTCGGGATTTATGGGCATACTCGAGTGACTACCTCAAT
GCCTTCCATGTTGGGGGAAGACGTTACCTTGCAGGGGATTGCCTACGGCTGCAACTGAGTGACCGGGGGGAGCGTTGTGTATGGAGTCCGCCCCCTGCTAGGAAACTGAA
GCTTAATGTTGATGCTTCGGTCATGTCTGATACAAGGGAAACGGGGGGTGGCTGTGAGCTGCGTGGGGCTGATGGTGAGGTTTTTATGGCTGCCTATTTTGAACTACAGA
GGTGTGTTGGAGTGTGGATTTGGCTGAGGGATGGGCTGTGTATGAAGGGGTCCAACTTGCTTGGCAGCTGTGGTTCGTGGAGTTTGTGGTAG
Protein sequence
MDQGGKTGKWDGPRSKGSGFWPDPLLGPLALAESVRCRLVPTTFGRLGFAWFDLKRLRIPKNPRRMSRYLYPSSPLKRESRILSLNSILYFLLSPLALTFPLPTILFADL
SIGAGVASITPVCKFTVLQATSSPSSTNLSLVAREGQIQSAPSLPLASTVSELFSASGGWNEAMLRAHFNESDYETILRISVRHGLGEDCLIWYFEKHGSFSDRPSSSNS
DRMRAWWSALWKLNVPSKLKFFLSRLFHDHLPTKVNLPKRGLSVSSLCVLCDEDAEDHLHLFWNCPMVKRLDFELVVAFWWSVWNLRNSLSWGGRSDGRDLWAYSSDYLN
AFHVGGRRYLAGDCLRLQLSDRGERCVWSPPPARKLKLNVDASVMSDTRETGGGCELRGADGEVFMAAYFELQRCVGVWIWLRDGLCMKGSNLLGSCGSWSLW
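
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The snippet below is a minimal sketch that assumes the standard nuclear genetic code and a CDS that starts in frame at its first base; `translate` is a hypothetical helper, not a CuGenDB function. Because the record wraps sequences mid-codon, the helper strips whitespace and joins the lines before reading codons.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest), matching the amino-acid string below.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first. Codons containing
    characters outside T/C/A/G are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS above.
print(translate("ATGGACCAAGGAGGCAAAACCGGCAAGTGG"))  # prints MDQGGKTGKW
```

Passing the full CDS (all lines concatenated) and comparing the result with the protein sequence is a quick consistency check for the record.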