CuGenDBv2

Lag0032360 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0032360
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr11:31240972..31242195
RNA-Seq Expression: Lag0032360
Synteny: Lag0032360
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: IPR005135 - Endonuclease/exonuclease/phosphatase
                  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
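The genome location above is given as a chromosome name followed by a start..end range. As a minimal sketch, the snippet below splits that string and computes the length of the span. It assumes the coordinates are 1-based and inclusive, which this record does not state; the helper names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr11:31240972..31242195")
print(chrom, start, end, span_length(start, end))
```

If the coordinates turned out to be 0-based or half-open, only `span_length` would need to change.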


Homology
GenBank top hits (e value, % identity, alignment)
KAF5458478.1 hypothetical protein F2P56_022503 [Juglans regia]  e value: 1.8e-54  % identity: 38.48
Query:  SWNVRGLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHIDATFKTGLE--WWRFT
        SWN RGL NPRT   LH  VK +NP L+FL ETKC        +  L FD+ FVV + G SGG A +W     V + ++S +HI    K G+    W  T
Subjt:  SWNVRGLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHIDATFKTGLE--WWRFT

Query:  GFYEKPIVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLDRFLI
        GFY       RK SW+LLR L  +  + + W+  GDFNEI+ +NEK G  ++    M+DF E ++ C L D G++G KFTW   +   +  KE LDR L 
Subjt:  GFYEKPIVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLDRFLI

Query:  NFSLQVKAKGIKVRHLSFHSSYHRPIVASISVNEVGPKILKRPNICIFEEAWSKERARSHNSL---------LSSCNWDN----LRKVEKELDDLLEEEE
        N       K   V     HSS HRPI+  +S N     + KR     +E  W+         L         L S N  N    LR+++ ++D LL+EE+
Subjt:  NFSLQVKAKGIKVRHLSFHSSYHRPIVASISVNEVGPKILKRPNICIFEEAWSKERARSHNSL---------LSSCNWDN----LRKVEKELDDLLEEEE

Query:  RYWKQRSNEDWIRWGDRNTKWFHTKASQREKRNRIDKIRDRDG
          WKQR+   W++ GDRN+K+FH  A+QR K N I  ++D  G
Subjt:  RYWKQRSNEDWIRWGDRNTKWFHTKASQREKRNRIDKIRDRDG

XP_018853114.2 uncharacterized protein LOC109015083 [Juglans regia]  e value: 1.8e-54  % identity: 38.48
Query:  SWNVRGLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHIDATFKTGLE--WWRFT
        SWN RGL NPRT   LH  VK +NP L+FL ETKC        +  L FD+ FVV + G SGG A +W     V + ++S +HI    K G+    W  T
Subjt:  SWNVRGLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHIDATFKTGLE--WWRFT

Query:  GFYEKPIVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLDRFLI
        GFY       RK SW+LLR L  +  + + W+  GDFNEI+ +NEK G  ++    M+DF E ++ C L D G++G KFTW   +   +  KE LDR L 
Subjt:  GFYEKPIVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLDRFLI

Query:  NFSLQVKAKGIKVRHLSFHSSYHRPIVASISVNEVGPKILKRPNICIFEEAWSKERARSHNSL---------LSSCNWDN----LRKVEKELDDLLEEEE
        N       K   V     HSS HRPI+  +S N     + KR     +E  W+         L         L S N  N    LR+++ ++D LL+EE+
Subjt:  NFSLQVKAKGIKVRHLSFHSSYHRPIVASISVNEVGPKILKRPNICIFEEAWSKERARSHNSL---------LSSCNWDN----LRKVEKELDDLLEEEE

Query:  RYWKQRSNEDWIRWGDRNTKWFHTKASQREKRNRIDKIRDRDG
          WKQR+   W++ GDRN+K+FH  A+QR K N I  ++D  G
Subjt:  RYWKQRSNEDWIRWGDRNTKWFHTKASQREKRNRIDKIRDRDG

XP_028068804.1 uncharacterized protein LOC114271378 [Camellia sinensis]  e value: 1.8e-54  % identity: 38.61
Query:  MEIDSWNVRGLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHIDATFKT--GLEW
        M+  SWN RGL NPRT   L   +K++ P +VFL ETKC        +C L       V  +G SGG AL W     V I+++S  H+DA  ++  G+  
Subjt:  MEIDSWNVRGLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHIDATFKT--GLEW

Query:  WRFTGFYEKPIVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLD
        WRFTGFY  P V K+ DSW+LLRRL  LD   LPW++  DFNEIL  +EK G G++  + +D F   +  C+L D GF G  FTW   +  S ++ E LD
Subjt:  WRFTGFYEKPIVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLD

Query:  RFLINFSLQVKAKGIKVRHLSFHSSYHRPIVASISVNEVGPKILKRPNICIFEEAWSK---ERARSHNSLLSSCNWDNLRKVEKELDDLLEEEERYWKQR
        R + N +        +V HL+  +S H PI+    V+ +GP++L           WSK    R     S+LS   +    ++  E+D+LLE EE  W QR
Subjt:  RFLINFSLQVKAKGIKVRHLSFHSSYHRPIVASISVNEVGPKILKRPNICIFEEAWSK---ERARSHNSLLSSCNWDNLRKVEKELDDLLEEEERYWKQR

Query:  SNEDWIRWGDRNTKWFHTKASQREKRNRIDKIRDRDGNWVEGDMRLVELRQTTSGSFSNL
        +  +W++ GDRNT +FH+KA QR K+ RID ++D    W      L +L +   G F+ L
Subjt:  SNEDWIRWGDRNTKWFHTKASQREKRNRIDKIRDRDGNWVEGDMRLVELRQTTSGSFSNL

XP_030924992.1 uncharacterized protein LOC115952038 [Quercus lobata]  e value: 2.4e-54  % identity: 34.84
Query:  SWNVRGLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHIDATFKTGL-EWWRFTG
        SWN RGL   R    L   V+ + P+LVFL ETK + S    L+C L FD  F+VP    SGG AL W +   + I++FS  HIDA     + + WRFTG
Subjt:  SWNVRGLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHIDATFKTGL-EWWRFTG

Query:  FYEKPIVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLDRFLIN
        FY  P    R+DSW LLR L    Q  LPW+  GDFNEI    EKSGG  +    M  F + +D+C   D GF G  FTW   +    ++   LDR + +
Subjt:  FYEKPIVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLDRFLIN

Query:  FSLQVKAKGIKVRHLSFHSSYHRPIVASISVNEVGPKILKRPNICIFEEAWSKER-----------------------------------ARSHNSLLSS
            +    I++ HLS  SS H+PI   +  ++V  +  +      FEE W+K+                                    A++    +S 
Subjt:  FSLQVKAKGIKVRHLSFHSSYHRPIVASISVNEVGPKILKRPNICIFEEAWSKER-----------------------------------ARSHNSLLSS

Query:  CNWDNLRKVEKELDDLLEEEERYWKQRSNEDWIRWGDRNTKWFHTKASQREKRNRIDKIRDRDGNWVEGDMRLVEL
             ++ + +E+++LL+ EE  W QR+  DW+R+GD+N+K+FH +A++R K+N I  + D  GNWVEG+  + +L
Subjt:  CNWDNLRKVEKELDDLLEEEERYWKQRSNEDWIRWGDRNTKWFHTKASQREKRNRIDKIRDRDGNWVEGDMRLVEL

XP_042944517.1 uncharacterized protein LOC122278389 [Carya illinoinensis]  e value: 8.2e-55  % identity: 38.79
Query:  MEIDSWNVRGLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHID--ATFKTGLEW
        M I SWN RG+ NPRT   LH  VK++ P +VFL+ETKC       ++  + FD  FVV SVG SGG A++W +  QV + S+S SHI    +   G + 
Subjt:  MEIDSWNVRGLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHID--ATFKTGLEW

Query:  WRFTGFYEKPIVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLD
           TGFY  P+VEKRK SW LLRRL    + ++ W+  GDFNE+L + E+ GGG +  S M  F E +D C L D G+EGSKFTW   +  +  IKE LD
Subjt:  WRFTGFYEKPIVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLD

Query:  RFLINFSLQVKAKGIKVRHLSFHSSYHRPIVASISVNEVGPKILKRPNICIFEEAWSKERARSHNSLLSSCNWDNLRKVEKELDDLLEEEERYWKQRSNE
        R          A G    HLSFH +  +    S+   +      KR    I +E     +   +         D +++++ E+DDLL+EEE  W+QRS +
Subjt:  RFLINFSLQVKAKGIKVRHLSFHSSYHRPIVASISVNEVGPKILKRPNICIFEEAWSKERARSHNSLLSSCNWDNLRKVEKELDDLLEEEERYWKQRSNE

Query:  DWIRWGDRNTKWFHTKASQREKRNRIDKIRDRDGNWVEGDMRLVELRQ
         W++ GDRN+K+FH  A+QR + N I  + +  G        +  L Q
Subjt:  DWIRWGDRNTKWFHTKASQREKRNRIDKIRDRDGNWVEGDMRLVELRQ

TrEMBL top hits (e value, % identity, alignment)
A0A2I4HAC4 uncharacterized protein LOC109015083  e value: 8.8e-55  % identity: 38.48
Query:  SWNVRGLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHIDATFKTGLE--WWRFT
        SWN RGL NPRT   LH  VK +NP L+FL ETKC        +  L FD+ FVV + G SGG A +W     V + ++S +HI    K G+    W  T
Subjt:  SWNVRGLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHIDATFKTGLE--WWRFT

Query:  GFYEKPIVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLDRFLI
        GFY       RK SW+LLR L  +  + + W+  GDFNEI+ +NEK G  ++    M+DF E ++ C L D G++G KFTW   +   +  KE LDR L 
Subjt:  GFYEKPIVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLDRFLI

Query:  NFSLQVKAKGIKVRHLSFHSSYHRPIVASISVNEVGPKILKRPNICIFEEAWSKERARSHNSL---------LSSCNWDN----LRKVEKELDDLLEEEE
        N       K   V     HSS HRPI+  +S N     + KR     +E  W+         L         L S N  N    LR+++ ++D LL+EE+
Subjt:  NFSLQVKAKGIKVRHLSFHSSYHRPIVASISVNEVGPKILKRPNICIFEEAWSKERARSHNSL---------LSSCNWDN----LRKVEKELDDLLEEEE

Query:  RYWKQRSNEDWIRWGDRNTKWFHTKASQREKRNRIDKIRDRDG
          WKQR+   W++ GDRN+K+FH  A+QR K N I  ++D  G
Subjt:  RYWKQRSNEDWIRWGDRNTKWFHTKASQREKRNRIDKIRDRDG

A0A2N9H936 Uncharacterized protein  e value: 8.8e-55  % identity: 39.83
Query:  WNVRGLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHIDATFKTGLEW-WRFTGF
        WN RGL NP+T H L   V++++P ++FL+ETK        L+C   F   FVVPS G SGG A+ W S   V I S+S  HIDA      E  WRFTGF
Subjt:  WNVRGLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHIDATFKTGLEW-WRFTGF

Query:  YEKPIVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLDRFLINF
        Y  P V  +  +W LLR L       LPW+ GGDFNEIL + EK G   +  S M  F  VVD C  +D GF GS +TW+  +     + E LDR L   
Subjt:  YEKPIVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLDRFLINF

Query:  SLQVKAKGIKVRHLSFHSSYHRPIVASISVNEVGPKILKRPNICIFEEAW---------------SKERARSHNSLLSSCNWDN-----LRKVEKELDDL
           +K    +V HL    S HRP+   +S+   G K   R     FEE W               S+ R      L      +N     ++K+ +EL DL
Subjt:  SLQVKAKGIKVRHLSFHSSYHRPIVASISVNEVGPKILKRPNICIFEEAW---------------SKERARSHNSLLSSCNWDN-----LRKVEKELDDL

Query:  LEEEERYWKQRSNEDWIRWGDRNTKWFHTKASQREKRNRIDKIRDRDGNWVEGD
          +EER WKQRS   W++ GD+NTK+FH +A+ R++RN I  IRDR G W   D
Subjt:  LEEEERYWKQRSNEDWIRWGDRNTKWFHTKASQREKRNRIDKIRDRDGNWVEGD

A0A803P9R9 Uncharacterized protein  e value: 1.4e-55  % identity: 36.75
Query:  GLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHIDATFKTGLE-WWRFTGFYEKP
        GL NP T   L   VK  +P ++FLAET+ + +    ++  L FD+CFVV + G SGG ALLW  S +V I SF++SHIDA  + GL  +WRFTGFY  P
Subjt:  GLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHIDATFKTGLE-WWRFTGFYEKP

Query:  IVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLDRFLINFSLQV
            RK SW L+ RL D+ Q   PWI GGDFNEI+   EK GG  K +S + +F + + YCN  +   EG +FTW  G+  + ++ E LDR   N     
Subjt:  IVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLDRFLINFSLQV

Query:  KAKGIKVRHLSFHSSYHRPIVASISVNEVGPKI-LKRPNICIFEEAWS----------------------------------------------------
        K    KV  L + +S HRP++ + S      K  LK  +   +E+AW+                                                    
Subjt:  KAKGIKVRHLSFHSSYHRPIVASISVNEVGPKI-LKRPNICIFEEAWS----------------------------------------------------

Query:  --KERARSHNSLLSSCNWDNLRKVEKELDDLLEEEERYWKQRSNEDWIRWGDRNTKWFHTKASQREKRNRIDKIRDRDGNW
          KE  +  +S     +W N R+VEK+L+    +EE  WKQRS   W+  GDRNTK+FH KASQR+K+N+I+ + D +  W
Subjt:  --KERARSHNSLLSSCNWDNLRKVEKELDDLLEEEERYWKQRSNEDWIRWGDRNTKWFHTKASQREKRNRIDKIRDRDGNW

A0A803PC18 Uncharacterized protein  e value: 3.2e-57  % identity: 40
Query:  GLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHIDATFKTGLEW-WRFTGFYEKP
        GL NP T   L   VK  +P ++FL ET+ + +    ++  + FD+CFVV + G SGG ALLW    +V IKSF++SHIDA  +  L + WRFTGFY  P
Subjt:  GLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHIDATFKTGLEW-WRFTGFYEKP

Query:  IVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLDRFLINFSLQV
            RK SW+LL RL D+      WI GGDFNEI+ ++EK GG  K+ S M DF + + YCN  +   EG  FTW  G+  + ++ E LDR L N     
Subjt:  IVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLDRFLINFSLQV

Query:  KAKGIKVRHLSFHSSYHRP-IVASISVNEVGPKILKRPNICIFEEAWSKERARS---HNSLLSSCNWDNLRKVEKELDDLLEEEERYWKQRSNEDWIRWG
        +     V  LS+ +S HRP I+ +  V  +  K  +  +   +E+AW++E        +  L+S NW + + + + ++    EEE  WKQRS   W+  G
Subjt:  KAKGIKVRHLSFHSSYHRP-IVASISVNEVGPKILKRPNICIFEEAWSKERARS---HNSLLSSCNWDNLRKVEKELDDLLEEEERYWKQRSNEDWIRWG

Query:  DRNTKWFHTKASQREKRNRIDKIRDRDGNWVEGDMRLVEL
        DRNTK FH KASQR+K+N I  + D    W + D  +VE+
Subjt:  DRNTKWFHTKASQREKRNRIDKIRDRDGNWVEGDMRLVEL

A0A803PMD0 Uncharacterized protein  e value: 5.5e-57  % identity: 35.8
Query:  GLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHIDATFKTGLEW-WRFTGFYEKP
        GL NP T   L   VK  +P ++FL+ET+ +      ++  L F+ CFVV + G SGG ALLW    +V++KSF++SHIDA  + GL + WRFTGFY  P
Subjt:  GLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHIDATFKTGLEW-WRFTGFYEKP

Query:  IVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLDRFLINFSLQV
            RK+SW L+ RL DL Q    W+ GGDFNEI+ SNEK GG  K   LM DF + + YCN  +   EG +FTW  G+  S ++ E LDR L N +   
Subjt:  IVEKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLDRFLINFSLQV

Query:  KAKGIKVRHLSFHSSYHRPIVASISVN----EVGPKILKRPNICIFEEAWSKE-----------------------------------------------
        +    KV  L + +S HRP++ + S N    +  PK   R +   +E+AW++E                                               
Subjt:  KAKGIKVRHLSFHSSYHRPIVASISVN----EVGPKILKRPNICIFEEAWSKE-----------------------------------------------

Query:  ---RARSHNSLLSS----CNWDNLRKVEKELDDLLEEEERYWKQRSNEDWIRWGDRNTKWFHTKASQREKRNRIDKIRDRDGNWVEGDMRLVELRQTTSG
           + +    ++SS     +W   R++E++L+   E++E  WKQRS   W+  GDRNTK+FH KASQR+K+N I  + D    W + D   +E+      
Subjt:  ---RARSHNSLLSS----CNWDNLRKVEKELDDLLEEEERYWKQRSNEDWIRWGDRNTKWFHTKASQREKRNRIDKIRDRDGNWVEGDMRLVELRQTTSG

Query:  SFSNL
         FSNL
Subjt:  SFSNL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGAAATCGACAGTTGGAACGTCCGGGGTTTAAGGAATCCAAGGACGTTCCATGCCCTTCACCATGAGGTGAAAAAGGAGAATCCTCATCTAGTGTTCTTAGCAGAAAC
TAAATGTGACCATAGTATTGATGTTAACCTAAAGTGTGACCTGAATTTTGATGCTTGTTTCGTGGTCCCGAGTGTAGGGTGTAGTGGGGGGTTTGCCCTCCTGTGGAATT
CTAGTTCTCAAGTCCAAATTAAGTCTTTCTCTCTGAGTCATATTGATGCTACCTTTAAAACCGGTCTTGAATGGTGGAGGTTTACAGGTTTTTACGAGAAACCGATTGTT
GAAAAGAGGAAAGATTCTTGGAAATTACTTAGGAGACTCTTCGACTTGGATCAGGCTAAGCTTCCGTGGATTATTGGGGGAGATTTCAATGAAATTCTCTACAGTAATGA
AAAATCAGGGGGGGGCGAGAAGAGATCCTCTCTTATGGACGATTTCCATGAAGTGGTGGATTATTGTAATCTTATGGACCCAGGGTTTGAAGGCAGCAAGTTCACCTGGT
ACAGAGGAAAGAATGCTTCCAAGATAATTAAGGAAATGCTGGATAGATTCCTTATCAATTTCAGTTTGCAAGTCAAGGCTAAAGGGATTAAAGTCAGACACTTAAGTTTC
CATTCCTCATACCACAGGCCAATTGTGGCCAGCATCTCCGTTAATGAAGTGGGGCCCAAGATCCTTAAGAGACCAAATATATGCATATTCGAGGAAGCTTGGTCTAAAGA
AAGAGCAAGAAGTCATAATTCTCTCCTCAGCAGTTGTAATTGGGATAACCTCCGTAAGGTAGAAAAAGAGCTTGATGACTTATTAGAAGAGGAAGAGAGATATTGGAAGC
AGCGTTCGAATGAAGACTGGATTAGATGGGGTGATAGGAACACGAAGTGGTTCCACACAAAGGCTTCGCAGAGGGAAAAAAGGAACCGAATTGATAAGATCAGAGATAGG
GACGGCAATTGGGTAGAAGGCGATATGAGATTGGTAGAGTTGCGGCAGACTACTTCAGGGAGCTTTTCAAATCTTTGA
mRNA sequence
ATGGAAATCGACAGTTGGAACGTCCGGGGTTTAAGGAATCCAAGGACGTTCCATGCCCTTCACCATGAGGTGAAAAAGGAGAATCCTCATCTAGTGTTCTTAGCAGAAAC
TAAATGTGACCATAGTATTGATGTTAACCTAAAGTGTGACCTGAATTTTGATGCTTGTTTCGTGGTCCCGAGTGTAGGGTGTAGTGGGGGGTTTGCCCTCCTGTGGAATT
CTAGTTCTCAAGTCCAAATTAAGTCTTTCTCTCTGAGTCATATTGATGCTACCTTTAAAACCGGTCTTGAATGGTGGAGGTTTACAGGTTTTTACGAGAAACCGATTGTT
GAAAAGAGGAAAGATTCTTGGAAATTACTTAGGAGACTCTTCGACTTGGATCAGGCTAAGCTTCCGTGGATTATTGGGGGAGATTTCAATGAAATTCTCTACAGTAATGA
AAAATCAGGGGGGGGCGAGAAGAGATCCTCTCTTATGGACGATTTCCATGAAGTGGTGGATTATTGTAATCTTATGGACCCAGGGTTTGAAGGCAGCAAGTTCACCTGGT
ACAGAGGAAAGAATGCTTCCAAGATAATTAAGGAAATGCTGGATAGATTCCTTATCAATTTCAGTTTGCAAGTCAAGGCTAAAGGGATTAAAGTCAGACACTTAAGTTTC
CATTCCTCATACCACAGGCCAATTGTGGCCAGCATCTCCGTTAATGAAGTGGGGCCCAAGATCCTTAAGAGACCAAATATATGCATATTCGAGGAAGCTTGGTCTAAAGA
AAGAGCAAGAAGTCATAATTCTCTCCTCAGCAGTTGTAATTGGGATAACCTCCGTAAGGTAGAAAAAGAGCTTGATGACTTATTAGAAGAGGAAGAGAGATATTGGAAGC
AGCGTTCGAATGAAGACTGGATTAGATGGGGTGATAGGAACACGAAGTGGTTCCACACAAAGGCTTCGCAGAGGGAAAAAAGGAACCGAATTGATAAGATCAGAGATAGG
GACGGCAATTGGGTAGAAGGCGATATGAGATTGGTAGAGTTGCGGCAGACTACTTCAGGGAGCTTTTCAAATCTTTGA
Protein sequence
MEIDSWNVRGLRNPRTFHALHHEVKKENPHLVFLAETKCDHSIDVNLKCDLNFDACFVVPSVGCSGGFALLWNSSSQVQIKSFSLSHIDATFKTGLEWWRFTGFYEKPIV
EKRKDSWKLLRRLFDLDQAKLPWIIGGDFNEILYSNEKSGGGEKRSSLMDDFHEVVDYCNLMDPGFEGSKFTWYRGKNASKIIKEMLDRFLINFSLQVKAKGIKVRHLSF
HSSYHRPIVASISVNEVGPKILKRPNICIFEEAWSKERARSHNSLLSSCNWDNLRKVEKELDDLLEEEERYWKQRSNEDWIRWGDRNTKWFHTKASQREKRNRIDKIRDR
DGNWVEGDMRLVELRQTTSGSFSNL
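
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly; `translate` is a hypothetical helper written for illustration, not a function from any particular library. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. Using this table is an assumption.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS above.
print(translate("ATGGAAATCGACAGTTGGAACGTC"))
# Last four codons of the CDS above, including the terminal TGA.
print(translate("TCAAATCTTTGA"))
```

Pasting the full CDS into `translate` and comparing the result with the listed protein sequence is a quick consistency check on the record.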