CuGenDBv2

Lag0041662 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0041662
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr13:23502813..23504211
RNA-Seq Expression: Lag0041662
Synteny: Lag0041662
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
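The genome location field can be turned into a chromosome name and a span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); the helper names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("chr13:23502813..23504211")
length = span_length(start, end)
print(chrom, start, end, length)
```

Under that assumption the gene spans 1399 bp on chr13, which is longer than the 1155-nt CDS listed in this record.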


Homology
GenBank top hits | e value | % identity | Alignment
KAF5443558.1 hypothetical protein F2P56_036105, partial [Juglans regia] | 1.9e-57 | 38.29
Query:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVVKDNDG-LWQFTGIY
        N+ GLGNP+ +R L  L+ N  P LVFL ETK+++   E  +  L       V   G SGG+ LLW+ DL V V+S S  HID +++D DG  W+FTG+Y
Subjt:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVVKDNDG-LWQFTGIY

Query:  GNPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQEM
        GNP       TW LLRRL  +  + PW++GGDFNE++  NEK+GG PRSE  M+  R  I D  L DL F GPK+TWCN       I ERLDRFL N + 
Subjt:  GNPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQEM

Query:  MDRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPRRFEESWTTYEECEDIAKRVWQ-DRRWNVGIPLMEKTKECLKRLGQWSFSNYGGS
                V+H     SDH P+    +E+ E R    K+       RFE  W   E+C  I  RVW  D        ++   K+C ++L +W+  ++G  
Subjt:  MDRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPRRFEESWTTYEECEDIAKRVWQ-DRRWNVGIPLMEKTKECLKRLGQWSFSNYGGS

Query:  IKRAIVRNE--AEVQNLSGNNPQDTECLFKAEQELEKLLEDDKIYWKQRA
         ++  +      ++Q+    +P D+  + KA +E++  LE +++ WKQR+
Subjt:  IKRAIVRNE--AEVQNLSGNNPQDTECLFKAEQELEKLLEDDKIYWKQRA

KAG2725981.1 hypothetical protein I3760_01G090600 [Carya illinoinensis] | 8.5e-58 | 37.78
Query:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVVKDNDGL-WQFTGIY
        N+ GLGNPR +R LC L+R   P ++FLMETK+ S + ER+R  + FEC   V   G  GGV LLW+++                +K++ GL W+FTG+Y
Subjt:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVVKDNDGL-WQFTGIY

Query:  GNPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQEM
        G+P  +  +ETW+LLR L+    N+PW++ GDFNE++S  EK GG PR E+LMQ  R  IDD  LIDL F G K+TWCN+      + ERLDRF+     
Subjt:  GNPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQEM

Query:  MDRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPR--RFEESWTTYEECEDIAKRVWQDRRWNVGI-PLMEKTKECLKRLGQWSFSNYG
               +V H     SDH PI+L          +    +  YR +  RFE  W   +EC D+  R WQ    N     ++++   C K L  W+   + 
Subjt:  MDRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPR--RFEESWTTYEECEDIAKRVWQDRRWNVGI-PLMEKTKECLKRLGQWSFSNYG

Query:  GSIKRAIVRNEAEVQNLSGNNPQD--TECLFKAEQELEKLLEDDKIYWKQRA
        G +K+ I R    +Q +    P     E + KA+ +L+  LE ++I W QRA
Subjt:  GSIKRAIVRNEAEVQNLSGNNPQD--TECLFKAEQELEKLLEDDKIYWKQRA

XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia] | 1.3e-61 | 44.27
Query:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVVKDNDGLWQFTGIYG
        N  GLGNP T RTL  L+R  QP+LVFL ETK       R +REL F+C + V S G SGG+MLLW  D +V ++S+S GHID ++ D  G W+FTG YG
Subjt:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVVKDNDGLWQFTGIYG

Query:  NPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQEMM
        NP       +W LL RL A   +LPWI+GGDFNEIVSM EK GGV R+E  M+                               PIWERLDRFL+N+ M+
Subjt:  NPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQEMM

Query:  DRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPRRFEESWTTYEECEDIAKRVWQDRRWNVGIPLME-KTKECLKRLGQWSFSNYGGSI
        ++C  LKV HL L+ SDHRPIL SW+ +   R   C    + R  RFEESW   + C DI    W      +GI   + K   CL RL +W+      S+
Subjt:  DRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPRRFEESWTTYEECEDIAKRVWQDRRWNVGIPLME-KTKECLKRLGQWSFSNYGGSI

Query:  KRAIVRNEAEVQNL
        K AI   E E++ L
Subjt:  KRAIVRNEAEVQNL

XP_042972796.1 uncharacterized protein LOC122304603 [Carya illinoinensis] | 1.6e-56 | 40.47
Query:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVVKDNDG--LWQFTGI
        N+ GLGNPR +R LC L+R   P ++FLMETK+ S + ERIR  + FEC   V S G  GGV LLW+ ++ ++++S S  HID  +   DG   W+FTG+
Subjt:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVVKDNDG--LWQFTGI

Query:  YGNPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQE
        YG+   +  +ETW+LLR L+    N+PW++ GDFNE++S  EK GG PR E LMQ  R  +DD  LIDL F G K+TWCN+      + ERLDRF+    
Subjt:  YGNPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQE

Query:  MMDRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPRRFEESWTTYEECEDIAKRVWQDRRWNVGI-PLMEKTKECLKRLGQWSFSNYG
              L +V H     SDH PI+L  +    +         R++  R E  W   +EC D+  R WQ    N  I  ++++   C K L  W+   +G
Subjt:  MMDRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPRRFEESWTTYEECEDIAKRVWQDRRWNVGI-PLMEKTKECLKRLGQWSFSNYG

XP_042988712.1 uncharacterized protein LOC122316247 [Carya illinoinensis] | 8.2e-61 | 37.89
Query:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVV--KDNDGLWQFTGI
        N  GLGNPR +R L  L+R   P ++FL ETK+   + E ++R L +EC   V S+G SGG+ L+WQ + ++ V+S SK HID ++   ++DG WQFTG+
Subjt:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVV--KDNDGLWQFTGI

Query:  YGNPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQE
        YG+P+ +L +ETW  +R L+    ++PW++ GDFNE++   EK+GG  R E+ M+  R+ +DD   +DL F GP FTWCNK      + ERLDR+L NQ 
Subjt:  YGNPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQE

Query:  MMDRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPRRFEESWTTYEECEDIAKRVWQDR-RWNVGIPLMEKTKECLKRLGQWSFSNYGG
         +D     +V H +   SDH PILL    +        K   R +  RFE  WT   E E+I +  W  R   N    +  +   C +RL QW+ + Y G
Subjt:  MMDRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPRRFEESWTTYEECEDIAKRVWQDR-RWNVGIPLMEKTKECLKRLGQWSFSNYGG

Query:  SIKRAIVRNEAEVQNLSGNNP--QDTECLFKAEQELEKLLEDDKIYWKQRA
        ++++ I +    +Q +   +P     E   +A  +L+  LE ++I W QRA
Subjt:  SIKRAIVRNEAEVQNLSGNNP--QDTECLFKAEQELEKLLEDDKIYWKQRA

TrEMBL top hits | e value | % identity | Alignment
A0A2N9G258 Uncharacterized protein | 7.8e-57 | 39.15
Query:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVVKD-NDGLWQFTGIY
        N  GLGNPRT++ L  L+R   P +VFL+ET       ER+R +LQFE      S+   GG+ LLW++ +++ V S    HID VV + +D  W+FTG Y
Subjt:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVVKD-NDGLWQFTGIY

Query:  GNPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQEM
        G P     +E+W LLRRL + NT LPW   GDFNE+V + EK+G   RSE+ MQ  R+ +D+   +DL FTGPKFTW N  V  D  WERLDR +   E 
Subjt:  GNPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQEM

Query:  MDRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPRRFEESWTTYEECEDIAKRVWQDRRWNVGIPLME---KTKECLKRLGQWSFSNYG
        + R    +VQHL +  SDH+P+   W      RR   K+       RFEE WT+ + CE++    W  ++   G+P+     K   C + L  WS  ++ 
Subjt:  MDRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPRRFEESWTTYEECEDIAKRVWQDRRWNVGIPLME---KTKECLKRLGQWSFSNYG

Query:  GSIKRAI--VRN---EAEVQNLSGNNPQDTECLFKAEQELEKLLEDDKIYWKQRA
        G+IK  I  V N   +AE  ++ G    +   +F  ++EL  LL  ++  W+QR+
Subjt:  GSIKRAI--VRN---EAEVQNLSGNNPQDTECLFKAEQELEKLLEDDKIYWKQRA

A0A2N9G656 Reverse transcriptase domain-containing protein | 2.4e-58 | 34.84
Query:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVVKDND-GLWQFTGIY
        N  GLGN RT++ L  ++R+  P++VFL+ET     R E +R +L+F   + V +KG  GG+ L WQ ++++++RS S  HID ++ + D   W+FTG Y
Subjt:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVVKDND-GLWQFTGIY

Query:  GNPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQEM
        G P+ +  +E+W LLR L     +LPW+  GDFNEI    EK+G +PR EK M+  RE +D+ EL+DL + G  +TWCN  +    +W RLDR + + + 
Subjt:  GNPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQEM

Query:  MDRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPRRFEESWTTYEECEDIAKRVWQDRRWNVGIP-LMEKTKECLKRLGQWSFSNYGGS
        +++    +VQHL    SDH P+L+S+N      ++   ++   +P RFE+ WT    C +     W+      G+P L++K K C + L  WS + + GS
Subjt:  MDRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPRRFEESWTTYEECEDIAKRVWQDRRWNVGIP-LMEKTKECLKRLGQWSFSNYGGS

Query:  IKRAIVR-----NEAEVQNLSGNNPQDTECLFKAEQELEKLLEDDKIYWKQRA
        ++R +        +AE+ ++ G +    + L   ++E+ +L++ D+  W+QR+
Subjt:  IKRAIVR-----NEAEVQNLSGNNPQDTECLFKAEQELEKLLEDDKIYWKQRA

A0A2N9HLP3 Uncharacterized protein | 2.7e-57 | 38.35
Query:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVVKDN-DGLWQFTGIY
        N  GLGNPRT++ L  L+    P +VFL+E        ER+R +LQF+      S+   GG+ LLW+  +++ + S S  HID VV DN    W+FTG Y
Subjt:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVVKDN-DGLWQFTGIY

Query:  GNPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQEM
        G P     +E+WALLRRL + +T LPW   GDFNE+V + EK+G   RSE+ MQ  R+ +DD   +DL F GPKFTW N  +  D  WERLDR +   E 
Subjt:  GNPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQEM

Query:  MDRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPRRFEESWTTYEECEDIAKRVWQDRRWNVGIPLM---EKTKECLKRLGQWSFSNYG
        + R    +V HL +  SDH+P+ +S N       M C++R   +P RFEE WT+ + CE+     W  ++   G+P+    EK  +C + L  WS  N+ 
Subjt:  MDRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPRRFEESWTTYEECEDIAKRVWQDRRWNVGIPLM---EKTKECLKRLGQWSFSNYG

Query:  GSIKRAIVRNEAEVQNLSGNNPQDTE--CLFKAEQELEKLLEDDKIYWKQRA
        G+IK  I   E  ++     + Q  +   ++   +EL  LL  ++  W+QR+
Subjt:  GSIKRAIVRNEAEVQNLSGNNPQDTE--CLFKAEQELEKLLEDDKIYWKQRA

A0A2N9J7Z5 Reverse transcriptase domain-containing protein | 4.1e-58 | 34.84
Query:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVVKDND-GLWQFTGIY
        N  GLGN RT++ L  ++R+  P++VFL+ET     R E +R +L+F   + V +KG  GG+ L WQ ++++++RS S  HID ++ + D   W+FTG Y
Subjt:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVVKDND-GLWQFTGIY

Query:  GNPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQEM
        G P+ +  +E+W LLR L     +LPW+  GDFNEI    EK+G +PR EK M+  RE +D+ EL+DL + G  +TWCN  +    +W RLDR + + + 
Subjt:  GNPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQEM

Query:  MDRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPRRFEESWTTYEECEDIAKRVWQDRRWNVGIP-LMEKTKECLKRLGQWSFSNYGGS
        +++    +VQHL    SDH P+L+S+N      ++   ++   +P RFE+ WT    C +     W+      G+P L++K K C + L  WS + + GS
Subjt:  MDRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPRRFEESWTTYEECEDIAKRVWQDRRWNVGIP-LMEKTKECLKRLGQWSFSNYGGS

Query:  IKRAIVR-----NEAEVQNLSGNNPQDTECLFKAEQELEKLLEDDKIYWKQRA
        ++R +        +AE+ ++ G +    + L   ++E+ +L++ D+  W+QR+
Subjt:  IKRAIVR-----NEAEVQNLSGNNPQDTECLFKAEQELEKLLEDDKIYWKQRA

A0A6J1DUG8 uncharacterized protein LOC111024135 | 6.2e-62 | 44.27
Query:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVVKDNDGLWQFTGIYG
        N  GLGNP T RTL  L+R  QP+LVFL ETK       R +REL F+C + V S G SGG+MLLW  D +V ++S+S GHID ++ D  G W+FTG YG
Subjt:  NTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDVTVRSLSKGHIDVVVKDNDGLWQFTGIYG

Query:  NPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQEMM
        NP       +W LL RL A   +LPWI+GGDFNEIVSM EK GGV R+E  M+                               PIWERLDRFL+N+ M+
Subjt:  NPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLNQEMM

Query:  DRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPRRFEESWTTYEECEDIAKRVWQDRRWNVGIPLME-KTKECLKRLGQWSFSNYGGSI
        ++C  LKV HL L+ SDHRPIL SW+ +   R   C    + R  RFEESW   + C DI    W      +GI   + K   CL RL +W+      S+
Subjt:  DRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPRRFEESWTTYEECEDIAKRVWQDRRWNVGIPLME-KTKECLKRLGQWSFSNYGGSI

Query:  KRAIVRNEAEVQNL
        K AI   E E++ L
Subjt:  KRAIVRNEAEVQNL

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G40390.1 DNAse I-like superfamily protein | 3.5e-09 | 34.41
Query:  KETWALLRRLKAAN--TNLPWILGGDFNEIVSMNEKKGGVPRSEKL--MQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLN
        +  W  + RL A++   N PW++ GDFN+I S+ E    +P +  L  +++L+  + D +L+DL   G  +TW N H   +PI  +LDR ++N
Subjt:  KETWALLRRLKAAN--TNLPWILGGDFNEIVSMNEKKGGVPRSEKL--MQELRETIDDYELIDLWFTGPKFTWCNKHVTHDPIWERLDRFLLN


Sequences
CDS sequence
ATGGACGTTGATTTTGGGGGGGATGGATTAGAATTGGGAATTCAACCGAACAAGGAAGATACTTGTGTGAATATTAAAGGTAAGGAGACAATTGTGGAATCTGAAGGCCA
AAAAAACACTGAGGGCTTGGGAAACCCTAGGACATTAAGGACGTTGTGTTACCTCTTACGCAACCACCAACCCCGTTTGGTTTTCTTGATGGAAACCAAGATACAAAGTT
TAAGAGCAGAGAGAATTCGAAGAGAGTTACAGTTTGAATGCGGAATAGATGTGCCTAGCAAAGGCCTGAGCGGAGGAGTCATGCTTCTATGGCAAAGAGACTTGGATGTG
ACTGTTAGATCTTTGTCAAAAGGCCACATTGACGTGGTAGTCAAAGATAATGATGGGTTGTGGCAGTTTACGGGGATTTATGGAAATCCAAATAGGGATCTTCATAAGGA
AACGTGGGCACTGTTAAGAAGACTGAAAGCAGCAAATACTAATCTTCCATGGATTTTGGGAGGTGATTTCAATGAAATTGTTAGTATGAATGAAAAGAAGGGTGGAGTGC
CTAGGTCAGAAAAGCTAATGCAAGAATTACGGGAGACGATAGATGACTATGAGTTGATTGATCTCTGGTTCACTGGTCCCAAGTTCACATGGTGTAACAAGCATGTAACC
CATGACCCAATTTGGGAGAGACTTGACCGATTTTTGCTGAACCAGGAGATGATGGATCGTTGTAGTTTGTTGAAAGTGCAACACTTAGCTCTTATTGGTTCAGACCATAG
ACCTATTTTATTGAGTTGGAATGAGGATTGTGAGGATAGGAGGATGGATTGCAAAATGAGAGGAAGGTATCGGCCAAGAAGATTTGAGGAATCCTGGACTACATATGAAG
AATGTGAAGATATTGCTAAAAGGGTGTGGCAAGACAGAAGGTGGAATGTTGGTATTCCCTTGATGGAGAAGACTAAGGAGTGTTTGAAAAGATTGGGTCAGTGGAGCTTT
TCCAATTATGGTGGCTCAATTAAAAGGGCTATTGTGAGAAATGAGGCTGAAGTTCAGAACCTTAGTGGCAACAATCCTCAAGATACAGAATGTTTGTTCAAAGCCGAGCA
GGAGTTAGAGAAGCTTCTAGAGGATGATAAAATATATTGGAAGCAAAGAGCCTAG
mRNA sequence
ATGGACGTTGATTTTGGGGGGGATGGATTAGAATTGGGAATTCAACCGAACAAGGAAGATACTTGTGTGAATATTAAAGGTAAGGAGACAATTGTGGAATCTGAAGGCCA
AAAAAACACTGAGGGCTTGGGAAACCCTAGGACATTAAGGACGTTGTGTTACCTCTTACGCAACCACCAACCCCGTTTGGTTTTCTTGATGGAAACCAAGATACAAAGTT
TAAGAGCAGAGAGAATTCGAAGAGAGTTACAGTTTGAATGCGGAATAGATGTGCCTAGCAAAGGCCTGAGCGGAGGAGTCATGCTTCTATGGCAAAGAGACTTGGATGTG
ACTGTTAGATCTTTGTCAAAAGGCCACATTGACGTGGTAGTCAAAGATAATGATGGGTTGTGGCAGTTTACGGGGATTTATGGAAATCCAAATAGGGATCTTCATAAGGA
AACGTGGGCACTGTTAAGAAGACTGAAAGCAGCAAATACTAATCTTCCATGGATTTTGGGAGGTGATTTCAATGAAATTGTTAGTATGAATGAAAAGAAGGGTGGAGTGC
CTAGGTCAGAAAAGCTAATGCAAGAATTACGGGAGACGATAGATGACTATGAGTTGATTGATCTCTGGTTCACTGGTCCCAAGTTCACATGGTGTAACAAGCATGTAACC
CATGACCCAATTTGGGAGAGACTTGACCGATTTTTGCTGAACCAGGAGATGATGGATCGTTGTAGTTTGTTGAAAGTGCAACACTTAGCTCTTATTGGTTCAGACCATAG
ACCTATTTTATTGAGTTGGAATGAGGATTGTGAGGATAGGAGGATGGATTGCAAAATGAGAGGAAGGTATCGGCCAAGAAGATTTGAGGAATCCTGGACTACATATGAAG
AATGTGAAGATATTGCTAAAAGGGTGTGGCAAGACAGAAGGTGGAATGTTGGTATTCCCTTGATGGAGAAGACTAAGGAGTGTTTGAAAAGATTGGGTCAGTGGAGCTTT
TCCAATTATGGTGGCTCAATTAAAAGGGCTATTGTGAGAAATGAGGCTGAAGTTCAGAACCTTAGTGGCAACAATCCTCAAGATACAGAATGTTTGTTCAAAGCCGAGCA
GGAGTTAGAGAAGCTTCTAGAGGATGATAAAATATATTGGAAGCAAAGAGCCTAG
Protein sequence
MDVDFGGDGLELGIQPNKEDTCVNIKGKETIVESEGQKNTEGLGNPRTLRTLCYLLRNHQPRLVFLMETKIQSLRAERIRRELQFECGIDVPSKGLSGGVMLLWQRDLDV
TVRSLSKGHIDVVVKDNDGLWQFTGIYGNPNRDLHKETWALLRRLKAANTNLPWILGGDFNEIVSMNEKKGGVPRSEKLMQELRETIDDYELIDLWFTGPKFTWCNKHVT
HDPIWERLDRFLLNQEMMDRCSLLKVQHLALIGSDHRPILLSWNEDCEDRRMDCKMRGRYRPRRFEESWTTYEECEDIAKRVWQDRRWNVGIPLMEKTKECLKRLGQWSF
SNYGGSIKRAIVRNEAEVQNLSGNNPQDTECLFKAEQELEKLLEDDKIYWKQRA