CuGenDBv2

Lag0022137 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022137
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr7:19334089..19335237
RNA-Seq Expression: Lag0022137
Synteny: Lag0022137
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
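
The genome location above can be turned into a feature length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("chr7:19334089..19335237")
print(seqid, span_length(start, end))  # chr7 1149
```

Under that assumption the gene spans 1149 bp on chr7.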


Homology
GenBank top hits    e value    % identity    Alignment
GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa]    2.6e-51    43.64
Query:  PAPQQFPTPQFQPTQNFFFPNP------------YPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYN
        PAP   P P   P  +   PNP             P++ QPL+VKL D+N++ WK QLL+ V+ANGL  FLDG+    P+FLD  Q Q NP++  W+RYN
Subjt:  PAPQQFPTPQFQPTQNFFFPNP------------YPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYN

Query:  RFIMCWIYSSLSQEKMGEIVNLDTAFDIWNSLKRAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYN
        R +M WIY+S+++  +G+IV   +A  IW +L+R Y + + A +  L+T LQ ++K+ L+   Y+ K + + N  ++IGEP++Y DHL + L GLG +YN
Subjt:  RFIMCWIYSSLSQEKMGEIVNLDTAFDIWNSLKRAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYN

Query:  AFVTSIQNCPDNPSLEDIRSLLLAYEARLEKQNSVD
         FVTSIQ+    PS+E++ SLLL+Y+ARLE+Q++ D
Subjt:  AFVTSIQNCPDNPSLEDIRSLLLAYEARLEKQNSVD

PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale]    3.3e-54    45.66
Query:  PLFQAPNQSFPAPQQFPTPQFQPTQNFFFPNP-YPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYNR
        P  Q P  + P  Q     Q    Q    P P  P++ QP ++KL  +N+L WKNQLL+ ++ANGL  F+DG+    P+F D  +  VN +YI W+R+NR
Subjt:  PLFQAPNQSFPAPQQFPTPQFQPTQNFFFPNP-YPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYNR

Query:  FIMCWIYSSLSQEKMGEIVNLDTAFDIWNSLKRAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNA
         IM WIY+SL+Q  MG+IV   +AF+IW +L + Y S++ A+I  L+ +LQ LRKD L+  +Y+ K K + N  +A+GEP+S +DHL ++  GL  EYNA
Subjt:  FIMCWIYSSLSQEKMGEIVNLDTAFDIWNSLKRAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNA

Query:  FVTSIQNCPDNPSLEDIRSLLLAYEARLEKQNSVDQLNVVQANISSLNSQHVNHHFSNRYPPSFS
        FVTSI   PDN  LE+I SLLL+YE RLE QN+  QL+ +QAN++ LN   +N      Y P+FS
Subjt:  FVTSIQNCPDNPSLEDIRSLLLAYEARLEKQNSVDQLNVVQANISSLNSQHVNHHFSNRYPPSFS

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]    2.9e-50    48.82
Query:  PYPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYNRFIMCWIYSSLSQEKMGEIVNLDTAFDIWNSLK
        P P+L Q LS+KL + N L  K+QLL+ ++ANGL  F+D   S+ PK+LDA   QVNP+++ W+R N+ +M WIYSSL+   +G+IV   TA DIW SL 
Subjt:  PYPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYNRFIMCWIYSSLSQEKMGEIVNLDTAFDIWNSLK

Query:  RAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNCPDNPSLEDIRSLLLAYEARLEKQN
          Y+S + A +M L +QLQR++K ++ +S+YL+++K V ++F+ IGEP+SYRD L  IL+GL  EY+ FVTSI N  D PSL+++ SLL  YE RL +++
Subjt:  RAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNCPDNPSLEDIRSLLLAYEARLEKQN

Query:  SVDQLNVVQAN
            LN  QAN
Subjt:  SVDQLNVVQAN

RVX14312.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]    4.8e-45    40.16
Query:  FFPNPYPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYNRFIMCWIYSSLSQEKMGEIVNLDTAFDIW
        F  +  P+L Q  +V L  +N+L W+ Q+L+ ++ANGL   + G I A  +FL  +   +NP+Y  W+R NR +MCWIYSSL++  M +I+ LDTA +IW
Subjt:  FFPNPYPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYNRFIMCWIYSSLSQEKMGEIVNLDTAFDIW

Query:  NSLKRAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNCPDNPSLEDIRSLLLAYEARL
         +L++ + + + ARIM L+ QLQ  +K  LS+ +YL KIK + +   AIGE I+ +D + ++L GLG+EYN+FV ++ +  +  SLE+I S+LL +E +L
Subjt:  NSLKRAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNCPDNPSLEDIRSLLLAYEARL

Query:  EKQNSVDQLNVVQANISSLNSQHVNHHFSNRYPPSFSPIPRPQF
        E+Q+  ++ N++QANI+++N Q   H+  N+    F    R  F
Subjt:  EKQNSVDQLNVVQANISSLNSQHVNHHFSNRYPPSFSPIPRPQF

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]    7.3e-94    71.2
Query:  PTPQF--QPTQNFFFPNPYPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYNRFIMCWIYSSLSQEKM
        PTP F  QP  N F  NP+PTLPQPL+VKL DNNFL WKNQLL+AV+ANGL G+LDGTI   P+FLD +Q Q NP Y  WERYNR +MCWIYSSLS+EKM
Subjt:  PTPQF--QPTQNFFFPNPYPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYNRFIMCWIYSSLSQEKM

Query:  GEIVNLDTAFDIWNSLKRAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNCPDNPSLE
        GE+V+L+T  DIW+SL R YDS TTARIMGLKT+LQ LRKD  SVSQYLAKIKE+ +KF+A+GEP+SYRDHLAH+LDGLGSEYNAFVTSI N  D+PSLE
Subjt:  GEIVNLDTAFDIWNSLKRAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNCPDNPSLE

Query:  DIRSLLLAYEARLEKQNSVDQLNVVQANISSLNSQHVNHHFSNRYPPSFS
        D+RSLLLAYEARL+KQN+VDQLN+ QAN+ +L+ Q    H S R PP FS
Subjt:  DIRSLLLAYEARLEKQNSVDQLNVVQANISSLNSQHVNHHFSNRYPPSFS

TrEMBL top hits    e value    % identity    Alignment
A0A2P5BGF8 Uncharacterized protein    1.6e-54    45.66
Query:  PLFQAPNQSFPAPQQFPTPQFQPTQNFFFPNP-YPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYNR
        P  Q P  + P  Q     Q    Q    P P  P++ QP ++KL  +N+L WKNQLL+ ++ANGL  F+DG+    P+F D  +  VN +YI W+R+NR
Subjt:  PLFQAPNQSFPAPQQFPTPQFQPTQNFFFPNP-YPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYNR

Query:  FIMCWIYSSLSQEKMGEIVNLDTAFDIWNSLKRAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNA
         IM WIY+SL+Q  MG+IV   +AF+IW +L + Y S++ A+I  L+ +LQ LRKD L+  +Y+ K K + N  +A+GEP+S +DHL ++  GL  EYNA
Subjt:  FIMCWIYSSLSQEKMGEIVNLDTAFDIWNSLKRAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNA

Query:  FVTSIQNCPDNPSLEDIRSLLLAYEARLEKQNSVDQLNVVQANISSLNSQHVNHHFSNRYPPSFS
        FVTSI   PDN  LE+I SLLL+YE RLE QN+  QL+ +QAN++ LN   +N      Y P+FS
Subjt:  FVTSIQNCPDNPSLEDIRSLLLAYEARLEKQNSVDQLNVVQANISSLNSQHVNHHFSNRYPPSFS

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE1    1.4e-50    48.82
Query:  PYPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYNRFIMCWIYSSLSQEKMGEIVNLDTAFDIWNSLK
        P P+L Q LS+KL + N L  K+QLL+ ++ANGL  F+D   S+ PK+LDA   QVNP+++ W+R N+ +M WIYSSL+   +G+IV   TA DIW SL 
Subjt:  PYPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYNRFIMCWIYSSLSQEKMGEIVNLDTAFDIWNSLK

Query:  RAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNCPDNPSLEDIRSLLLAYEARLEKQN
          Y+S + A +M L +QLQR++K ++ +S+YL+++K V ++F+ IGEP+SYRD L  IL+GL  EY+ FVTSI N  D PSL+++ SLL  YE RL +++
Subjt:  RAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNCPDNPSLEDIRSLLLAYEARLEKQN

Query:  SVDQLNVVQAN
            LN  QAN
Subjt:  SVDQLNVVQAN

A0A6J1DQX7 uncharacterized protein LOC111022315    3.6e-94    71.2
Query:  PTPQF--QPTQNFFFPNPYPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYNRFIMCWIYSSLSQEKM
        PTP F  QP  N F  NP+PTLPQPL+VKL DNNFL WKNQLL+AV+ANGL G+LDGTI   P+FLD +Q Q NP Y  WERYNR +MCWIYSSLS+EKM
Subjt:  PTPQF--QPTQNFFFPNPYPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYNRFIMCWIYSSLSQEKM

Query:  GEIVNLDTAFDIWNSLKRAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNCPDNPSLE
        GE+V+L+T  DIW+SL R YDS TTARIMGLKT+LQ LRKD  SVSQYLAKIKE+ +KF+A+GEP+SYRDHLAH+LDGLGSEYNAFVTSI N  D+PSLE
Subjt:  GEIVNLDTAFDIWNSLKRAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNCPDNPSLE

Query:  DIRSLLLAYEARLEKQNSVDQLNVVQANISSLNSQHVNHHFSNRYPPSFS
        D+RSLLLAYEARL+KQN+VDQLN+ QAN+ +L+ Q    H S R PP FS
Subjt:  DIRSLLLAYEARLEKQNSVDQLNVVQANISSLNSQHVNHHFSNRYPPSFS

A0A7J0EGI5 Uncharacterized protein    1.3e-51    43.64
Query:  PAPQQFPTPQFQPTQNFFFPNP------------YPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYN
        PAP   P P   P  +   PNP             P++ QPL+VKL D+N++ WK QLL+ V+ANGL  FLDG+    P+FLD  Q Q NP++  W+RYN
Subjt:  PAPQQFPTPQFQPTQNFFFPNP------------YPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYN

Query:  RFIMCWIYSSLSQEKMGEIVNLDTAFDIWNSLKRAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYN
        R +M WIY+S+++  +G+IV   +A  IW +L+R Y + + A +  L+T LQ ++K+ L+   Y+ K + + N  ++IGEP++Y DHL + L GLG +YN
Subjt:  RFIMCWIYSSLSQEKMGEIVNLDTAFDIWNSLKRAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYN

Query:  AFVTSIQNCPDNPSLEDIRSLLLAYEARLEKQNSVD
         FVTSIQ+    PS+E++ SLLL+Y+ARLE+Q++ D
Subjt:  AFVTSIQNCPDNPSLEDIRSLLLAYEARLEKQNSVD

A0A803NL56 Uncharacterized protein    2.4e-50    38.1
Query:  SFNPQPSFPNFQYPQNPPFQTQGVYPRPLFQAPNQSFPAPQQFPTPQFQPTQNFFFPNPYPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTI
        S  PQ    N Q P   P  T    P      P  +  +    P            PN + +  Q +SVKL D N+L W+ Q+ + ++ANGL G++DGT+
Subjt:  SFNPQPSFPNFQYPQNPPFQTQGVYPRPLFQAPNQSFPAPQQFPTPQFQPTQNFFFPNPYPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTI

Query:  SALPKFLDANQTQVNPDYIGWERYNRFIMCWIYSSLSQEKMGEIVNLDTAFDIWNSLKRAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKF
        +   +F ++  +QV+P +  W RYN+ +M W+Y+SLS   +G+IV   TA +IW SL+R Y + + AR    +  LQ L+KD L+ S YL K+K + N  
Subjt:  SALPKFLDANQTQVNPDYIGWERYNRFIMCWIYSSLSQEKMGEIVNLDTAFDIWNSLKRAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKF

Query:  SAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNCPDNPSLEDIRSLLLAYEARLEKQNSVDQLNVVQANISSLNSQHVNHHFSNRYPPSFS--PIPRPQF
        +++G+PIS ++HL ++L+GLG EYNAFVT I   P  P++E++ +LLL+YEARLE+QN+    + +QAN ++L+       F  + P S S  P  +P+F
Subjt:  SAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNCPDNPSLEDIRSLLLAYEARLEKQNSVDQLNVVQANISSLNSQHVNHHFSNRYPPSFS--PIPRPQF

Query:  PS-------PTIPSY
        PS       PTIPS+
Subjt:  PS-------PTIPSY

SwissProt top hits    e value    % identity    Alignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    1.6e-06    23.92
Query:  DNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYNRFIMCWIYSSLSQEKMGEIVNLDTAFDIWNSLKRAYDSNTTARIMGL
        DN F  W+ ++   ++  GL   LD   S  P  + A           W   +      I   LS + +  I++ DTA  IW  L+  Y S T    + L
Subjt:  DNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYNRFIMCWIYSSLSQEKMGEIVNLDTAFDIWNSLKRAYDSNTTARIMGL

Query:  KTQLQRLR-KDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNCPDNPSLEDIRSLLLAYEARLEKQNSVDQLNVVQANIS
        K QL  L   +  +   +L     +  + + +G  I   D    +L+ L S Y+   T+I +      L+D+ S LL  E   +K  +  Q  + +    
Subjt:  KTQLQRLR-KDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNCPDNPSLEDIRSLLLAYEARLEKQNSVDQLNVVQANIS

Query:  SLNSQHVNH
        S      N+
Subjt:  SLNSQHVNH

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    9.5e-20    24.77
Query:  KLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQT-QVNPDYIGWERYNRFIMCWIYSSLSQEKMGEIVNLDTAFDIWNSLKRAYDSNTTAR
        KLT  N+L W  Q+        L+GFLDG+ +  P  +  +   +VNPDY  W+R ++ I   +  ++S      +    TA  IW +L++ Y + +   
Subjt:  KLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQT-QVNPDYIGWERYNRFIMCWIYSSLSQEKMGEIVNLDTAFDIWNSLKRAYDSNTTAR

Query:  IMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNCPDNPSLEDIRSLLLAYEARLEKQNSVDQLNVVQA
        +  L+TQL++  K   ++  Y+  +    ++ + +G+P+ + + +  +L+ L  EY   +  I      P+L +I   LL +E+++   +S   + +   
Subjt:  IMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNCPDNPSLEDIRSLLLAYEARLEKQNSVDQLNVVQA

Query:  NISSLNSQHVNH----HFSNRY
         +S  N+   N+    + +NRY
Subjt:  NISSLNSQHVNH----HFSNRY

Arabidopsis top hits    e value    % identity    Alignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)    4.5e-17    25.58
Query:  LSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYNRFIMCWIYSSLSQEKMGEIVNLD-TAFDIWNSLKRAYDSNT
        +++ L   N+  W+       ++ G+ G +DG+ +  P       T+       W+  +  +  WIY +++   +  I+ +  TA D+W SL+  +  N 
Subjt:  LSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYNRFIMCWIYSSLSQEKMGEIVNLD-TAFDIWNSLKRAYDSNT

Query:  TARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNCPDNPSLEDIRSLLLAYEARLEKQNSVDQLNV
         AR +  + +L+    D+LSV +Y  K+K +++  + +  PIS R  + H+L+GL  +Y+  +  I++    PS  + RS+LL  E+RL           
Subjt:  TARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNCPDNPSLEDIRSLLLAYEARLEKQNSVDQLNV

Query:  VQANISSLNSQHVNH
          +N S  +  H NH
Subjt:  VQANISSLNSQHVNH


Sequences
CDS sequence
ATGAATATTGAAGAAAGTTCGTCCTCCTCCACTGCTCCGGCGATTACTCCGGTGATAACCCCAAACACACCTACCACCACCCCCGTTGTTACTCCAATTCAAAACCAGAA
TCTTCGCCCTTCGTTTAATCAACCCACCTCTGGCACCAACTTGAGAGGTAAAACACCAATCCAGAATAATCCATCATTTCCTTCTTTTAACCCTCAACCTAGTTTCCCAA
ACTTCCAGTACCCTCAGAACCCCCCCTTTCAAACCCAAGGCGTTTATCCTCGACCTCTGTTTCAAGCTCCCAATCAATCATTTCCAGCTCCTCAACAATTTCCAACTCCC
CAATTTCAACCTACCCAAAATTTCTTTTTCCCAAATCCCTATCCAACTCTACCCCAACCTCTTTCGGTCAAGCTCACCGACAATAATTTCCTATTCTGGAAGAATCAGTT
GCTCCATGCGGTAATGGCTAATGGACTCTCTGGCTTTCTCGATGGAACGATCTCAGCTCTACCAAAATTTCTTGATGCAAATCAAACTCAGGTTAATCCGGATTATATCG
GCTGGGAAAGGTACAATAGGTTTATTATGTGCTGGATTTACTCATCCTTGTCTCAGGAGAAAATGGGTGAGATTGTTAATCTAGATACTGCTTTTGATATTTGGAATTCT
CTGAAGCGTGCATATGACTCTAATACTACTGCTCGGATTATGGGATTGAAAACTCAGTTGCAAAGACTTAGGAAGGATAATTTGTCTGTCAGTCAGTATTTAGCTAAAAT
TAAAGAGGTCACGAATAAATTCTCAGCCATTGGTGAACCAATCTCCTACCGCGATCATTTAGCTCACATATTAGATGGTCTAGGTAGTGAGTATAATGCCTTCGTCACTT
CTATACAGAACTGTCCGGATAATCCATCTCTTGAAGATATTCGAAGCTTGTTGTTAGCTTATGAAGCTAGGCTAGAAAAACAGAATTCTGTGGATCAATTAAATGTTGTT
CAGGCTAATATTAGTTCTCTCAACTCTCAGCATGTTAATCACCATTTTTCTAATAGGTATCCACCATCCTTTTCCCCTATTCCTCGACCACAATTTCCCTCCCCTACAAT
ACCTTCGTACCAAGGTTGCACTGGTATTCCCTGTAAGGAAAATGCGTAA
mRNA sequence
ATGAATATTGAAGAAAGTTCGTCCTCCTCCACTGCTCCGGCGATTACTCCGGTGATAACCCCAAACACACCTACCACCACCCCCGTTGTTACTCCAATTCAAAACCAGAA
TCTTCGCCCTTCGTTTAATCAACCCACCTCTGGCACCAACTTGAGAGGTAAAACACCAATCCAGAATAATCCATCATTTCCTTCTTTTAACCCTCAACCTAGTTTCCCAA
ACTTCCAGTACCCTCAGAACCCCCCCTTTCAAACCCAAGGCGTTTATCCTCGACCTCTGTTTCAAGCTCCCAATCAATCATTTCCAGCTCCTCAACAATTTCCAACTCCC
CAATTTCAACCTACCCAAAATTTCTTTTTCCCAAATCCCTATCCAACTCTACCCCAACCTCTTTCGGTCAAGCTCACCGACAATAATTTCCTATTCTGGAAGAATCAGTT
GCTCCATGCGGTAATGGCTAATGGACTCTCTGGCTTTCTCGATGGAACGATCTCAGCTCTACCAAAATTTCTTGATGCAAATCAAACTCAGGTTAATCCGGATTATATCG
GCTGGGAAAGGTACAATAGGTTTATTATGTGCTGGATTTACTCATCCTTGTCTCAGGAGAAAATGGGTGAGATTGTTAATCTAGATACTGCTTTTGATATTTGGAATTCT
CTGAAGCGTGCATATGACTCTAATACTACTGCTCGGATTATGGGATTGAAAACTCAGTTGCAAAGACTTAGGAAGGATAATTTGTCTGTCAGTCAGTATTTAGCTAAAAT
TAAAGAGGTCACGAATAAATTCTCAGCCATTGGTGAACCAATCTCCTACCGCGATCATTTAGCTCACATATTAGATGGTCTAGGTAGTGAGTATAATGCCTTCGTCACTT
CTATACAGAACTGTCCGGATAATCCATCTCTTGAAGATATTCGAAGCTTGTTGTTAGCTTATGAAGCTAGGCTAGAAAAACAGAATTCTGTGGATCAATTAAATGTTGTT
CAGGCTAATATTAGTTCTCTCAACTCTCAGCATGTTAATCACCATTTTTCTAATAGGTATCCACCATCCTTTTCCCCTATTCCTCGACCACAATTTCCCTCCCCTACAAT
ACCTTCGTACCAAGGTTGCACTGGTATTCCCTGTAAGGAAAATGCGTAA
Protein sequence
MNIEESSSSSTAPAITPVITPNTPTTTPVVTPIQNQNLRPSFNQPTSGTNLRGKTPIQNNPSFPSFNPQPSFPNFQYPQNPPFQTQGVYPRPLFQAPNQSFPAPQQFPTP
QFQPTQNFFFPNPYPTLPQPLSVKLTDNNFLFWKNQLLHAVMANGLSGFLDGTISALPKFLDANQTQVNPDYIGWERYNRFIMCWIYSSLSQEKMGEIVNLDTAFDIWNS
LKRAYDSNTTARIMGLKTQLQRLRKDNLSVSQYLAKIKEVTNKFSAIGEPISYRDHLAHILDGLGSEYNAFVTSIQNCPDNPSLEDIRSLLLAYEARLEKQNSVDQLNVV
QANISSLNSQHVNHHFSNRYPPSFSPIPRPQFPSPTIPSYQGCTGIPCKENA
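
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code; `translate` is a hypothetical helper written for illustration, and the two test strings are the first 21 and last 12 nucleotides of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAATATTGAAGAAAGTTCG"))  # MNIEESS
print(translate("GAAAATGCGTAA"))           # ENA
```

Both fragments agree with the start (`MNIEESS...`) and end (`...ENA`) of the protein sequence shown above. Passing the full CDS, with its line breaks, to `translate` would give the complete product for comparison.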