CuGenDBv2

Lag0015522 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0015522
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr12:15217316..15218785
RNA-Seq Expression: Lag0015522
Synteny: Lag0015522
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 7.0e-40 | % identity: 43.59
Query:  VALLLTNGSRIESKATINADGVLPTANLTIQNPTQKDTKNLRNVSQS--QQQQSFGNGRGRGRFNSSQGREGRSWNSRNRPQCQICNKIGHTTLKGYSRV
        ++LLLT  S+ ESK  + ++  LP+ N+  Q  T+K  ++    +Q+      S+    GRG   S++GR G    +RN+PQCQIC K+G++  + + R 
Subjt:  VALLLTNGSRIESKATINADGVLPTANLTIQNPTQKDTKNLRNVSQS--QQQQSFGNGRGRGRFNSSQGREGRSWNSRNRPQCQICNKIGHTTLKGYSRV

Query:  LMLGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQAMTTTPSFNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFG---FASLTS
             Y  + N S       N+     N    M AM      N D NWY DSG T HLT+S  N+S+ S+Y G NQI+  NG+ L I  +G   F S T 
Subjt:  LMLGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQAMTTTPSFNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFG---FASLTS

Query:  PSNHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLLHGTLHEGLYKFNISKPQH
        P    F LNN+L VPSITKNLIS VSQFAKDN +FFEFH T C VKD  T +VLL G L++GLYKF I +P H
Subjt:  PSNHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLLHGTLHEGLYKFNISKPQH

KAF7831079.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Senna tora] | e value: 3.4e-34 | % identity: 40.95
Query:  SFGNGRGRGRFNSSQGREGRSWNSRNRPQCQICNKIGHTTLKGYSRV--LMLGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQAMTTTPSFNQDCNWYLD
        S GN +GRG    SQ R GR+W   N+PQCQ+C ++GH  +  Y+R       A LTQF   +K            N  +P++A+  TP    D +W+ D
Subjt:  SFGNGRGRGRFNSSQGREGRSWNSRNRPQCQICNKIGHTTLKGYSRV--LMLGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQAMTTTPSFNQDCNWYLD

Query:  SGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFGFASLTSPSNHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVL
        SG + HLTNS  N+ V   Y G  Q+H+ NG+ L I+  GF+ L + +     LN ILHVP ++KNL+S +S+FAKDN ++FEFH   C VK Q   +VL
Subjt:  SGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFGFASLTSPSNHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVL

Query:  LHGTLHEGLYKF-NIS---KPQHSIPQFKIPS
        L G   +GLY F N++   KP  S  +F + S
Subjt:  LHGTLHEGLYKF-NIS---KPQHSIPQFKIPS

KAF7833293.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Senna tora] | e value: 4.4e-34 | % identity: 37.97
Query:  ALLLTNGSRIESKATINADGVLPTANLTIQNPTQKDTKNLRNVSQSQQQQSFGNG--RGRGRFNSSQGREGRSWNSRN-------------RPQ------
        +LLL    R+++  +   +    T ++ + +   KD K+  N  QS QQ +F N   RGRG  N++ G  GR   + N             +PQ      
Subjt:  ALLLTNGSRIESKATINADGVLPTANLTIQNPTQKDTKNLRNVSQSQQQQSFGNG--RGRGRFNSSQGREGRSWNSRN-------------RPQ------

Query:  --CQICNKIGHTTLKGYSRVLMLGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQAMTTTPSFNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVG
          CQ+CNK GH     Y R          F+ +   F  QN  Q    R   MQA    P    D  WY DSG T H+T +  N+  S+ Y G  Q+HVG
Subjt:  --CQICNKIGHTTLKGYSRVLMLGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQAMTTTPSFNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVG

Query:  NGTCLDINLFGFASLTSPSNH--VFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLLHGTLH-EGLYKFNI------SKPQHS
        NG  L+I   G +SL SP N     HLNN+LHVP+ITKNL+S VS+FA DN  FFEFH T C VK QAT++VLL GT+  +GLY F+       S  Q S
Subjt:  NGTCLDINLFGFASLTSPSNH--VFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLLHGTLH-EGLYKFNI------SKPQHS

Query:  IPQFKIPSENYSLFIL
         P   + S + S+  L
Subjt:  IPQFKIPSENYSLFIL

KZV26181.1 hypothetical protein F511_06348 [Dorcoceras hygrometricum] | e value: 4.6e-39 | % identity: 41.89
Query:  ALLLTNGSRIESKATINADGVLPTANLTIQNPTQKDTKNLRNVSQSQQQQSFGNGRGRGRFNSSQGREGRS-WNSRNRPQCQICNKIGHTTLKGYSRVLM
        ALLL +  RIE+          P+ N+T   P+Q+  +N      + Q Q    GRGRGR     GR GR  W++  RP CQIC   GH     Y R   
Subjt:  ALLLTNGSRIESKATINADGVLPTANLTIQNPTQKDTKNLRNVSQSQQQQSFGNGRGRGRFNSSQGREGRS-WNSRNRPQCQICNKIGHTTLKGYSRVLM

Query:  LGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQ-----AMTTTPSFNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFGFASLTS
           +  +F P S       + Q   NR SP       A T + S +++  WY DSG ++H+TN  GN+SVSS+Y G +++ VGNG  L I+  G ++L  
Subjt:  LGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQ-----AMTTTPSFNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFGFASLTS

Query:  -PSNHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLLHGTLHEGLYKFN----ISKPQH-------SIPQFKIPSEN
         PS+  F L N+LHVP ITKNLIS VS+FA DN ++FEFH +FCLVKD AT  VLL GTLH GLY+FN    IS P H       S+   K+P ++
Subjt:  -PSNHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLLHGTLHEGLYKFN----ISKPQH-------SIPQFKIPSEN

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 7.0e-40 | % identity: 43.59
Query:  VALLLTNGSRIESKATINADGVLPTANLTIQNPTQKDTKNLRNVSQS--QQQQSFGNGRGRGRFNSSQGREGRSWNSRNRPQCQICNKIGHTTLKGYSRV
        ++LLLT  S+ ESK  + ++  LP+ N+  Q  T+K  ++    +Q+      S+    GRG   S++GR G    +RN+PQCQIC K+G++  + + R 
Subjt:  VALLLTNGSRIESKATINADGVLPTANLTIQNPTQKDTKNLRNVSQS--QQQQSFGNGRGRGRFNSSQGREGRSWNSRNRPQCQICNKIGHTTLKGYSRV

Query:  LMLGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQAMTTTPSFNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFG---FASLTS
             Y  + N S       N+     N    M AM      N D NWY DSG T HLT+S  N+S+ S+Y G NQI+  NG+ L I  +G   F S T 
Subjt:  LMLGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQAMTTTPSFNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFG---FASLTS

Query:  PSNHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLLHGTLHEGLYKFNISKPQH
        P    F LNN+L VPSITKNLIS VSQFAKDN +FFEFH T C VKD  T +VLL G L++GLYKF I +P H
Subjt:  PSNHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLLHGTLHEGLYKFNISKPQH

TrEMBL top hits (columns: e value, % identity, alignment)
A0A151RQY5 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) | e value: 3.6e-34 | % identity: 41.3
Query:  NLRNVSQSQQQQSFGNGRGRGRFNSSQGREGRSWNSRNRPQCQICNKIGHTTLKGYSRVLMLGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQAMTTTPS
        N R    S +   + NGRG GR N  Q     SWN  NRPQCQIC K GH  +  + R          FN   +  P  N  Q  G   S  QA   TP+
Subjt:  NLRNVSQSQQQQSFGNGRGRGRFNSSQGREGRSWNSRNRPQCQICNKIGHTTLKGYSRVLMLGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQAMTTTPS

Query:  FNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFGFASL-TSPSNHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFC
           D  WY DSG ++H+TN   N+SV + Y GN+++ +GNG+ L I   G +    S SN+ F +N +LHVPSITKNL+S VSQF +DN +FFEFH   C
Subjt:  FNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFGFASL-TSPSNHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFC

Query:  LVKDQATERVLLHGTLHEGLYKFNISKPQHSIPQFKIPSENYSLFIL
         VK + T+  LL G   +GLY F   + Q+ I  F   ++++SL  L
Subjt:  LVKDQATERVLLHGTLHEGLYKFNISKPQHSIPQFKIPSENYSLFIL

A0A2Z7AWA7 Integrase catalytic domain-containing protein | e value: 2.2e-39 | % identity: 41.89
Query:  ALLLTNGSRIESKATINADGVLPTANLTIQNPTQKDTKNLRNVSQSQQQQSFGNGRGRGRFNSSQGREGRS-WNSRNRPQCQICNKIGHTTLKGYSRVLM
        ALLL +  RIE+          P+ N+T   P+Q+  +N      + Q Q    GRGRGR     GR GR  W++  RP CQIC   GH     Y R   
Subjt:  ALLLTNGSRIESKATINADGVLPTANLTIQNPTQKDTKNLRNVSQSQQQQSFGNGRGRGRFNSSQGREGRS-WNSRNRPQCQICNKIGHTTLKGYSRVLM

Query:  LGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQ-----AMTTTPSFNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFGFASLTS
           +  +F P S       + Q   NR SP       A T + S +++  WY DSG ++H+TN  GN+SVSS+Y G +++ VGNG  L I+  G ++L  
Subjt:  LGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQ-----AMTTTPSFNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFGFASLTS

Query:  -PSNHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLLHGTLHEGLYKFN----ISKPQH-------SIPQFKIPSEN
         PS+  F L N+LHVP ITKNLIS VS+FA DN ++FEFH +FCLVKD AT  VLL GTLH GLY+FN    IS P H       S+   K+P ++
Subjt:  -PSNHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLLHGTLHEGLYKFN----ISKPQH-------SIPQFKIPSEN

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 3.4e-40 | % identity: 43.59
Query:  VALLLTNGSRIESKATINADGVLPTANLTIQNPTQKDTKNLRNVSQS--QQQQSFGNGRGRGRFNSSQGREGRSWNSRNRPQCQICNKIGHTTLKGYSRV
        ++LLLT  S+ ESK  + ++  LP+ N+  Q  T+K  ++    +Q+      S+    GRG   S++GR G    +RN+PQCQIC K+G++  + + R 
Subjt:  VALLLTNGSRIESKATINADGVLPTANLTIQNPTQKDTKNLRNVSQS--QQQQSFGNGRGRGRFNSSQGREGRSWNSRNRPQCQICNKIGHTTLKGYSRV

Query:  LMLGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQAMTTTPSFNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFG---FASLTS
             Y  + N S       N+     N    M AM      N D NWY DSG T HLT+S  N+S+ S+Y G NQI+  NG+ L I  +G   F S T 
Subjt:  LMLGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQAMTTTPSFNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFG---FASLTS

Query:  PSNHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLLHGTLHEGLYKFNISKPQH
        P    F LNN+L VPSITKNLIS VSQFAKDN +FFEFH T C VKD  T +VLL G L++GLYKF I +P H
Subjt:  PSNHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLLHGTLHEGLYKFNISKPQH

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 3.4e-40 | % identity: 43.59
Query:  VALLLTNGSRIESKATINADGVLPTANLTIQNPTQKDTKNLRNVSQS--QQQQSFGNGRGRGRFNSSQGREGRSWNSRNRPQCQICNKIGHTTLKGYSRV
        ++LLLT  S+ ESK  + ++  LP+ N+  Q  T+K  ++    +Q+      S+    GRG   S++GR G    +RN+PQCQIC K+G++  + + R 
Subjt:  VALLLTNGSRIESKATINADGVLPTANLTIQNPTQKDTKNLRNVSQS--QQQQSFGNGRGRGRFNSSQGREGRSWNSRNRPQCQICNKIGHTTLKGYSRV

Query:  LMLGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQAMTTTPSFNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFG---FASLTS
             Y  + N S       N+     N    M AM      N D NWY DSG T HLT+S  N+S+ S+Y G NQI+  NG+ L I  +G   F S T 
Subjt:  LMLGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQAMTTTPSFNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFG---FASLTS

Query:  PSNHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLLHGTLHEGLYKFNISKPQH
        P    F LNN+L VPSITKNLIS VSQFAKDN +FFEFH T C VKD  T +VLL G L++GLYKF I +P H
Subjt:  PSNHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLLHGTLHEGLYKFNISKPQH

A0A5D3D3G2 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.1e-33 | % identity: 35.22
Query:  KFQVLTVLEGHELEEHISEDCQPPPEKIQGHKLFKMFVALLLTNGSRI--------ESKATINADGVLPTANLTIQNPTQKDTKNLRNVSQS--QQQQSF
        KFQ+LT LE +++E  +  + +PP + +              T+ +R         ++++ + ++  LP+ N+  Q  T+K  ++    SQ+      S+
Subjt:  KFQVLTVLEGHELEEHISEDCQPPPEKIQGHKLFKMFVALLLTNGSRI--------ESKATINADGVLPTANLTIQNPTQKDTKNLRNVSQS--QQQQSF

Query:  GNGRGRGRFNSSQGREGRSWNSRNRPQCQICNKIGHTTLKGYSRVLMLGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQAMTTTPSFNQDCNWYLDSGPT
            GRG   S+ G  G    +RN+PQCQIC K+GH+                                                    D NWY DSG T
Subjt:  GNGRGRGRFNSSQGREGRSWNSRNRPQCQICNKIGHTTLKGYSRVLMLGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQAMTTTPSFNQDCNWYLDSGPT

Query:  YHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFG---FASLTSPSNHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLL
         HLT+S  N+S  S+Y G NQI+  NG+ L I  +G   F S T P    F L N+LHVPSITKNLIS VS FAKDN +FFEFH T C VKD    +VLL
Subjt:  YHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFG---FASLTSPSNHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLL

Query:  HGTLHEGLYKFNISKPQH
         G L++GLYKF I +P H
Subjt:  HGTLHEGLYKFNISKPQH

SwissProt top hits (columns: e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 1.4e-14 | % identity: 29.29
Query:  LLTNGSRIESKATINADGVLPTANLTIQNPTQKDTKNLRNVSQSQQQQSFGNGRGRGRFNSSQGREGRSWNSRNRP---QCQICNKIGHTTLKGYSRVLM
        LL + S+I     +++  V+P     + +     T N  N +++ +  +  N      +  S      + N++++P   +CQIC   GH+      R   
Subjt:  LLTNGSRIESKATINADGVLPTANLTIQNPTQKDTKNLRNVSQSQQQQSFGNGRGRGRFNSSQGREGRSWNSRNRP---QCQICNKIGHTTLKGYSRVLM

Query:  LGAYLTQFN---PSSKFFPGQNSGQMYGNRFSPMQAMTTTPSFNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFGFASLTSPS
        L  +L+  N   P S F P Q    +         A+ +  S N   NW LDSG T+H+T+ F N+S+   Y G + + V +G+ + I+  G  SL++ S
Subjt:  LGAYLTQFN---PSSKFFPGQNSGQMYGNRFSPMQAMTTTPSFNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFGFASLTSPS

Query:  NHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLLHGTLHEGLYKFNISKPQHSIPQFKIPS
          + +L+NIL+VP+I KNLISV  +    N +  EF      VKD  T   LL G   + LY++ I+  Q  +  F  PS
Subjt:  NHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLLHGTLHEGLYKFNISKPQHSIPQFKIPS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 8.7e-17 | % identity: 30.37
Query:  ESK-ATINADGVLP-TANLTIQNPTQKDTKNLRNVSQSQQQQSFGNGRGRGRFNSSQGREGRSWNSRNRP---QCQICNKIGHTTLKGYSRVLMLGAYLT
        ESK   +N+  V+P TAN+     T ++T   RN +     +++ N   R           RS N + +P   +CQIC+  GH+  K   ++    +   
Subjt:  ESK-ATINADGVLP-TANLTIQNPTQKDTKNLRNVSQSQQQQSFGNGRGRGRFNSSQGREGRSWNSRNRP---QCQICNKIGHTTLKGYSRVLMLGAYLT

Query:  QFNPSSKFFPGQNSGQMYGNRFSPMQAMTTTPSFNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFGFASLTSPSNHVFHLNNI
        Q   +S F P Q           P   +     +N + NW LDSG T+H+T+ F N+S    Y G + + + +G+ + I   G ASL + S+    LN +
Subjt:  QFNPSSKFFPGQNSGQMYGNRFSPMQAMTTTPSFNQDCNWYLDSGPTYHLTNSFGNMSVSSKYLGNNQIHVGNGTCLDINLFGFASLTSPSNHVFHLNNI

Query:  LHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLLHGTLHEGLYKFNISKPQHSIPQFKIP
        L+VP+I KNLISV  +    N +  EF      VKD  T   LL G   + LY++ I+  Q ++  F  P
Subjt:  LHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLLHGTLHEGLYKFNISKPQHSIPQFKIP

Arabidopsis top hits (columns: e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGATTTTTCAGAGACGAACTCTGAAATTTCAAGTGGAAATCAAAAATTTCAAGTTCTCACGGTGCTCGAAGGACATGAATTGGAAGAGCATATCAGTGAAGATTGCCA
ACCTCCTCCAGAGAAAATCCAAGGTCACAAACTGTTCAAGATGTTTGTTGCATTGCTTTTAACAAATGGAAGTAGGATAGAAAGCAAAGCTACTATTAATGCTGATGGAG
TCTTACCTACTGCTAATCTCACAATTCAAAATCCTACACAAAAAGATACAAAAAATTTGAGAAATGTGAGTCAGTCGCAACAACAACAAAGTTTCGGTAATGGTAGAGGC
AGAGGAAGATTTAATTCTAGTCAAGGTAGAGAAGGAAGGTCTTGGAATAGTAGAAACAGACCTCAGTGTCAAATCTGTAACAAAATTGGTCATACAACCTTAAAAGGCTA
TTCTCGTGTTTTGATGCTTGGGGCTTACTTGACTCAATTCAACCCCTCTAGTAAGTTTTTTCCTGGACAGAATTCTGGTCAGATGTATGGGAACCGGTTCTCTCCAATGC
AGGCCATGACAACTACTCCAAGCTTTAATCAAGATTGTAATTGGTATCTGGATTCCGGGCCTACCTATCACTTAACAAATAGCTTTGGGAACATGTCTGTCAGCTCAAAA
TATCTTGGAAACAATCAAATTCATGTGGGCAATGGTACATGTTTGGATATAAATTTGTTTGGATTTGCTTCTCTTACTTCGCCTAGTAATCATGTCTTTCATCTAAATAA
TATTTTACATGTTCCATCTATTACCAAGAATTTGATTAGTGTCGTCAGTCAGTTTGCCAAAGACAATTTCATTTTCTTTGAATTTCACCATACTTTTTGTCTTGTGAAGG
ACCAAGCAACTGAGCGAGTACTTCTCCACGGGACTCTACATGAAGGACTATACAAGTTCAACATATCCAAGCCTCAACACTCTATCCCACAATTCAAAATCCCATCAGAA
AATTATTCTCTGTTTATATTGTGA
mRNA sequence
ATGGATTTTTCAGAGACGAACTCTGAAATTTCAAGTGGAAATCAAAAATTTCAAGTTCTCACGGTGCTCGAAGGACATGAATTGGAAGAGCATATCAGTGAAGATTGCCA
ACCTCCTCCAGAGAAAATCCAAGGTCACAAACTGTTCAAGATGTTTGTTGCATTGCTTTTAACAAATGGAAGTAGGATAGAAAGCAAAGCTACTATTAATGCTGATGGAG
TCTTACCTACTGCTAATCTCACAATTCAAAATCCTACACAAAAAGATACAAAAAATTTGAGAAATGTGAGTCAGTCGCAACAACAACAAAGTTTCGGTAATGGTAGAGGC
AGAGGAAGATTTAATTCTAGTCAAGGTAGAGAAGGAAGGTCTTGGAATAGTAGAAACAGACCTCAGTGTCAAATCTGTAACAAAATTGGTCATACAACCTTAAAAGGCTA
TTCTCGTGTTTTGATGCTTGGGGCTTACTTGACTCAATTCAACCCCTCTAGTAAGTTTTTTCCTGGACAGAATTCTGGTCAGATGTATGGGAACCGGTTCTCTCCAATGC
AGGCCATGACAACTACTCCAAGCTTTAATCAAGATTGTAATTGGTATCTGGATTCCGGGCCTACCTATCACTTAACAAATAGCTTTGGGAACATGTCTGTCAGCTCAAAA
TATCTTGGAAACAATCAAATTCATGTGGGCAATGGTACATGTTTGGATATAAATTTGTTTGGATTTGCTTCTCTTACTTCGCCTAGTAATCATGTCTTTCATCTAAATAA
TATTTTACATGTTCCATCTATTACCAAGAATTTGATTAGTGTCGTCAGTCAGTTTGCCAAAGACAATTTCATTTTCTTTGAATTTCACCATACTTTTTGTCTTGTGAAGG
ACCAAGCAACTGAGCGAGTACTTCTCCACGGGACTCTACATGAAGGACTATACAAGTTCAACATATCCAAGCCTCAACACTCTATCCCACAATTCAAAATCCCATCAGAA
AATTATTCTCTGTTTATATTGTGA
Protein sequence
MDFSETNSEISSGNQKFQVLTVLEGHELEEHISEDCQPPPEKIQGHKLFKMFVALLLTNGSRIESKATINADGVLPTANLTIQNPTQKDTKNLRNVSQSQQQQSFGNGRG
RGRFNSSQGREGRSWNSRNRPQCQICNKIGHTTLKGYSRVLMLGAYLTQFNPSSKFFPGQNSGQMYGNRFSPMQAMTTTPSFNQDCNWYLDSGPTYHLTNSFGNMSVSSK
YLGNNQIHVGNGTCLDINLFGFASLTSPSNHVFHLNNILHVPSITKNLISVVSQFAKDNFIFFEFHHTFCLVKDQATERVLLHGTLHEGLYKFNISKPQHSIPQFKIPSE
NYSLFIL
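
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
# Assumption: this gene is translated with the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and spaces so a multi-line sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and the final 24 bases of the CDS listed above.
print(translate("ATGGATTTTTCAGAGACGAAC"))
print(translate("AATTATTCTCTGTTTATATTGTGA"))
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the protein sequence shown above is one way to confirm the two records agree.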