; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041657 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041657
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr13:23280769..23282073
RNA-Seq ExpressionLag0041657
SyntenyLag0041657
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa]2.1e-5351.01Show/hide
Query:  SKSLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDS
        ++ LAVKL+D N++ WK QLLN V+ANGL  FLDGS   P RFLD QQQQ NPEF SW+RYNR +M WIY+S++E  +G+IV   +A+ IW +L++ Y +
Subjt:  SKSLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDS

Query:  KTTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVE
         +   +  L++ LQ I+K+GL+   Y+ K + + +  A+IGEP++Y DHL + L GLG +YNPFVTSIQ+++  P++E+V SLLL+Y+ARLE+QS+ +
Subjt:  KTTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVE

GFZ12741.1 UBX domain-containing protein [Actinidia rufa]9.2e-5440.67Show/hide
Query:  SKSLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDS
        ++ LAVKL+D N++ WK QLLN V+ANGL  FLDGS   P RFLD QQQQ NPEF SW+RYNR +M WIY+S++E  +G+IV   +A+ IW +L++ Y +
Subjt:  SKSLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDS

Query:  KTTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVEQL
         +   +  L++ LQ I+K+GL+   Y+ K + + +  A+IGEP++Y DHL + L GLG +YNPFVTSIQ+++  P++E+  S                  
Subjt:  KTTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVEQL

Query:  SFAQANLSTIQSQFNSRRSSPRPSNFSAPRSPFNPSTFSIPTSQPSVFQPSILGKPQSSSPTSRWSPRNNTNRPQCQICGKFGHTALICHHRTNLAYQTP
                              P++ +      NPST S P S  S   P    + Q+ +P+   +P +   RP+CQIC K GHTA  C+H TNL YQ P
Subjt:  SFAQANLSTIQSQFNSRRSSPRPSNFSAPRSPFNPSTFSIPTSQPSVFQPSILGKPQSSSPTSRWSPRNNTNRPQCQICGKFGHTALICHHRTNLAYQTP

PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale]1.2e-5345.63Show/hide
Query:  SKSLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDS
        ++   +KL+  N+L WKNQLLN ++ANGL  F+DGS P P RF D  +Q  N E+++W+R+NR IM WIY+SL++  MG+IV   +A +IW +L + Y S
Subjt:  SKSLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDS

Query:  KTTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVEQL
         +  +I  L+++LQ +RKDGL+  +Y+ K K + +  AA+GEP+S +DHL ++  GL  EYN FVTSI  R DN  LE++ SLLL+YE RLE Q++  QL
Subjt:  KTTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVEQL

Query:  SFAQANLSTIQSQFNSRRSSPRPSNFSAPRSPFNPS------TFSIPTSQPSVFQPSILGKPQ
        S  QANL    +  N  +   RP NFS P   F  +       F       + FQPSILGKPQ
Subjt:  SFAQANLSTIQSQFNSRRSSPRPSNFSAPRSPFNPS------TFSIPTSQPSVFQPSILGKPQ

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]3.1e-5739.71Show/hide
Query:  LSLFPHKILHTSDYTHCHSSKPTRQVSSSLTTILPTLWPSTCCSPKSIRKSKSLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQ
        +S FP      S+ T  ++  P  Q+ + +T   P+L             S+SL++KL++TN L  K+QLLN ++ANGL  F+D    +P ++LD   +Q
Subjt:  LSLFPHKILHTSDYTHCHSSKPTRQVSSSLTTILPTLWPSTCCSPKSIRKSKSLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQ

Query:  PNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDSKTTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHL
         NPEF+ W+R N+ +M WIYSSL+   +G+IV   TA DIW SL   Y+S +   +M L SQLQ+I+K  + +S+YLS++K V D+FA IGEP+SYRD L
Subjt:  PNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDSKTTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHL

Query:  AHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVEQLSFAQANLSTIQSQFNSRRSSPRPSNFSAPRSPFNPSTFSIPTSQPSVFQP
          IL+GL  EY+ FVTSI NRSD P+L++V SLL  YE RL ++S  + L+F QAN                      PR                    
Subjt:  AHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVEQLSFAQANLSTIQSQFNSRRSSPRPSNFSAPRSPFNPSTFSIPTSQPSVFQP

Query:  SILGKPQSSSPTSRWSPRNNTNRPQCQICGKFGHTALICHHRTNLAYQTP
                        P  N + PQCQICGK GH AL  +HRTNL Y  P
Subjt:  SILGKPQSSSPTSRWSPRNNTNRPQCQICGKFGHTALICHHRTNLAYQTP

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]7.0e-10265.42Show/hide
Query:  LAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDSKTT
        L VKLND NFL WKNQLLNAV+ANGL G+LDG++  P +FLD  Q QPNP + +WERYNR +MCWIYSSLSEEKMGE+VSL+T  DIW+SL + YDSKTT
Subjt:  LAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDSKTT

Query:  TRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVEQLSFA
         RIMGLK++LQ +RKDG SVSQYL+KIKE+ DKFAA+GEP+SYRDHLAH+LDGLGSEYN FVTSI NR+D+P+LEDVRSLLLAYEARL+KQ++V+QL+ A
Subjt:  TRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVEQLSFA

Query:  QANLSTIQSQFNSRRSSPRPSNFSAPRSPFNPSTFSIPTSQPSVFQ-PSILGKPQSSSPTSRWSPRNNTNRPQCQICGKFGHTALICHHRTNLAY
        QANL  +  Q NS+R  P+   FS P    N    S P S  S  Q  SILGKPQS     +W P+ ++++ QCQICGK GH+A +C+HRTN+AY
Subjt:  QANLSTIQSQFNSRRSSPRPSNFSAPRSPFNPSTFSIPTSQPSVFQ-PSILGKPQSSSPTSRWSPRNNTNRPQCQICGKFGHTALICHHRTNLAY

TrEMBL top hitse value%identityAlignment
A0A2P5BGF8 Uncharacterized protein5.8e-5445.63Show/hide
Query:  SKSLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDS
        ++   +KL+  N+L WKNQLLN ++ANGL  F+DGS P P RF D  +Q  N E+++W+R+NR IM WIY+SL++  MG+IV   +A +IW +L + Y S
Subjt:  SKSLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDS

Query:  KTTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVEQL
         +  +I  L+++LQ +RKDGL+  +Y+ K K + +  AA+GEP+S +DHL ++  GL  EYN FVTSI  R DN  LE++ SLLL+YE RLE Q++  QL
Subjt:  KTTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVEQL

Query:  SFAQANLSTIQSQFNSRRSSPRPSNFSAPRSPFNPS------TFSIPTSQPSVFQPSILGKPQ
        S  QANL    +  N  +   RP NFS P   F  +       F       + FQPSILGKPQ
Subjt:  SFAQANLSTIQSQFNSRRSSPRPSNFSAPRSPFNPS------TFSIPTSQPSVFQPSILGKPQ

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE11.5e-5739.71Show/hide
Query:  LSLFPHKILHTSDYTHCHSSKPTRQVSSSLTTILPTLWPSTCCSPKSIRKSKSLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQ
        +S FP      S+ T  ++  P  Q+ + +T   P+L             S+SL++KL++TN L  K+QLLN ++ANGL  F+D    +P ++LD   +Q
Subjt:  LSLFPHKILHTSDYTHCHSSKPTRQVSSSLTTILPTLWPSTCCSPKSIRKSKSLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQ

Query:  PNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDSKTTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHL
         NPEF+ W+R N+ +M WIYSSL+   +G+IV   TA DIW SL   Y+S +   +M L SQLQ+I+K  + +S+YLS++K V D+FA IGEP+SYRD L
Subjt:  PNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDSKTTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHL

Query:  AHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVEQLSFAQANLSTIQSQFNSRRSSPRPSNFSAPRSPFNPSTFSIPTSQPSVFQP
          IL+GL  EY+ FVTSI NRSD P+L++V SLL  YE RL ++S  + L+F QAN                      PR                    
Subjt:  AHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVEQLSFAQANLSTIQSQFNSRRSSPRPSNFSAPRSPFNPSTFSIPTSQPSVFQP

Query:  SILGKPQSSSPTSRWSPRNNTNRPQCQICGKFGHTALICHHRTNLAYQTP
                        P  N + PQCQICGK GH AL  +HRTNL Y  P
Subjt:  SILGKPQSSSPTSRWSPRNNTNRPQCQICGKFGHTALICHHRTNLAYQTP

A0A6J1DQX7 uncharacterized protein LOC1110223153.4e-10265.42Show/hide
Query:  LAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDSKTT
        L VKLND NFL WKNQLLNAV+ANGL G+LDG++  P +FLD  Q QPNP + +WERYNR +MCWIYSSLSEEKMGE+VSL+T  DIW+SL + YDSKTT
Subjt:  LAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDSKTT

Query:  TRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVEQLSFA
         RIMGLK++LQ +RKDG SVSQYL+KIKE+ DKFAA+GEP+SYRDHLAH+LDGLGSEYN FVTSI NR+D+P+LEDVRSLLLAYEARL+KQ++V+QL+ A
Subjt:  TRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVEQLSFA

Query:  QANLSTIQSQFNSRRSSPRPSNFSAPRSPFNPSTFSIPTSQPSVFQ-PSILGKPQSSSPTSRWSPRNNTNRPQCQICGKFGHTALICHHRTNLAY
        QANL  +  Q NS+R  P+   FS P    N    S P S  S  Q  SILGKPQS     +W P+ ++++ QCQICGK GH+A +C+HRTN+AY
Subjt:  QANLSTIQSQFNSRRSSPRPSNFSAPRSPFNPSTFSIPTSQPSVFQ-PSILGKPQSSSPTSRWSPRNNTNRPQCQICGKFGHTALICHHRTNLAY

A0A7J0EGI5 Uncharacterized protein1.0e-5351.01Show/hide
Query:  SKSLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDS
        ++ LAVKL+D N++ WK QLLN V+ANGL  FLDGS   P RFLD QQQQ NPEF SW+RYNR +M WIY+S++E  +G+IV   +A+ IW +L++ Y +
Subjt:  SKSLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDS

Query:  KTTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVE
         +   +  L++ LQ I+K+GL+   Y+ K + + +  A+IGEP++Y DHL + L GLG +YNPFVTSIQ+++  P++E+V SLLL+Y+ARLE+QS+ +
Subjt:  KTTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVE

A0A7J0GPN0 UBX domain-containing protein4.5e-5440.67Show/hide
Query:  SKSLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDS
        ++ LAVKL+D N++ WK QLLN V+ANGL  FLDGS   P RFLD QQQQ NPEF SW+RYNR +M WIY+S++E  +G+IV   +A+ IW +L++ Y +
Subjt:  SKSLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDS

Query:  KTTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVEQL
         +   +  L++ LQ I+K+GL+   Y+ K + + +  A+IGEP++Y DHL + L GLG +YNPFVTSIQ+++  P++E+  S                  
Subjt:  KTTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVEQL

Query:  SFAQANLSTIQSQFNSRRSSPRPSNFSAPRSPFNPSTFSIPTSQPSVFQPSILGKPQSSSPTSRWSPRNNTNRPQCQICGKFGHTALICHHRTNLAYQTP
                              P++ +      NPST S P S  S   P    + Q+ +P+   +P +   RP+CQIC K GHTA  C+H TNL YQ P
Subjt:  SFAQANLSTIQSQFNSRRSSPRPSNFSAPRSPFNPSTFSIPTSQPSVFQPSILGKPQSSSPTSRWSPRNNTNRPQCQICGKFGHTALICHHRTNLAYQTP

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.8e-2125.77Show/hide
Query:  SLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFL-DDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDSK
        S   KL  TN+L W  Q+        L GFLDGS   P   +  D   + NP++  W+R ++ I   +  ++S      +    TAA IW +L+K Y + 
Subjt:  SLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFL-DDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDSK

Query:  TTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVEQLS
        +   +  L++QL++  K   ++  Y+  +    D+ A +G+P+ + + +  +L+ L  EY P +  I  +   P L ++   LL +E+++   SS   + 
Subjt:  TTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVEQLS

Query:  FAQANLSTIQSQFNSRRSSPRPSNFSAPRSPFNPSTFSIPTSQPSVFQPSILGKPQSSSPTSRWSPRNNTNRP---QCQICGKFGHTALIC
             +S   +   +  ++   +N    R+  N S                  KP   S T+ + P NN ++P   +CQICG  GH+A  C
Subjt:  FAQANLSTIQSQFNSRRSSPRPSNFSAPRSPFNPSTFSIPTSQPSVFQPSILGKPQSSSPTSRWSPRNNTNRP---QCQICGKFGHTALIC

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)6.5e-1319.73Show/hide
Query:  PSTCCSPKSIRKSKSLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEK-MGEIVSLDTA
        P       +I+    + + + ++N+  W+   L   L+  + G +DG++              N   ++W++ +  +   +Y +L+ ++  G  V+  T+
Subjt:  PSTCCSPKSIRKSKSLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQPNPEFLSWERYNRFIMCWIYSSLSEEK-MGEIVSLDTA

Query:  ADIWNSLKKSYDSKTTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAY
         DIW  +K  + +    R + L S+L+      + V+ Y  K+K++ D    +  P++ R+ + ++L+GL  +++  +  I++R   P+ +D  ++L   
Subjt:  ADIWNSLKKSYDSKTTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSEYNPFVTSIQNRSDNPALEDVRSLLLAY

Query:  EARLEKQSSVEQLSFAQANLSTI
        E RL++           ++ ST+
Subjt:  EARLEKQSSVEQLSFAQANLSTI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATCTTCATCCTCTACCTCTTCTGAAGCTCTCCCTGTTTCCACACAAAATCCTCCACACCAGTGACTACACCCATTGTCACTCCAGTAAACCAACGAGGCAAGTCTC
TAGCTCCCTCACCACCATTTTACCCACCCTCTGGCCTTCAACCTGCTGTTCACCCAAATCCATCCGCAAATCAAAATCCCTTGCCGTCAAGTTGAATGACACAAACTTCC
TCCCTTGGAAGAACCAGCTTCTCAACGCCGTCTTAGCGAATGGTCTCCATGGTTTCCTTGATGGCTCAGTTCCTGCCCCTTCTCGATTTCTTGATGATCAACAGCAGCAG
CCCAATCCGGAATTCCTCAGCTGGGAGAGGTACAATAGGTTTATTATGTGTTGGATTTACTCGTCTTTGTCTGAGGAGAAAATGGGGGAAATTGTTAGTTTAGACACTGC
TGCAGATATATGGAACTCATTGAAAAAATCTTATGATTCTAAGACTACGACTCGTATTATGGGTCTTAAATCTCAGTTACAAAAAATTAGGAAGGATGGATTATCCGTTA
GTCAGTATCTCTCTAAGATCAAGGAAGTACCTGATAAATTCGCTGCCATAGGAGAACCAATCTCTTATCGAGATCACCTAGCTCATATCCTCGATGGTTTGGGGAGTGAG
TATAATCCTTTTGTCACTTCAATTCAAAATAGATCTGATAATCCGGCATTAGAAGATGTACGCAGCTTATTACTTGCCTATGAGGCCAGACTTGAGAAACAAAGTTCCGT
AGAACAACTGAGTTTTGCACAAGCTAATTTAAGCACCATTCAGTCACAATTTAACAGTCGTCGGTCCTCCCCTCGGCCTTCTAATTTTTCTGCACCTAGATCTCCCTTCA
ATCCATCAACATTTTCTATACCTACCTCTCAACCCTCTGTTTTTCAACCTAGCATTCTTGGTAAACCACAGTCTTCGTCTCCAACCTCTAGATGGTCTCCTCGAAATAAC
ACCAATCGACCACAATGCCAAATTTGTGGCAAATTCGGGCATACTGCCCTCATTTGTCATCACCGCACAAATTTGGCTTACCAAACCCCCCACCTCAAGCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCATCTTCATCCTCTACCTCTTCTGAAGCTCTCCCTGTTTCCACACAAAATCCTCCACACCAGTGACTACACCCATTGTCACTCCAGTAAACCAACGAGGCAAGTCTC
TAGCTCCCTCACCACCATTTTACCCACCCTCTGGCCTTCAACCTGCTGTTCACCCAAATCCATCCGCAAATCAAAATCCCTTGCCGTCAAGTTGAATGACACAAACTTCC
TCCCTTGGAAGAACCAGCTTCTCAACGCCGTCTTAGCGAATGGTCTCCATGGTTTCCTTGATGGCTCAGTTCCTGCCCCTTCTCGATTTCTTGATGATCAACAGCAGCAG
CCCAATCCGGAATTCCTCAGCTGGGAGAGGTACAATAGGTTTATTATGTGTTGGATTTACTCGTCTTTGTCTGAGGAGAAAATGGGGGAAATTGTTAGTTTAGACACTGC
TGCAGATATATGGAACTCATTGAAAAAATCTTATGATTCTAAGACTACGACTCGTATTATGGGTCTTAAATCTCAGTTACAAAAAATTAGGAAGGATGGATTATCCGTTA
GTCAGTATCTCTCTAAGATCAAGGAAGTACCTGATAAATTCGCTGCCATAGGAGAACCAATCTCTTATCGAGATCACCTAGCTCATATCCTCGATGGTTTGGGGAGTGAG
TATAATCCTTTTGTCACTTCAATTCAAAATAGATCTGATAATCCGGCATTAGAAGATGTACGCAGCTTATTACTTGCCTATGAGGCCAGACTTGAGAAACAAAGTTCCGT
AGAACAACTGAGTTTTGCACAAGCTAATTTAAGCACCATTCAGTCACAATTTAACAGTCGTCGGTCCTCCCCTCGGCCTTCTAATTTTTCTGCACCTAGATCTCCCTTCA
ATCCATCAACATTTTCTATACCTACCTCTCAACCCTCTGTTTTTCAACCTAGCATTCTTGGTAAACCACAGTCTTCGTCTCCAACCTCTAGATGGTCTCCTCGAAATAAC
ACCAATCGACCACAATGCCAAATTTGTGGCAAATTCGGGCATACTGCCCTCATTTGTCATCACCGCACAAATTTGGCTTACCAAACCCCCCACCTCAAGCTTTAG
Protein sequenceShow/hide protein sequence
MHLHPLPLLKLSLFPHKILHTSDYTHCHSSKPTRQVSSSLTTILPTLWPSTCCSPKSIRKSKSLAVKLNDTNFLPWKNQLLNAVLANGLHGFLDGSVPAPSRFLDDQQQQ
PNPEFLSWERYNRFIMCWIYSSLSEEKMGEIVSLDTAADIWNSLKKSYDSKTTTRIMGLKSQLQKIRKDGLSVSQYLSKIKEVPDKFAAIGEPISYRDHLAHILDGLGSE
YNPFVTSIQNRSDNPALEDVRSLLLAYEARLEKQSSVEQLSFAQANLSTIQSQFNSRRSSPRPSNFSAPRSPFNPSTFSIPTSQPSVFQPSILGKPQSSSPTSRWSPRNN
TNRPQCQICGKFGHTALICHHRTNLAYQTPHLKL