CuGenDBv2

Lag0005395 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0005395
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr6:16355746..16356722
RNA-Seq Expression: Lag0005395
Synteny: Lag0005395
Gene Ontology terms: NA
InterPro domains: NA
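
The genome location field packs the chromosome and both coordinates into one string. A minimal sketch of how it could be split and measured follows; the function names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption, since the page does not state it.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr6:16355746..16356722")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 977 bp on chr6.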


Homology
GenBank top hits (e value, % identity, alignment)
TXG55646.1 hypothetical protein EZV62_020902 [Acer yangbiense] (e value: 4.1e-55; identity: 35.76%)
Query:  ATTTTTPFVNPLSIVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQS---SGIENVGSIANPAYETWMSTDQSLLAWLYGSMLPTIACD
        ++  ++PF N L+    +KLD +N++LW++MV  +++G +LDG++  T   P E + S    G+ + GS +NP YE W+  DQ L+ WLY SM   +A  
Subjt:  ATTTTTPFVNPLSIVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQS---SGIENVGSIANPAYETWMSTDQSLLAWLYGSMLPTIACD

Query:  ILNHKTSREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKESITWQEVY
        ++   T+  +WKALENL+G   K++   ++ ++QTTRK    M +YL+ MK  AD L +A +P  E+ L  + L GLD EY+P+   I A+E  TWQE+Y
Subjt:  ILNHKTSREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKESITWQEVY

Query:  STLI-----------------------------------------NQETRSQ--NHGYDQSRSQRNNGRGRGRGGRNFYPRGNSKPTCEVCGKMGHTVEK
         TL+                                         NQ+  +Q  N   ++   +   GR RGRGGRN     NS+PTC+VCGK GH+   
Subjt:  STLI-----------------------------------------NQETRSQ--NHGYDQSRSQRNNGRGRGRGGRNFYPRGNSKPTCEVCGKMGHTVEK

Query:  CYHRLNMRYMGSAP--KEGEHSAAAYIATPDIINDPSWLLDSGA
        CY R +  YMGS P      +S + ++ATP+ ++D +W  DSGA
Subjt:  CYHRLNMRYMGSAP--KEGEHSAAAYIATPDIINDPSWLLDSGA

TXG67243.1 hypothetical protein EZV62_008518 [Acer yangbiense] (e value: 2.0e-54; identity: 34.94%)
Query:  ATTTTTPFVNPLSIVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQS-----------SGIENVGSIANPAYETWMSTDQSLLAWLYGS
        ++  ++PF N L+    +KLD +N++LW++MV  +++G +LDG++  T   P E + S            G+ + GS +NP YE W+  DQ L+ WLY S
Subjt:  ATTTTTPFVNPLSIVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQS-----------SGIENVGSIANPAYETWMSTDQSLLAWLYGS

Query:  MLPTIACDILNHKTSREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKE
        M   +A  ++   T+  +WKALENL+G   K++   ++ ++QTTRK    M +YL+ MK  AD L +A +P  E+ L  ++L GLD EY+P+   I A+E
Subjt:  MLPTIACDILNHKTSREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKE

Query:  SITWQEVYSTLI-----------------------------------------NQETRSQ--NHGYDQSRSQRNNGRGRGRGGRNFYPRGNSKPTCEVCG
          TWQE+Y TL+                                         NQ+  +Q  N   ++   +   GR RGRGGRN     NS+PTC+VCG
Subjt:  SITWQEVYSTLI-----------------------------------------NQETRSQ--NHGYDQSRSQRNNGRGRGRGGRNFYPRGNSKPTCEVCG

Query:  KMGHTVEKCYHRLNMRYMGSAP--KEGEHSAAAYIATPDIINDPSWLLDSGA
        K GH+   CY R +  YMGS P      +S + ++ATP+ ++D +W  DSGA
Subjt:  KMGHTVEKCYHRLNMRYMGSAP--KEGEHSAAAYIATPDIINDPSWLLDSGA

XP_022142770.1 uncharacterized protein LOC111012809 [Momordica charantia] (e value: 7.5e-57; identity: 53.81%)
Query:  MLPTIACDILNHKTSREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKE
        M P IACD+L+  TSR+VWKALE+LY   +KARI QL+ +LQTTRKNQ KMSDYLS+MKQ+AD L LA EP+S SSL++S L GL+ EY+ + CQINAKE
Subjt:  MLPTIACDILNHKTSREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKE

Query:  SITWQEVYSTLINQE-------------------------TRSQN---HGYDQSRSQRNNGRGRGRGGRNFYPRGN-SKPTCEVCGKMGHTVEKCYHRLN
        +I+WQEV++TLI  E                         + SQN   H   Q R Q  N RGR RGGR    R N S+PTC+VCGK+GH    CYHRLN
Subjt:  SITWQEVYSTLINQE-------------------------TRSQN---HGYDQSRSQRNNGRGRGRGGRNFYPRGN-SKPTCEVCGKMGHTVEKCYHRLN

Query:  MRYMGSAPKEGEHSAAAYIATPDIINDPSWLLDSGA
        M+YMG+ P+ G  +  AYI  P++I DP+WL+DSGA
Subjt:  MRYMGSAPKEGEHSAAAYIATPDIINDPSWLLDSGA

XP_022157748.1 uncharacterized protein LOC111024384 isoform X1 [Momordica charantia] (e value: 2.9e-69; identity: 43.88%)
Query:  TPFVNPLSIVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQSSGIENVGS--IANPAYETWMSTDQSLLAWLYGSMLPTIACDILNHKT
        T F +PL  VLTVKLD++NY LWR MVL VLRGQK DGYVLGT+A+P + + S   E        NP Y  W + DQ+LL WL+GSM P+IACD+++ ++
Subjt:  TPFVNPLSIVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQSSGIENVGS--IANPAYETWMSTDQSLLAWLYGSMLPTIACDILNHKT

Query:  SREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKESITWQEVYSTLI--
        SREVWKALE+LYG   KARI QL+  LQ T+KN  KMS+YL  MKQ ++ L LA EPV+ + L++  L GL+ EY+P+ CQI  K+S +WQE+++TL+  
Subjt:  SREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKESITWQEVYSTLI--

Query:  ----------------------------------------NQETRSQNHGYDQSRSQRNNGRGRGRGGRNFYPRGNSKPTCEVCGKMGHTVEKCYHRLNM
                                                +Q  + Q  G   S   +NN RGRGRG  + Y   NSKP+C++CGK GH    CY R + 
Subjt:  ----------------------------------------NQETRSQNHGYDQSRSQRNNGRGRGRGGRNFYPRGNSKPTCEVCGKMGHTVEKCYHRLNM

Query:  RYMGSAPKEGEHSAAAYIATPDIINDPSWLLDSGA
         +  +      +  +AY+A P+I+ +PSWL DSGA
Subjt:  RYMGSAPKEGEHSAAAYIATPDIINDPSWLLDSGA

XP_022157750.1 uncharacterized protein LOC111024384 isoform X2 [Momordica charantia] (e value: 2.9e-69; identity: 43.88%)
Query:  TPFVNPLSIVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQSSGIENVGS--IANPAYETWMSTDQSLLAWLYGSMLPTIACDILNHKT
        T F +PL  VLTVKLD++NY LWR MVL VLRGQK DGYVLGT+A+P + + S   E        NP Y  W + DQ+LL WL+GSM P+IACD+++ ++
Subjt:  TPFVNPLSIVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQSSGIENVGS--IANPAYETWMSTDQSLLAWLYGSMLPTIACDILNHKT

Query:  SREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKESITWQEVYSTLI--
        SREVWKALE+LYG   KARI QL+  LQ T+KN  KMS+YL  MKQ ++ L LA EPV+ + L++  L GL+ EY+P+ CQI  K+S +WQE+++TL+  
Subjt:  SREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKESITWQEVYSTLI--

Query:  ----------------------------------------NQETRSQNHGYDQSRSQRNNGRGRGRGGRNFYPRGNSKPTCEVCGKMGHTVEKCYHRLNM
                                                +Q  + Q  G   S   +NN RGRGRG  + Y   NSKP+C++CGK GH    CY R + 
Subjt:  ----------------------------------------NQETRSQNHGYDQSRSQRNNGRGRGRGGRNFYPRGNSKPTCEVCGKMGHTVEKCYHRLNM

Query:  RYMGSAPKEGEHSAAAYIATPDIINDPSWLLDSGA
         +  +      +  +AY+A P+I+ +PSWL DSGA
Subjt:  RYMGSAPKEGEHSAAAYIATPDIINDPSWLLDSGA
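
The middle line of each alignment block can be used to recount identities. The sketch below assumes the usual BLAST-style protein midline convention (a letter marks an identical residue, `+` marks a conservative substitution, a space marks a mismatch or gap) and assumes the midline has kept its full width, which scraped text may not guarantee. The function names are hypothetical, and the sample is only the first ten columns of one hit, so its result is not comparable to the whole-alignment percentages listed above.

```python
def midline_stats(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for a BLAST-style protein midline."""
    identities = sum(ch.isalpha() for ch in midline)
    # Positives include identities plus '+' (conservative substitutions).
    positives = identities + midline.count("+")
    return identities, positives


def percent_identity(query: str, midline: str) -> float:
    """Identities divided by aligned length (gap columns included)."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same width")
    identities, _ = midline_stats(midline)
    return 100.0 * identities / len(query)


# First ten alignment columns of the XP_022157750.1 hit.
query_part = "TPFVNPLSIV"
midline_part = "T F +PL  V"
print(midline_stats(midline_part))
print(percent_identity(query_part, midline_part))
```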

TrEMBL top hits (e value, % identity, alignment)
A0A5C7HHE9 Uncharacterized protein (e value: 2.0e-55; identity: 35.76%)
Query:  ATTTTTPFVNPLSIVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQS---SGIENVGSIANPAYETWMSTDQSLLAWLYGSMLPTIACD
        ++  ++PF N L+    +KLD +N++LW++MV  +++G +LDG++  T   P E + S    G+ + GS +NP YE W+  DQ L+ WLY SM   +A  
Subjt:  ATTTTTPFVNPLSIVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQS---SGIENVGSIANPAYETWMSTDQSLLAWLYGSMLPTIACD

Query:  ILNHKTSREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKESITWQEVY
        ++   T+  +WKALENL+G   K++   ++ ++QTTRK    M +YL+ MK  AD L +A +P  E+ L  + L GLD EY+P+   I A+E  TWQE+Y
Subjt:  ILNHKTSREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKESITWQEVY

Query:  STLI-----------------------------------------NQETRSQ--NHGYDQSRSQRNNGRGRGRGGRNFYPRGNSKPTCEVCGKMGHTVEK
         TL+                                         NQ+  +Q  N   ++   +   GR RGRGGRN     NS+PTC+VCGK GH+   
Subjt:  STLI-----------------------------------------NQETRSQ--NHGYDQSRSQRNNGRGRGRGGRNFYPRGNSKPTCEVCGKMGHTVEK

Query:  CYHRLNMRYMGSAP--KEGEHSAAAYIATPDIINDPSWLLDSGA
        CY R +  YMGS P      +S + ++ATP+ ++D +W  DSGA
Subjt:  CYHRLNMRYMGSAP--KEGEHSAAAYIATPDIINDPSWLLDSGA

A0A5C7ID32 Uncharacterized protein (e value: 9.8e-55; identity: 34.94%)
Query:  ATTTTTPFVNPLSIVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQS-----------SGIENVGSIANPAYETWMSTDQSLLAWLYGS
        ++  ++PF N L+    +KLD +N++LW++MV  +++G +LDG++  T   P E + S            G+ + GS +NP YE W+  DQ L+ WLY S
Subjt:  ATTTTTPFVNPLSIVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQS-----------SGIENVGSIANPAYETWMSTDQSLLAWLYGS

Query:  MLPTIACDILNHKTSREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKE
        M   +A  ++   T+  +WKALENL+G   K++   ++ ++QTTRK    M +YL+ MK  AD L +A +P  E+ L  ++L GLD EY+P+   I A+E
Subjt:  MLPTIACDILNHKTSREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKE

Query:  SITWQEVYSTLI-----------------------------------------NQETRSQ--NHGYDQSRSQRNNGRGRGRGGRNFYPRGNSKPTCEVCG
          TWQE+Y TL+                                         NQ+  +Q  N   ++   +   GR RGRGGRN     NS+PTC+VCG
Subjt:  SITWQEVYSTLI-----------------------------------------NQETRSQ--NHGYDQSRSQRNNGRGRGRGGRNFYPRGNSKPTCEVCG

Query:  KMGHTVEKCYHRLNMRYMGSAP--KEGEHSAAAYIATPDIINDPSWLLDSGA
        K GH+   CY R +  YMGS P      +S + ++ATP+ ++D +W  DSGA
Subjt:  KMGHTVEKCYHRLNMRYMGSAP--KEGEHSAAAYIATPDIINDPSWLLDSGA

A0A6J1CLV9 uncharacterized protein LOC111012809 (e value: 3.6e-57; identity: 53.81%)
Query:  MLPTIACDILNHKTSREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKE
        M P IACD+L+  TSR+VWKALE+LY   +KARI QL+ +LQTTRKNQ KMSDYLS+MKQ+AD L LA EP+S SSL++S L GL+ EY+ + CQINAKE
Subjt:  MLPTIACDILNHKTSREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKE

Query:  SITWQEVYSTLINQE-------------------------TRSQN---HGYDQSRSQRNNGRGRGRGGRNFYPRGN-SKPTCEVCGKMGHTVEKCYHRLN
        +I+WQEV++TLI  E                         + SQN   H   Q R Q  N RGR RGGR    R N S+PTC+VCGK+GH    CYHRLN
Subjt:  SITWQEVYSTLINQE-------------------------TRSQN---HGYDQSRSQRNNGRGRGRGGRNFYPRGN-SKPTCEVCGKMGHTVEKCYHRLN

Query:  MRYMGSAPKEGEHSAAAYIATPDIINDPSWLLDSGA
        M+YMG+ P+ G  +  AYI  P++I DP+WL+DSGA
Subjt:  MRYMGSAPKEGEHSAAAYIATPDIINDPSWLLDSGA

A0A6J1DTZ7 uncharacterized protein LOC111024384 isoform X2 (e value: 1.4e-69; identity: 43.88%)
Query:  TPFVNPLSIVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQSSGIENVGS--IANPAYETWMSTDQSLLAWLYGSMLPTIACDILNHKT
        T F +PL  VLTVKLD++NY LWR MVL VLRGQK DGYVLGT+A+P + + S   E        NP Y  W + DQ+LL WL+GSM P+IACD+++ ++
Subjt:  TPFVNPLSIVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQSSGIENVGS--IANPAYETWMSTDQSLLAWLYGSMLPTIACDILNHKT

Query:  SREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKESITWQEVYSTLI--
        SREVWKALE+LYG   KARI QL+  LQ T+KN  KMS+YL  MKQ ++ L LA EPV+ + L++  L GL+ EY+P+ CQI  K+S +WQE+++TL+  
Subjt:  SREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKESITWQEVYSTLI--

Query:  ----------------------------------------NQETRSQNHGYDQSRSQRNNGRGRGRGGRNFYPRGNSKPTCEVCGKMGHTVEKCYHRLNM
                                                +Q  + Q  G   S   +NN RGRGRG  + Y   NSKP+C++CGK GH    CY R + 
Subjt:  ----------------------------------------NQETRSQNHGYDQSRSQRNNGRGRGRGGRNFYPRGNSKPTCEVCGKMGHTVEKCYHRLNM

Query:  RYMGSAPKEGEHSAAAYIATPDIINDPSWLLDSGA
         +  +      +  +AY+A P+I+ +PSWL DSGA
Subjt:  RYMGSAPKEGEHSAAAYIATPDIINDPSWLLDSGA

A0A6J1DU77 uncharacterized protein LOC111024384 isoform X1 (e value: 1.4e-69; identity: 43.88%)
Query:  TPFVNPLSIVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQSSGIENVGS--IANPAYETWMSTDQSLLAWLYGSMLPTIACDILNHKT
        T F +PL  VLTVKLD++NY LWR MVL VLRGQK DGYVLGT+A+P + + S   E        NP Y  W + DQ+LL WL+GSM P+IACD+++ ++
Subjt:  TPFVNPLSIVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQSSGIENVGS--IANPAYETWMSTDQSLLAWLYGSMLPTIACDILNHKT

Query:  SREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKESITWQEVYSTLI--
        SREVWKALE+LYG   KARI QL+  LQ T+KN  KMS+YL  MKQ ++ L LA EPV+ + L++  L GL+ EY+P+ CQI  K+S +WQE+++TL+  
Subjt:  SREVWKALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKESITWQEVYSTLI--

Query:  ----------------------------------------NQETRSQNHGYDQSRSQRNNGRGRGRGGRNFYPRGNSKPTCEVCGKMGHTVEKCYHRLNM
                                                +Q  + Q  G   S   +NN RGRGRG  + Y   NSKP+C++CGK GH    CY R + 
Subjt:  ----------------------------------------NQETRSQNHGYDQSRSQRNNGRGRGRGGRNFYPRGNSKPTCEVCGKMGHTVEKCYHRLNM

Query:  RYMGSAPKEGEHSAAAYIATPDIINDPSWLLDSGA
         +  +      +  +AY+A P+I+ +PSWL DSGA
Subjt:  RYMGSAPKEGEHSAAAYIATPDIINDPSWLLDSGA

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value: 3.7e-06; identity: 23.11%)
Query:  IVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQSSGIENVGSIANPAYETWMSTDQSLLAWLYGSMLP-TIACDILNHKTSREVWKALE
        I + + ++E NY  WR + L       + G++ GT+                   N     W   D  +   LYG++ P       +   TSR++W  ++
Subjt:  IVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQSSGIENVGSIANPAYETWMSTDQSLLAWLYGSMLP-TIACDILNHKTSREVWKALE

Query:  NLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKESITWQEVYSTLINQE-------T
        N +     AR  +L   L+T      +++DY   MK++AD L     PV++ +L+   L GL+ ++  +   I  ++     +  +T++ +E        
Subjt:  NLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKESITWQEVYSTLINQE-------T

Query:  RSQNHGYDQSRS---------------QRNNG-----RGRGRGGRNFYPRG
        +      D S S               QR+ G     RGRGRG   F  RG
Subjt:  RSQNHGYDQSRS---------------QRNNG-----RGRGRGGRNFYPRG

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) (e value: 8.9e-08; identity: 22.62%)
Query:  IVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQSSGIENVGSIANPAYETWMSTDQSLLAWLYGSMLPTIACDILN-HKTSREVWKALE
        I +T+ L++ NY +WR +   +     + G++ G+ + P+ + +               + W   D  +  W+YG++  ++   I+    T+R++W +LE
Subjt:  IVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQSSGIENVGSIANPAYETWMSTDQSLLAWLYGSMLPTIACDILN-HKTSREVWKALE

Query:  NLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKESI-TWQEVYSTLINQETRSQNHG
        NL+    +AR  Q +  L+TT  +   + +Y   +K ++D L     P+S+  L+   L GL  +Y  +   I  K    ++ E  S L+ +E+R  N  
Subjt:  NLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKESI-TWQEVYSTLINQETRSQNHG

Query:  -------------------------YDQSRSQRNNGRGRGRGGRNFYPRGNS
                                 Y Q     N+  GRGR  +     G+S
Subjt:  -------------------------YDQSRSQRNNGRGRGRGGRNFYPRGNS


Sequences
CDS sequence
ATGAAAGCAACCACCACCACCACACCTTTTGTCAATCCTCTCAGTATTGTTTTAACAGTAAAATTAGATGAGAGAAACTACTTACTTTGGAGATCCATGGTCCTTGTTGT
TTTACGAGGACAGAAACTAGATGGCTATGTGTTGGGTACAGTTGCCCAACCATCAGAACTTATCCAAAGCTCTGGAATTGAAAATGTTGGTTCCATCGCAAATCCAGCCT
ATGAAACATGGATGTCCACAGATCAGTCACTCTTGGCTTGGTTGTATGGATCCATGTTACCTACCATTGCCTGTGATATTTTAAATCACAAAACATCTCGAGAGGTATGG
AAGGCGCTGGAAAACCTCTATGGAGTGGTGGACAAGGCACGCATTACCCAACTCCAACGCAATCTCCAAACAACCCGAAAAAATCAGCAGAAGATGAGTGACTACCTATC
TTCAATGAAGCAGATCGCAGATGGACTGCCCCTTGCTAGTGAACCCGTAAGTGAAAGTTCACTAATAACAAGCACACTTATGGGTCTAGATGTTGAATACATACCGGTAG
CATGCCAGATAAATGCCAAAGAATCAATCACTTGGCAAGAAGTTTATTCAACACTCATCAATCAAGAGACTAGATCACAGAACCATGGGTATGACCAAAGTAGAAGCCAA
CGAAACAATGGTAGAGGTAGAGGTAGAGGTGGAAGAAATTTCTACCCAAGAGGAAACTCAAAGCCCACATGTGAAGTGTGCGGAAAGATGGGGCACACTGTTGAAAAATG
TTATCATCGACTCAATATGCGCTACATGGGAAGCGCACCCAAAGAAGGTGAACACTCTGCAGCTGCTTACATTGCAACCCCAGACATAATAAATGACCCAAGTTGGCTAC
TAGACAGTGGTGCCATTGTTGGGTTCCCAGAGTGA
mRNA sequence
ATGAAAGCAACCACCACCACCACACCTTTTGTCAATCCTCTCAGTATTGTTTTAACAGTAAAATTAGATGAGAGAAACTACTTACTTTGGAGATCCATGGTCCTTGTTGT
TTTACGAGGACAGAAACTAGATGGCTATGTGTTGGGTACAGTTGCCCAACCATCAGAACTTATCCAAAGCTCTGGAATTGAAAATGTTGGTTCCATCGCAAATCCAGCCT
ATGAAACATGGATGTCCACAGATCAGTCACTCTTGGCTTGGTTGTATGGATCCATGTTACCTACCATTGCCTGTGATATTTTAAATCACAAAACATCTCGAGAGGTATGG
AAGGCGCTGGAAAACCTCTATGGAGTGGTGGACAAGGCACGCATTACCCAACTCCAACGCAATCTCCAAACAACCCGAAAAAATCAGCAGAAGATGAGTGACTACCTATC
TTCAATGAAGCAGATCGCAGATGGACTGCCCCTTGCTAGTGAACCCGTAAGTGAAAGTTCACTAATAACAAGCACACTTATGGGTCTAGATGTTGAATACATACCGGTAG
CATGCCAGATAAATGCCAAAGAATCAATCACTTGGCAAGAAGTTTATTCAACACTCATCAATCAAGAGACTAGATCACAGAACCATGGGTATGACCAAAGTAGAAGCCAA
CGAAACAATGGTAGAGGTAGAGGTAGAGGTGGAAGAAATTTCTACCCAAGAGGAAACTCAAAGCCCACATGTGAAGTGTGCGGAAAGATGGGGCACACTGTTGAAAAATG
TTATCATCGACTCAATATGCGCTACATGGGAAGCGCACCCAAAGAAGGTGAACACTCTGCAGCTGCTTACATTGCAACCCCAGACATAATAAATGACCCAAGTTGGCTAC
TAGACAGTGGTGCCATTGTTGGGTTCCCAGAGTGA
Protein sequence
MKATTTTTPFVNPLSIVLTVKLDERNYLLWRSMVLVVLRGQKLDGYVLGTVAQPSELIQSSGIENVGSIANPAYETWMSTDQSLLAWLYGSMLPTIACDILNHKTSREVW
KALENLYGVVDKARITQLQRNLQTTRKNQQKMSDYLSSMKQIADGLPLASEPVSESSLITSTLMGLDVEYIPVACQINAKESITWQEVYSTLINQETRSQNHGYDQSRSQ
RNNGRGRGRGGRNFYPRGNSKPTCEVCGKMGHTVEKCYHRLNMRYMGSAPKEGEHSAAAYIATPDIINDPSWLLDSGAIVGFPE
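
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code and a CDS that starts in frame; the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the database. The two test strings are the first 33 and last 21 nucleotides of the CDS shown on this page.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# Start and end of the Lag0005395 CDS.
print(translate("ATGAAAGCAACCACCACCACCACACCTTTTGTC"))  # MKATTTTTPFV
print(translate("ATTGTTGGGTTCCCAGAGTGA"))              # IVGFPE
```

Both fragments agree with the corresponding ends of the protein sequence listed above.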