; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022081 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022081
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr7:17696551..17699582
RNA-Seq ExpressionLag0022081
SyntenyLag0022081
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_019172756.1 PREDICTED: uncharacterized protein LOC109168170 [Ipomoea nil]5.2e-6950.15Show/hide
Query:  KQNQASTSTSGTSFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAVWSEVATPDCPPEFP-AVSDKFSIP
        +Q Q+  S SG S DE +K L +++ QFQQETR +IQ LGNQ++QLAT VSK+  ++S KLP+Q EVNPK NVNA+    +     PE P +V ++    
Subjt:  KQNQASTSTSGTSFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAVWSEVATPDCPPEFP-AVSDKFSIP

Query:  FGKFSSLNLNDK------NSFSPVNFIDPTTPSVQVD----------LNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKEEV--SENVSALL
          + S    N+K      +SF   +   P  PS              L  FRKV++NIPL EAI +IP YA+FLK     K K  SKE +   +NVSA++
Subjt:  FGKFSSLNLNDK------NSFSPVNFIDPTTPSVQVD----------LNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKEEV--SENVSALL

Query:  SDNLPVKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPANFYVLDTPNPDYN
        S++LP KC DPGMFTLPC+IG+ +IK AMLDLGASINVMP S+Y+ LN+  ++ + IV+QLAD+SC+ P+G+VE+VLV+V  LVFP +FYVL   N D  
Subjt:  SDNLPVKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPANFYVLDTPNPDYN

Query:  PSSATVLLGRLFMKTAKTIIDVDKGVVSVGF
          S  +LLGR FMKTAKT IDVDKG++SV F
Subjt:  PSSATVLLGRLFMKTAKTIIDVDKGVVSVGF

XP_019178790.1 PREDICTED: uncharacterized protein LOC109173896 [Ipomoea nil]1.9e-6648.33Show/hide
Query:  QNQASTSTSGTSFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAVW---SEVATPDCPPEFPAVSDKFSI
        Q+Q   S SG S DE +K L +++ QFQ+ETR +IQ LGNQI+QLAT VSK+  ++S KL +Q +VNPK NVNA+     +   P+ P       ++ S 
Subjt:  QNQASTSTSGTSFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAVW---SEVATPDCPPEFPAVSDKFSI

Query:  PFGKFSSLNLNDKNSFSPVNFIDPTTPSV-------------QVDLNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKEEV--SENVSALLSD
           + ++   +  +S S     D + P                  L  FRKV++NIPL EAI +IP YA+FLK     K+K  SKE +   +NVSA++S+
Subjt:  PFGKFSSLNLNDKNSFSPVNFIDPTTPSV-------------QVDLNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKEEV--SENVSALLSD

Query:  NLPVKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPANFYVLDTPNPDYNPS
        +LP KC DPGMFTLPC+IG+ +IK AMLDLGASINVMP S+Y+ LN+  ++ + IV+QLAD+SC+ P+G+VE VLV+V  LVFP +FYVL   N D    
Subjt:  NLPVKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPANFYVLDTPNPDYNPS

Query:  SATVLLGRLFMKTAKTIIDVDKGVVSVGF
        S  +LLGR FMKTAKT IDVDKG++SV F
Subjt:  SATVLLGRLFMKTAKTIIDVDKGVVSVGF

XP_019198426.1 PREDICTED: uncharacterized protein LOC109192308 [Ipomoea nil]7.6e-6848.94Show/hide
Query:  KQNQASTSTSGTSFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAVWSEVATPDCPPEFP-AVSDKFSIP
        +Q+Q   S SG S DE +K L +++ QFQ+ETR +IQ LGNQI+QLAT VSK+  ++S KLP+Q EVNPK NVNA+    +     PE P +V ++    
Subjt:  KQNQASTSTSGTSFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAVWSEVATPDCPPEFP-AVSDKFSIP

Query:  FGKFSSLNLNDKNSFSPVN-----------FIDPTTPSVQVD-----LNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKEEV--SENVSALL
          + S    N+K+  S  +           F+     S + +     L  FRKV++NIPL EAI +IP YA+FLK     K K  SKE +   +NVSA++
Subjt:  FGKFSSLNLNDKNSFSPVN-----------FIDPTTPSVQVD-----LNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKEEV--SENVSALL

Query:  SDNLPVKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPANFYVLDTPNPDYN
        S++LP KC DPGMFTLPC+IG+ +IK AMLDLGASINVMP  +Y+ LN+  ++ + IV+QLAD+SC+ P+G+VE+VLV+V  LVFP +FYVL   N D  
Subjt:  SDNLPVKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPANFYVLDTPNPDYN

Query:  PSSATVLLGRLFMKTAKTIIDVDKGVVSVGF
          S  +LLGR FMKTAKT +DVDKG++SV F
Subjt:  PSSATVLLGRLFMKTAKTIIDVDKGVVSVGF

XP_021733269.1 uncharacterized protein LOC110700074 isoform X2 [Chenopodium quinoa]2.6e-6042.22Show/hide
Query:  KQNQASTSTSGT-SFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAV-----------------------
        +  Q S S SG+ S ++ +K + D++L FQQETR+ ++NL NQ++QLA  V K+  ++S+KLP+QPE NPK N +A+                       
Subjt:  KQNQASTSTSGT-SFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAV-----------------------

Query:  WSEVATPDCPPEFPAVSDKFSIPFGKFSS-----LNLNDKNSFSPVNFIDPTTPSVQVDLNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKE
          E   P+   +   V D  S    +F+S       L+ KN               +  L++FRKV+VNIPL EAI ++P YA+FLK     K + S  E
Subjt:  WSEVATPDCPPEFPAVSDKFSIPFGKFSS-----LNLNDKNSFSPVNFIDPTTPSVQVDLNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKE

Query:  E--VSENVSALLSDNLPVKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPAN
        +  VSENVSA+L   +P KC DPG+FT+PC IG+ +    MLDLGASINVMP+ +YK LNL  +K + ++IQLAD+S   P+G+VE+VLV+V +LVFPA+
Subjt:  E--VSENVSALLSDNLPVKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPAN

Query:  FYVLDTPNPDYNPSSATVLLGRLFMKTAKTIIDVDKGVVSVGFRIKPYTLKFFMTLKNPN
        FYVLD PN D +PS+A +LLGR F+KT++T IDV  GV+++ F  +      F  ++ P+
Subjt:  FYVLDTPNPDYNPSSATVLLGRLFMKTAKTIIDVDKGVVSVGFRIKPYTLKFFMTLKNPN

XP_031108596.1 uncharacterized protein LOC116013076 [Ipomoea triloba]4.9e-6748.48Show/hide
Query:  KQNQASTSTSGTSFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAVWSEVATPDCP-PEFPAVSDKFSIP
        +Q+Q     SG S DE  K L +++ QFQQETR  IQ LGNQI+QLAT VSK+  ++  KLP+Q EVNPK NV+A+         P    P   +K    
Subjt:  KQNQASTSTSGTSFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAVWSEVATPDCP-PEFPAVSDKFSIP

Query:  FGKFSSL-----NLNDKNSFSPVNFIDPTTPSVQVD----------LNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKEEV--SENVSALLS
          +  S+      ++ ++S    N   P  PS              L  FRKV++NIPL EAI +IP YA+FLK     K K  SKE++   +NVSA++S
Subjt:  FGKFSSL-----NLNDKNSFSPVNFIDPTTPSVQVD----------LNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKEEV--SENVSALLS

Query:  DNLPVKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPANFYVLDTPNPDYNP
        ++LP KC DPGMFTLPC+IG+ KIK AMLDLGASINVMP S+Y+ LN+  ++ + +V+QLAD+SC+ P+G++E+VLV+V  LVFP +FYVL   N D   
Subjt:  DNLPVKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPANFYVLDTPNPDYNP

Query:  SSATVLLGRLFMKTAKTIIDVDKGVVSVGF
         S  +LLGR FMKTAKT IDVDKG++SV F
Subjt:  SSATVLLGRLFMKTAKTIIDVDKGVVSVGF

TrEMBL top hitse value%identityAlignment
A0A6P6WXZ3 uncharacterized protein LOC1137355114.5e-5842.33Show/hide
Query:  QASTSTSGTSFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAVWSEVATPDCPPEFPAVSDKFS---IPF
        Q S   SG S ++ +K L  ++ QFQQET+ +I NL NQ++QLA+ V+++ ++ S KLP+Q  VNPK N +A+     T     E P  S K S   I  
Subjt:  QASTSTSGTSFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAVWSEVATPDCPPEFPAVSDKFS---IPF

Query:  GKFSSLNLNDKNSFSPVNFIDPTTPSV-------------------QVDLNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKEEV--SENVSA
              N++   +    +F + +T  V                   Q  L+ FRKV+VNIPL +AI +IP YA+FLK    GK+K    E+V   ENVSA
Subjt:  GKFSSLNLNDKNSFSPVNFIDPTTPSV-------------------QVDLNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKEEV--SENVSA

Query:  LLSDNLPVKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPANFYVLDTPNPD
        +L   LP KC DPGMFT+PC IG+ KI+ AMLDLGASINVMP S+Y  +N+  +K +G++IQL D+S   P G++E++LV++ +L+FPA+FYVLD    D
Subjt:  LLSDNLPVKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPANFYVLDTPNPD

Query:  YNPSSATVLLGRLFMKTAKTIIDVDKGVVSVGFRIKPYTLKFFMTLKNPNAS
         N +S+ +LLGR F+KTA+T IDV  G +++ F         + T+K P+ S
Subjt:  YNPSSATVLLGRLFMKTAKTIIDVDKGVVSVGFRIKPYTLKFFMTLKNPNAS

A0A6P8D4X0 Reverse transcriptase2.0e-5843.73Show/hide
Query:  STSGTSFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAVW---SEVATPDCPPEFPAVSDKFSIPFGKFS
        S S TS ++ +K L  S+L FQ+E R +IQNL NQ++QLAT VSK+   +  KLP+Q E+NP+ N +AV     +V  P  P +  A  DK  I  G+  
Subjt:  STSGTSFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAVW---SEVATPDCPPEFPAVSDKFSIPFGKFS

Query:  SLNLNDKNSFSPVN------FIDPTTPS----------VQVDLNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKEEVS--ENVSALLSDNLP
             D+     +N       I P  PS           Q  L  FRKV+VNIPL +AI ++P YA+FLK     K K S  E++S  EN SA++   LP
Subjt:  SLNLNDKNSFSPVN------FIDPTTPS----------VQVDLNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKEEVS--ENVSALLSDNLP

Query:  VKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPANFYVLDTPNPDYNPSSAT
         KC DPGMFT+PC IG+ +I+ AMLDLGASINVMP+S+Y+ LNL  +K + I+IQLAD+S   P G++E+VL++V + +FP +FY++D     +N +S+ 
Subjt:  VKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPANFYVLDTPNPDYNPSSAT

Query:  VLLGRLFMKTAKTIIDVDKGVVSVGFRIKPYTLKFFMTLKNPN
        +LLGR FMKTA+T IDV  G +S+ F  +  T   F  +++P+
Subjt:  VLLGRLFMKTAKTIIDVDKGVVSVGFRIKPYTLKFFMTLKNPN

A0A6P8DCT8 Reverse transcriptase2.0e-5843.73Show/hide
Query:  STSGTSFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAVW---SEVATPDCPPEFPAVSDKFSIPFGKFS
        S S TS ++ +K L  S+L FQ+E R +IQNL NQ++QLAT VSK+   +  KLP+Q E+NP+ N +AV     +V  P  P +  A  DK  I  G+  
Subjt:  STSGTSFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAVW---SEVATPDCPPEFPAVSDKFSIPFGKFS

Query:  SLNLNDKNSFSPVN------FIDPTTPS----------VQVDLNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKEEVS--ENVSALLSDNLP
             D+     +N       I P  PS           Q  L  FRKV+VNIPL +AI ++P YA+FLK     K K S  E++S  EN SA++   LP
Subjt:  SLNLNDKNSFSPVN------FIDPTTPS----------VQVDLNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKEEVS--ENVSALLSDNLP

Query:  VKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPANFYVLDTPNPDYNPSSAT
         KC DPGMFT+PC IG+ +I+ AMLDLGASINVMP+S+Y+ LNL  +K + I+IQLAD+S   P G++E+VL++V + +FP +FY++D     +N +S+ 
Subjt:  VKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPANFYVLDTPNPDYNPSSAT

Query:  VLLGRLFMKTAKTIIDVDKGVVSVGFRIKPYTLKFFMTLKNPN
        +LLGR FMKTA+T IDV  G +S+ F  +  T   F  +++P+
Subjt:  VLLGRLFMKTAKTIIDVDKGVVSVGFRIKPYTLKFFMTLKNPN

A0A6P8DJV3 Reverse transcriptase2.0e-5843.73Show/hide
Query:  STSGTSFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAVW---SEVATPDCPPEFPAVSDKFSIPFGKFS
        S S TS ++ +K L  S+L FQ+E R +IQNL NQ++QLAT VSK+   +  KLP+Q E+NP+ N +AV     +V  P  P +  A  DK  I  G+  
Subjt:  STSGTSFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAVW---SEVATPDCPPEFPAVSDKFSIPFGKFS

Query:  SLNLNDKNSFSPVN------FIDPTTPS----------VQVDLNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKEEVS--ENVSALLSDNLP
             D+     +N       I P  PS           Q  L  FRKV+VNIPL +AI ++P YA+FLK     K K S  E++S  EN SA++   LP
Subjt:  SLNLNDKNSFSPVN------FIDPTTPS----------VQVDLNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKEEVS--ENVSALLSDNLP

Query:  VKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPANFYVLDTPNPDYNPSSAT
         KC DPGMFT+PC IG+ +I+ AMLDLGASINVMP+S+Y+ LNL  +K + I+IQLAD+S   P G++E+VL++V + +FP +FY++D     +N +S+ 
Subjt:  VKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPANFYVLDTPNPDYNPSSAT

Query:  VLLGRLFMKTAKTIIDVDKGVVSVGFRIKPYTLKFFMTLKNPN
        +LLGR FMKTA+T IDV  G +S+ F  +  T   F  +++P+
Subjt:  VLLGRLFMKTAKTIIDVDKGVVSVGFRIKPYTLKFFMTLKNPN

A0A6P8E7C5 Reverse transcriptase2.0e-5843.73Show/hide
Query:  STSGTSFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAVW---SEVATPDCPPEFPAVSDKFSIPFGKFS
        S S TS ++ +K L  S+L FQ+E R +IQNL NQ++QLAT VSK+   +  KLP+Q E+NP+ N +AV     +V  P  P +  A  DK  I  G+  
Subjt:  STSGTSFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAVW---SEVATPDCPPEFPAVSDKFSIPFGKFS

Query:  SLNLNDKNSFSPVN------FIDPTTPS----------VQVDLNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKEEVS--ENVSALLSDNLP
             D+     +N       I P  PS           Q  L  FRKV+VNIPL +AI ++P YA+FLK     K K S  E++S  EN SA++   LP
Subjt:  SLNLNDKNSFSPVN------FIDPTTPS----------VQVDLNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKEEVS--ENVSALLSDNLP

Query:  VKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPANFYVLDTPNPDYNPSSAT
         KC DPGMFT+PC IG+ +I+ AMLDLGASINVMP+S+Y+ LNL  +K + I+IQLAD+S   P G++E+VL++V + +FP +FY++D     +N +S+ 
Subjt:  VKCSDPGMFTLPCMIGDCKIKYAMLDLGASINVMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPANFYVLDTPNPDYNPSSAT

Query:  VLLGRLFMKTAKTIIDVDKGVVSVGFRIKPYTLKFFMTLKNPN
        +LLGR FMKTA+T IDV  G +S+ F  +  T   F  +++P+
Subjt:  VLLGRLFMKTAKTIIDVDKGVVSVGFRIKPYTLKFFMTLKNPN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCAAGCAAAATCAGGCTTCCACCTCAACTTCAGGCACATCCTTCGATGAGGCACTTAAGAAACTTACTGACAGCTCCTTACAATTCCAGCAAGAGACCAGGAACAC
CATTCAGAATTTGGGAAACCAAATCACTCAACTAGCAACCCAAGTCAGTAAGATGAACAACAAAAGCTCTAGTAAACTGCCTGCCCAACCTGAGGTAAATCCTAAGGCAA
ATGTGAATGCAGTGTGGAGTGAAGTGGCCACACCTGACTGCCCTCCTGAATTCCCCGCTGTCTCTGATAAGTTTTCTATACCTTTTGGAAAATTTTCTTCTTTAAATTTA
AATGATAAAAATTCTTTTTCACCTGTGAACTTTATTGACCCTACTACTCCAAGTGTGCAGGTTGATTTAAATGTTTTTAGAAAAGTTAAGGTGAATATTCCTTTGCCTGA
GGCAATCCTGAAAATCCCAGCCTATGCTGAGTTCTTAAAGACTTGGTATGAAGGTAAGGAAAAATCCTCAAGTAAAGAAGAGGTTAGTGAAAATGTTAGTGCTTTACTTT
CTGATAACTTACCTGTTAAATGTTCTGATCCTGGCATGTTTACTTTACCGTGTATGATCGGAGATTGCAAAATAAAATATGCCATGTTAGATCTAGGAGCATCGATTAAT
GTTATGCCTGTGTCTGTTTATAAAAGATTAAATTTAACTGATATTAAAACATCTGGCATAGTCATTCAACTAGCAGATAAGTCTTGCATCAAACCTCTAGGAATAGTTGA
AAACGTTTTAGTTAGGGTGAAGGATTTGGTTTTTCCTGCTAATTTCTATGTTTTAGATACTCCTAACCCTGATTACAATCCATCTTCTGCTACAGTACTTCTTGGTCGTC
TCTTTATGAAGACTGCCAAGACTATTATTGATGTAGATAAAGGTGTTGTATCAGTAGGATTTAGGATAAAACCTTATACTTTAAAATTTTTTATGACATTGAAAAATCCC
AATGCATCTTCTATTGAATATTATATGATTGAACCTGAACCAGTGAATGATCGCACTTTAAATTATAAATCATGCATGGTCACCAATAATGAGATGCATTCTTCTACTGT
TAGATGCCCTGACTTAGAAGAAGAGTTAATTGATGTAGTTGATGAACACCAGATAACCCTTACTGATAAATCTAGTGGAGTCACCAAGTGTTCATCTTCTAGGATTAGGG
TTGAGCGCATTTTCTGTCAGCCTACCTCTGCTTCCGATATTTTTTCTACTTCTACCGATTCTTCCCTACAGGCCCCGAAATTACTGAAACAAGAAATCTCTAAAAACAAT
GTGTTTGGACCTGTGGAGAGAAACCTCCCTAAAAAAAATAGAGGAGAGCGGGGAGCGAATTACTTAGCTCATATTTCTGTTGTCACCCCAATAGGTGTAAGGTTAATTGA
GCTTAGGTCGTGTCAGGAGCCAATCACGCTAATTGGTGCTAATTCGGGCCAATTACAGAGTTTTTGGAGCCATCTCGATGTCTTGAATGGAGAGGAGGTGAAGAGACCCA
ATTGTGCTGAAGGAGGCCTGAATTATGACACCAAAAGTCAAAGATTGAAGCCCCGAACCGAGAAGAAGCGAGACCAAGCCGAGAAAAGGAGCATGAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCAAGCAAAATCAGGCTTCCACCTCAACTTCAGGCACATCCTTCGATGAGGCACTTAAGAAACTTACTGACAGCTCCTTACAATTCCAGCAAGAGACCAGGAACAC
CATTCAGAATTTGGGAAACCAAATCACTCAACTAGCAACCCAAGTCAGTAAGATGAACAACAAAAGCTCTAGTAAACTGCCTGCCCAACCTGAGGTAAATCCTAAGGCAA
ATGTGAATGCAGTGTGGAGTGAAGTGGCCACACCTGACTGCCCTCCTGAATTCCCCGCTGTCTCTGATAAGTTTTCTATACCTTTTGGAAAATTTTCTTCTTTAAATTTA
AATGATAAAAATTCTTTTTCACCTGTGAACTTTATTGACCCTACTACTCCAAGTGTGCAGGTTGATTTAAATGTTTTTAGAAAAGTTAAGGTGAATATTCCTTTGCCTGA
GGCAATCCTGAAAATCCCAGCCTATGCTGAGTTCTTAAAGACTTGGTATGAAGGTAAGGAAAAATCCTCAAGTAAAGAAGAGGTTAGTGAAAATGTTAGTGCTTTACTTT
CTGATAACTTACCTGTTAAATGTTCTGATCCTGGCATGTTTACTTTACCGTGTATGATCGGAGATTGCAAAATAAAATATGCCATGTTAGATCTAGGAGCATCGATTAAT
GTTATGCCTGTGTCTGTTTATAAAAGATTAAATTTAACTGATATTAAAACATCTGGCATAGTCATTCAACTAGCAGATAAGTCTTGCATCAAACCTCTAGGAATAGTTGA
AAACGTTTTAGTTAGGGTGAAGGATTTGGTTTTTCCTGCTAATTTCTATGTTTTAGATACTCCTAACCCTGATTACAATCCATCTTCTGCTACAGTACTTCTTGGTCGTC
TCTTTATGAAGACTGCCAAGACTATTATTGATGTAGATAAAGGTGTTGTATCAGTAGGATTTAGGATAAAACCTTATACTTTAAAATTTTTTATGACATTGAAAAATCCC
AATGCATCTTCTATTGAATATTATATGATTGAACCTGAACCAGTGAATGATCGCACTTTAAATTATAAATCATGCATGGTCACCAATAATGAGATGCATTCTTCTACTGT
TAGATGCCCTGACTTAGAAGAAGAGTTAATTGATGTAGTTGATGAACACCAGATAACCCTTACTGATAAATCTAGTGGAGTCACCAAGTGTTCATCTTCTAGGATTAGGG
TTGAGCGCATTTTCTGTCAGCCTACCTCTGCTTCCGATATTTTTTCTACTTCTACCGATTCTTCCCTACAGGCCCCGAAATTACTGAAACAAGAAATCTCTAAAAACAAT
GTGTTTGGACCTGTGGAGAGAAACCTCCCTAAAAAAAATAGAGGAGAGCGGGGAGCGAATTACTTAGCTCATATTTCTGTTGTCACCCCAATAGGTGTAAGGTTAATTGA
GCTTAGGTCGTGTCAGGAGCCAATCACGCTAATTGGTGCTAATTCGGGCCAATTACAGAGTTTTTGGAGCCATCTCGATGTCTTGAATGGAGAGGAGGTGAAGAGACCCA
ATTGTGCTGAAGGAGGCCTGAATTATGACACCAAAAGTCAAAGATTGAAGCCCCGAACCGAGAAGAAGCGAGACCAAGCCGAGAAAAGGAGCATGAAGTAG
Protein sequenceShow/hide protein sequence
MFKQNQASTSTSGTSFDEALKKLTDSSLQFQQETRNTIQNLGNQITQLATQVSKMNNKSSSKLPAQPEVNPKANVNAVWSEVATPDCPPEFPAVSDKFSIPFGKFSSLNL
NDKNSFSPVNFIDPTTPSVQVDLNVFRKVKVNIPLPEAILKIPAYAEFLKTWYEGKEKSSSKEEVSENVSALLSDNLPVKCSDPGMFTLPCMIGDCKIKYAMLDLGASIN
VMPVSVYKRLNLTDIKTSGIVIQLADKSCIKPLGIVENVLVRVKDLVFPANFYVLDTPNPDYNPSSATVLLGRLFMKTAKTIIDVDKGVVSVGFRIKPYTLKFFMTLKNP
NASSIEYYMIEPEPVNDRTLNYKSCMVTNNEMHSSTVRCPDLEEELIDVVDEHQITLTDKSSGVTKCSSSRIRVERIFCQPTSASDIFSTSTDSSLQAPKLLKQEISKNN
VFGPVERNLPKKNRGERGANYLAHISVVTPIGVRLIELRSCQEPITLIGANSGQLQSFWSHLDVLNGEEVKRPNCAEGGLNYDTKSQRLKPRTEKKRDQAEKRSMK